AIRR Community Data Standards

Introduction

One of the core initiatives of the Adaptive Immune Receptor Repertoire (AIRR) Community [Breden_2017] is to develop and maintain standards regarding the content, format and exchange of data and metadata from AIRR sequencing (AIRR-seq) experiments, thereby contributing to the global endeavor to make scientific data FAIR (findable, accessible, interoperable, reusable). The current Standards developed from three initially separate initiatives (Minimal Information, Data Representation, API), which converged over time. Further development of the Standards is carried out by the AIRR Community's Standards Working Group (WG), which was formed by merging the "Data Representation" and "Minimal Standards" WGs with parts of the Common Repository WG.

MiAIRR Minimal Information Standard

The Minimal Information about an Adaptive Immune Receptor Repertoire (MiAIRR) data standard defines around 60 metadata fields which should be provided to facilitate evaluation and re-use of a data set. These fields are grouped according to the stages of a typical workflow (e.g., study, subject, sample, experimental processing, data processing, data annotation). MiAIRR was originally published in 2017 [Rubelt_2017] and has since been integrated into the AIRR Data Schema, which has now become the authoritative reference for MiAIRR. For further information see the MiAIRR section of the AIRR Standards documentation.

AIRR Standardized Data Representation

The standardized data representation was first published in 2018 [Vander_Heiden_2018]. The current version of the schema can be found in this repository as OpenAPI v2 and OpenAPI v3 definitions. The two versions are functionally identical, but users should be aware that support for OpenAPI v2 will be discontinued in the medium term.
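As a rough illustration of what working with the data representation can look like, the sketch below reads a small tab-delimited table of rearrangement records using only the Python standard library. The column names (`sequence_id`, `v_call`, `j_call`, `junction_aa`, `productive`), the `T`/`F` encoding of boolean values, and the example records are assumptions made for this sketch; the schema definitions in this repository remain the authoritative source for field names and types.

```python
import csv
import io

# Hypothetical two-record example; real files carry many more columns.
# Column names and the T/F boolean encoding are assumptions for this sketch.
EXAMPLE_TSV = (
    "sequence_id\tv_call\tj_call\tjunction_aa\tproductive\n"
    "seq1\tIGHV1-2*02\tIGHJ4*02\tCARDYW\tT\n"
    "seq2\tIGHV3-23*01\tIGHJ6*02\t\tF\n"
)


def read_rearrangements(handle):
    """Yield one dict per row, converting the T/F flag to a bool (or None)."""
    for row in csv.DictReader(handle, delimiter="\t"):
        row["productive"] = {"T": True, "F": False}.get(row["productive"])
        yield row


records = list(read_rearrangements(io.StringIO(EXAMPLE_TSV)))

# Keep only the identifiers of records flagged as productive.
productive_ids = [r["sequence_id"] for r in records if r["productive"]]
print(productive_ids)
```

For real data, a dedicated reader that validates records against the schema is preferable to hand-written parsing like this.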

AIRR Data Commons API

The AIRR Data Commons (ADC) API [Christley_2020] provides programmatic access to query and download AIRR-seq data from ADC repositories.
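To give a sense of what a programmatic query might look like, the sketch below builds a JSON request body for a repertoire search without sending it. The base URL is a placeholder, and the `/repertoire` path, the `filters`/`op`/`content` structure, the `from` and `size` parameters, and the `subject.species.id` field are assumptions for this sketch; check them against the ADC API documentation and the target repository before use.

```python
import json

# Placeholder base URL for a hypothetical ADC repository (assumption).
BASE_URL = "https://repository.example.org/airr/v1"


def build_repertoire_query(field, value, op="=", size=10):
    """Build a JSON-serializable body for a repertoire query.

    The filter layout (op / content / field / value) and the paging
    parameters (from / size) are assumptions for this sketch.
    """
    return {
        "filters": {"op": op, "content": {"field": field, "value": value}},
        "from": 0,
        "size": size,
    }


def repertoire_url(base_url=BASE_URL):
    """Return the assumed repertoire endpoint for a repository base URL."""
    return base_url.rstrip("/") + "/repertoire"


# Example: repertoires from human subjects (ontology ID given as an assumption).
query = build_repertoire_query("subject.species.id", "NCBITAXON:9606")
body = json.dumps(query, indent=2)

print(repertoire_url())
print(body)
```

The body would typically be sent as an HTTP POST with a JSON content type. Building the request separately from sending it keeps the query easy to inspect and test.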

Implementations

The AIRR Standards are supported by the following services:

Releases and Issues

Official releases can be found in the release section of the repository. These releases are also permanently archived on Zenodo, where they can be referenced via the archive's DOI [Zenodo_1185414].

Contributions

The AIRR Community is an open initiative and we welcome all contributions. Please review our development process before contributing changes.

References

Copyright 2015 - 2024 by the AIRR Community and contributors. This work is licensed under a Creative Commons Attribution 4.0 International License.

Breden_2017

Breden F et al. Reproducibility and Reuse of Adaptive Immune Receptor Repertoire Data. Front Immunol 8:1418 (2017) [PMID:29163494] [DOI:10.3389/fimmu.2017.01418]

Christley_2020

Christley S et al. The ADC API: A Web API for the Programmatic Query of the AIRR Data Commons. Front Big Data (2020) [PMID:33693395] [DOI:10.3389/fdata.2020.00022]

Lees_2020

Lees W et al. OGRDB: a reference database of inferred immune receptor genes. Nucleic Acids Res 48:D964 (2020) [PMID:31566225] [DOI:10.1093/nar/gkz822]

Lees_2023

Lees W et al. AIRR Community curation and standardised representation for immunoglobulin and T cell receptor germline sets. Immunoinformatics 10:100025 (2023) [PMID:37388275] [DOI:10.1016/j.immuno.2023.100025]

Rubelt_2017

Rubelt F et al. AIRR Community Recommendations for Sharing Immune Repertoire Sequencing Data. Nat Immunol 18:1274 (2017) [PMID:29144493] [DOI:10.1038/ni.3873]

Vander_Heiden_2018

Vander Heiden JA et al. AIRR Community Standardized Representations for Annotated Immune Repertoires. Front Immunol 9:2206 (2018) [PMID:30323809] [DOI:10.3389/fimmu.2018.02206]

Zenodo_1185414

Release archive of the AIRR Standards repository. (2018-2022) [DOI:10.5281/zenodo.1185414]