| analyte_class | Allowable Value | Analytes are the target molecules being measured with the assay. | ChromatinDNADNA + RNAEndogenous fluorophoresFluorochromeLipidMetaboliteNucleic acid and proteinPeptidePolysaccharideProteinRNA  | True | 
      
        | acquisition_instrument_vendor | Allowable Value | An acquisition instrument is the device that contains the signal detection hardware and signal processing software. Assays generate signals such as light of various intensities or color or signals representing the molecular mass. | Akoya BiosciencesAndorBGI GenomicsBrukerCytivaEvident Scientific (Olympus)GE HealthcareHamamatsuHuron Digital PathologyIlluminaIn-HouseIonpathKeyenceLeica BiosystemsLeica MicrosystemsMoticNanoStringResolve BiosciencesSciexStandard BioTools (Fluidigm)Thermo Fisher ScientificZeiss Microscopy | True | 
      
        | acquisition_instrument_model | Allowable Value | Manufacturers of an acquisition instrument may offer various versions (models) of that instrument with different features or sensitivities. Differences in features or sensitivities may be relevant to processing or interpretation of the data. | Aperio AT2Aperio CS2Axio Observer 3Axio Observer 5Axio Observer 7Axio Scan.Z1BZ-X710BZ-X800BZ-X810CosMx Spatial Molecular ImagerCustom: MultiphotonDigital Spatial ProfilerDM6 BDNBSEQ-T7EVOS M7000HiSeq 2500HiSeq 4000Hyperion Imaging SystemIN Cell Analyzer 2200Lightsheet 7MALDI timsTOF Flex PrototypeMIBIscopeMoticEasyScan OneNanoZoomer 2.0-HTNanoZoomer S210NanoZoomer S360NanoZoomer S60NanoZoomer-SQNextSeq 2000NextSeq 500NextSeq 550NovaSeq 6000NovaSeq XNovaSeq X PlusOrbitrap Eclipse TribridOrbitrap Fusion Lumos TribridPhenocycler-Fusion 1.0Phenocycler-Fusion 2.0PhenoImager FusionQ ExactiveQ Exactive HFQ Exactive UHMRQTRAP 5500Resolve Biosciences Molecular CartographySCN400STELLARIS 5TissueScope LE Slide ScannerUnknownVS200 Slide ScannerXenium AnalyzerZyla 4.2 sCMOS | True | 
      
        | source_storage_duration_value | Numeric | How long was the source material (parent) stored, prior to this sample being processed. |  | True | 
      
        | source_storage_duration_unit | Allowable Value | The time duration unit of measurement | hourmonthdayminuteyear | True | 
      
        | time_since_acquisition_instrument_calibration_value | Numeric | The amount of time since the acqusition instrument was last serviced by the vendor. This provides a metric for assessing drift in data capture. |  | False | 
      
        | time_since_acquisition_instrument_calibration_unit | Allowable Value | The time unit of measurement | Column-by-columnNot applicableRow-by-rowSnake-by-columnsSnake-by-rows | False | 
      
        | contributors_path | Textfield | The path to the file with the ORCID IDs for all contributors of this dataset (e.g., “./extras/contributors.tsv” or “./contributors.tsv”). This is an internal metadata field that is just used for ingest. |  | True | 
      
        | data_path | Textfield | The top level directory containing the raw and/or processed data. For a single dataset upload this might be “.” where as for a data upload containing multiple datasets, this would be the directory name for the respective dataset. For instance, if the data is within a directory called “TEST001-RK” use syntax “./TEST001-RK” for this field. If there are multiple directory levels, use the format “./TEST001-RK/Run1/Pass2” in which “Pass2” is the subdirectory where the single dataset’s data is stored. This is an internal metadata field that is just used for ingest. |  | True | 
      
        | barcode_read | Allowable Value | Which read file contains the cell or capture spot barcode. This should be included when constructing sequencing libraries with a non-commercial kit. This field is required if the source material is barcoded. This field is used to determine which analysis pipeline to run. | Read 2 (R2)Read 1 (R1)Not applicable | True | 
      
        | barcode_size | Allowable Value | Length of the cell or capture spot barcode in base pairs. Cell and capture spot barcodes are, for example, 3 x 8 bp sequences that are spaced by constant sequences, the offsets. This should be included when constructing sequencing libraries with a non-commercial kit. This field is required if the source material is barcoded.  This field is used to determine which analysis pipeline to run. | 1416408,8,88,68,8Not applicable | True | 
      
        | umi_read | Allowable Value | Which read file contains the UMI barcode. This should be included when constructing sequencing libraries with a non-commercial kit. | Read 2 (R2)Read 1 (R1)Not applicable | True | 
      
        | umi_size | Allowable Value | Length of the umi barcode in base pairs. This should be included when constructing sequencing libraries with a non-commercial kit. This field is required if UMI are present. This field is used to determine which analysis pipeline to run. | 89101214Not applicable | True | 
      
        | assay_input_entity | Allowable Value | This is the entity from which the analyte is being captured. For example, for bulk sequencing this would be “tissue”, while it would be “single cell” for single cell sequencing. This field is used to determine which analysis pipeline to run. | area of interestsingle cellsingle nucleusspottissue (bulk) | True | 
      
        | number_of_input_cells_or_nuclei | Numeric | How many cells or nuclei were input to the assay? This is typically not available for preparations working with bulk tissue. |  | False | 
      
        | library_adapter_sequence | Textfield | 5’ and/or 3’ read adapter sequences used as part of the library preparation protocol to render the library compatible with the sequencing protocol and instrumentation. This should be provided as comma-separated list of key:value pairs (adapter name:sequence). |  | True | 
      
        | library_average_fragment_size | Numeric | Average size of sequencing library fragments estimated via gel electrophoresis or bioanalyzer/tapestation. Numeric value in base pairs (bp). |  | True | 
      
        | library_input_amount_value | Numeric | The amount of cDNA, after amplification, that was used for library construction. |  | False | 
      
        | library_input_amount_unit | Allowable Value | unit of library input amount value | ngul | False | 
      
        | library_output_amount_value | Numeric | Total amount (eg. nanograms) of library after the clean-up step of final pcr amplification step. Answer the question: What is the Qubit measured concentration (ng/ul) times the elution volume (ul) after the final clean-up step? |  | False | 
      
        | library_output_amount_unit | Allowable Value | Units of library final yield. | ngul | False | 
      
        | library_concentration_value | Numeric | The concentration value of the pooled library samples submitted for sequencing. |  | True | 
      
        | library_concentration_unit | Allowable Value | Unit of library concentration value. | ng/ulnM | True | 
      
        | library_layout | Allowable Value | Whether the library was generated for single-end or paired end sequencing | paired-endsingle-end | True | 
      
        | number_of_pcr_cycles_for_indexing | Numeric | Number of PCR cycles performed in order to add adapters and amplify the library. This does not include the cDNA amplification which is captured in the “number of iterations of cDNA amplification” field. |  | True | 
      
        | library_preparation_kit | Allowable Value | Reagent kit used for library preparation | 10X Genomics; Automated Library Construction Kit24 rxns; PN 100042810X Genomics; Chromium Next GEM Automated Single Cell 5' Kit v224 rxns; PN 100029010X Genomics; Chromium Next GEM Automated Single Cell 5' Kit v24 rxns; PN 100029810X Genomics; Chromium Next GEM Single Cell 3' GEMLibrary & Gel Bead Kit v3.116 rxns; PN 100012110X Genomics; Chromium Next GEM Single Cell 3' HT Kit v3.148 rxns; PN 100034810X Genomics; Chromium Next GEM Single Cell 3' HT Kit v3.18 rxns; PN 100037010X Genomics; Chromium Next GEM Single Cell 3' Kit v3.116 rxns; PN 100026810X Genomics; Chromium Next GEM Single Cell 3' Kit v3.14 rxns; PN 100026910X Genomics; Chromium Next GEM Single Cell 5' Kit v216 rxns; PN 100026310X Genomics; Chromium Next GEM Single Cell 5' Kit v24 rxns; PN 100026510X Genomics; Chromium Next GEM Single Cell Fixed RNA Hybridization & Library Kit4 rxns; PN 100041510X Genomics; Chromium NextGem Single Cell Multiome ATAC + Gene Expression Reagent Bundle16 rxn; PN 100028310X Genomics; Chromium NextGem Single Cell Multiome ATAC + Gene Expression Reagent Bundle4 rxn; PN 100028510X Genomics; Chromium Single Cell 3' GEMLibrary & Gel Bead Kit v34 rxns PN 100009210X Genomics; Chromium Single Cell 3' Library & Gel Bead Kit4 rxns; PN 12026710X Genomics; Visium CytAssist Spatial Gene Expression for FFPEHuman Transcriptome11 mm2 reactions; PN 100052210X Genomics; Visium CytAssist Spatial Gene Expression for FFPEHuman Transcriptome6.5mm4 reactions; PN 100052010X Genomics; Visium Spatial for FFPE Gene Expression KitHuman Transcriptome1 slides4 reactions; PN 100033810X Genomics; Visium Spatial for FFPE Gene Expression KitMouse Transcriptome4 rxns; PN 100033910X Genomics; Visium Spatial Gene Expression Slide and Reagent Kit1 slides4 reactions; PN 100018710X Genomics; Visium Spatial Gene Expression Slide and Reagent Kit4 slides16 reactions; PN 1000184CustomIllumina; TruSeq Stranded mRNA Library Prep (48 samples); PN 20020594Illumina; TruSeq Stranded mRNA Library Prep (96 samples); PN 20020595New England BioLabs; NEBNext Ultra II RNA Library Prep Kit for Illumina; PN E7770Parse Biosciences; Evercode WT Mini v2 Kit12 rxns; PN ECW02010Parse Biosciences; Evercode WT v2 Kit48 rxns; PN ECW02030) | False | 
      
        | sample_indexing_kit | Allowable Value | Indexes are needed for multiplexing sequencing libraries for simultaneous sequencing (pooling) and proper attachment to the Illumina flowcell. Each indexing kit would have a number of compatible sequences (“sample indexing sets”) that are used to label some number of samples (the number of sets depend on the kit). | 10X Genomics; Chromium i7 Sample Index Plate (96 rxn); PN 22010310X Genomics; Dual Index Kit TSSet A; PN 100025110X Genomics; Dual Index Kit TTSet A (96 rxn); PN 100021510X Genomics; Single Index Kit NSet A (96 rxn); PN 1000212CustomIllumina; IDT for Illumina - TruSeq RNA UD Indexes v2 (96 Indexes96 Samples); PN 20040871Illumina; TruSeq RNA CD Index Plate (96 Indexes96 Samples); PN 20019792Illumina; TruSeq RNA Single Indexes Set A (12 Indexes48 Samples); PN 20020492Illumina; TruSeq RNA Single Indexes Set B (12 Indexes48 Samples); PN 20020493Integrated DNA Technologies: Custom DNA OligosNanoString Technologies; GeoMx Seq Code Pack; PN GMX-NGS-SEQ-ABNanoString Technologies; GeoMx Seq Code Pack; PN GMX-NGS-SEQ-CDNanoString Technologies; GeoMx Seq Code Pack; PN GMX-NGS-SEQ-EFNanoString Technologies; GeoMx Seq Code Pack; PN GMX-NGS-SEQ-GHNot applicableParse Biosciences; Fragmentation Reagents; PN WX100Parse Biosciences; UDI Plate - WT; PN UDI1001 | False | 
      
        | sample_indexing_set | Textfield | The specific sequencing barcode index set used, selected from the sample indexing kit. Example: For 10X this might be “SI-GA-A1”, for Nextera “N505 - CTCCTTAC” |  | False | 
      
        | is_targeted | Allowable Value | Specifies whether or not a specific molecule(s) is/are targeted for detection/measurement by the assay (“Yes” or “No”). The CODEX analyte is protein. | YesNo | True | 
      
        | expected_entity_capture_count | Numeric | Number of cells, nuclei or capture spots expected to be captured by the assay. For Visium this is the total number of spots covered by tissue, within the capture area. |  | False | 
      
        | sequencing_reagent_kit | Allowable Value | Reagent kit used for sequencing | CustomIllumina; HiSeq 3000/4000 PE Cluster Kit PE-410-1001; PN 1000283Illumina; NextSeq 1000/2000 P2 Reagent v3 Kit (100 Cycles); PN 20046811Illumina; NextSeq 1000/2000 P2 Reagent v3 Kit (200 Cycles); PN 20046812Illumina; NextSeq 1000/2000 P2 Reagent v3 Kit (300 Cycles); PN 20046813Illumina; NextSeq 2000 P3 Reagent Kit (300 Cycles); PN 20040561Illumina; NextSeq 2000 P3 Reagents Kit (100 Cycles); PN 20040559Illumina; NextSeq 500/550 Hi Output Kit 150 Cycles; v2.5; PN 20024907Illumina; NextSeq 500/550 Hi Output Kit 75 Cycles v2.5; PN 20024906Illumina; NextSeq 500/550 Mid Output Kit 150 Cycles v2.5; PN 20024904Illumina; NovaSeq 6000 S1 Reagent Kit (200 Cycles); PN 20012864Illumina; NovaSeq 6000 S1 Reagent v1.5 Kit (100 Cycles); PN 20028319Illumina; NovaSeq 6000 S1 Reagent v1.5 Kit (200 Cycles); PN 20028318Illumina; NovaSeq 6000 S1 Reagent v1.5 Kit (300 Cycles); PN 20028317Illumina; NovaSeq 6000 S2 Reagent v1.5 Kit (100 Cycles); PN 20028316Illumina; NovaSeq 6000 S4 Reagent Kit v1.5 (300 cycles); PN 20028312Illumina; NovaSeq 6000 S4 Reagent v1.5 Kit (200 Cycles); PN 20028313Illumina; NovaSeq 6000 SP Reagent v1.5 Kit (100 Cycles); PN 20028401Illumina; NovaSeq X Series 1.5B Reagent Kit (100 Cycle); PN 20104703Illumina; NovaSeq X Series 1.5B Reagent Kit (200 Cycle); PN 20104704Illumina; NovaSeq X Series 1.5B Reagent Kit (300 Cycle); PN 20104705Illumina; NovaSeq X Series 10B Reagent Kit (100 Cycle); PN 20085596Illumina; NovaSeq X Series 10B Reagent Kit (200 Cycle); PN 20085595Illumina; NovaSeq X Series 10B Reagent Kit (300 Cycle); PN 20085594 | True | 
      
        | sequencing_read_format | Textfield | Number of sequencing cycles in each round of sequencing (i.e., Read1, i7 index, i5 index, and Read2). This is reported as a comma-delimited list. Example: For 10X snATAC-seq (R1,Index,R2,R3) this might be: 50,8,16,50. For SNARE-seq2 this might be: 75,94,8,75 |  | True | 
      
        | sequencing_batch_id | Textfield | The ID for the sequencing run. This could, for example, be the chip ID and should allow users the ability to determine which samples were processed together in a sequencing run. It is recommended that data providers prefix the ID with the center name, to prevent values overlapping across centers. |  | False | 
      
        | capture_batch_id | Textfield | A lab-generated ID to identify which cells were captured at the same time. This would, for example, be an ID to denote which datasets were derived from a single 10X Genomics Chromium Controller run. In the case of the 10X Controller this could be the chip ID and would allow users the ability to determine which samples were processed together in a Chromium controller. It is recommended that data providers prefix the ID with the center name, to prevent values overlapping across centers. |  | False | 
      
        | preparation_instrument_vendor | Allowable Value | The manufacturer of the instrument used to prepare (staining/processing) the sample for the assay. If an automatic slide staining method was indicated this field should list the manufacturer of the instrument. | 10x GenomicsHamamatsuHTX TechnologiesIn-HouseLeica BiosystemsNot applicableRoche DiagnosticsSunChromThermo Fisher Scientific | False | 
      
        | preparation_instrument_model | Allowable Value | Manufacturers of a staining system instrument may offer various versions (models) of that instrument with different features. Differences in features or sensitivities may be relevant to processing or interpretation of the data. | AutoStainer XLChromium ConnectChromium ControllerChromium iXChromium XDiscovery UltraEVOS M7000M3+ SprayerM5 SprayerNanoZoomer S210NanoZoomer S360NanoZoomer S60Not applicableST5020 MultistainerSublimatorSunCollect SprayerTM-SprayerVisium CytAssist | False | 
      
        | metadata_schema_id | Textfield | The string that serves as the definitive identifier for the metadata schema version and is readily interpretable by computers for data validation and processing. Example: 22bc762a-5020-419d-b170-24253ed9e8d9 |  | True | 
      
        | amount_of_input_analyte_value | Numeric | The amount of RNA or DNA input to the assay, typically measured by a Qubit, BioAnalyzer, or TapeStation. In most single cell/nuclei assays, this value isn’t available. |  | False | 
      
        | number_of_iterations_of_cdna_amplification | Numeric | This is the amplification of the cDNA prior to library construction. This is typically a PCR amplification, while for linear amplification methods like aRNA this would be the number of rounds of aRNA. |  | True | 
      
        | preparation_instrument_kit | Allowable Value | The reagent kit used with the preparation instrument. | 10X Genomics; Chromium Next GEM Chip G Single Cell Kit16 rxns; PN 100012710X Genomics; Chromium Next GEM Chip G Single Cell Kit48 rxns; PN 100012010X Genomics; Chromium Next GEM Chip K Automated Single Cell Kit48 rxns; PN 100028910X Genomics; Chromium Next GEM Chip K Single Cell Kit16 rxns; PN 100028710X Genomics; Chromium Next GEM Chip K Single Cell Kit48 rxns; PN 100028610X Genomics; Chromium Next GEM Chip Q Single Cell Kit16 rxns; PN 100042210X Genomics; Chromium NextGem Single Cell Multiome ATAC + Gene Expression Reagent Bundle16 rxn; PN 100028310X Genomics; Chromium NextGem Single Cell Multiome ATAC + Gene Expression Reagent Bundle4 rxn; PN 100028510X Genomics; Visium FFPE Reagent Kit v2-SmallPN 1000436Custom | False | 
      
        | preparation_protocol_doi | Textfield | DOI for the protocols.io page that describes the assay or sample procurment and preparation. For example for an imaging assay, the protocol might include staining of a section through the creation of an OME-TIFF file. In this case the protocol would include any image processing steps required to create the OME-TIFF file. Example: https://dx.doi.org/10.17504/protocols.io.eq2lyno9qvx9/v1 |  | True | 
      
        | umi_offset | Allowable Value | Position in the read at which the UMI barcode starts. This should be included when constructing sequencing libraries with a non-commercial kit. | 0163436Not applicable | True | 
      
        | parent_sample_id | Textfield | Unique SenNet or SenNet identifier of the sample (i.e., block, section or suspension) used to perform this assay. For example, for a RNAseq assay, the parent would be the suspension, whereas, for one of the imaging assays, the parent would be the tissue section. If an assay comes from multiple parent samples then this should be a comma separated list. Example: HBM386.ZGKG.235, HBM672.MKPK.442 or SNT232.UBHJ.322, SNT329.ALSK.102 |  | True | 
      
        | barcode_offset | Allowable Value | Positions in the read at which the cell or capture spot barcodes start. Cell and capture spot barcodes are, for example, 3 x 8 bp sequences that are spaced by constant sequences (the offsets). First barcode at position 0, then 38, then 76. This should be included when constructing sequencing libraries with a non-commercial kit. | 08201,270,38,7610,48,86Not applicable | True | 
      
        | is_custom_probes_used | Allowable Value | State (“Yes” or “No”) whether custom RNA or antibody probes were used. If custom probes were used, they must be listed in the “custom_probe_set.csv” file. | YesNo | True | 
      
        | probe_hybridization_time_value | Numeric | How long was the oligo-conjugated RNA or oligo-conjugated antibody probes hybridized with the sample? |  | True | 
      
        | probe_hybridization_time_unit | Allowable Value | The units for probe hybridization time value. | HourMinute | True | 
      
        | oligo_probe_panel | Allowable Value | This is the probe panel used to target genes and/or proteins. In cases where there is a core panel and add-on modules, the core panel should be selected here. If additional panels are used, then they must be included in the “additional_panels_used.csv” file that’s uploaded with the dataset. | 10x Genomics; Chromium Fixed RNA KitHuman Transcriptome4 rxns x 1 BC; PN 100047410X Genomics; Chromium Next GEM Single Cell Fixed RNA Human Transcriptome Probe Kit16 rxns; PN 100042010X Genomics; Chromium Next GEM Single Cell Fixed RNA Human Transcriptome Probe Kit64 rxns; PN 100045610x Genomics; Visium Human Transcriptome Probe Kit v2 - Small; PN 100046610x Genomics; Visium Human Transcriptome Probe Kit-Large; PN 100036410x Genomics; Visium Human Transcriptome Probe Kit-Small; PN 100036310x Genomics; Visium Mouse Transcriptome Probe Kit - Small; PN 1000365CustomNanoString Technologies; GeoMx Human Whole Transcriptome Atlas4 slides; PN GMX-RNA-NGS-HuWTA-4NanoString Technologies; GeoMx Mouse Whole Transcriptome Atlas4 slides; PN GMX-RNA-NGS-MsWTA-4 | True | 
      
        | is_custom_probes_used | Allowable Value | State (“Yes” or “No”) whether custom RNA or antibody probes were used. If custom probes were used, they must be listed in the “custom_probe_set.csv” file. | YesNo | True | 
      
        | dataset_type | Allowable Value | The specific type of dataset being produced. | 10X Multiome2D Imaging Mass CytometryATACseqAuto-fluorescenceCell DIVECODEXConfocalCosMxCyCIFDBiTDESIEnhanced Stimulated Raman Spectroscopy (SRS)GeoMx (nCounter)GeoMx (NGS)HiFi-SlideHistologyLC-MSLight SheetMALDIMERFISHMIBIMolecular CartographyMUSICnanoSPLITSPhenoCyclerResolveRNAseqRNAseq (with probes)Second Harmonic Generation (SHG)SIMSSNARE-seq2Stereo-seqThick section Multiphoton MxIFVisium (no probes)Visium (with probes)Xenium | True | 
      
        | amount_of_input_analyte_unit | Allowable Value | Units of amount of entity input to assay value | ugng | False |