PanHunter Glossary page
-
Alignment The process of matching sequencing reads to a reference genome to determine their origin.
-
Annotation Information about genomic features (e.g., genes, exons) used to interpret sequencing data.
-
Barcode (sequencing) A short DNA sequence used to label and identify reads from individual samples or cells.
-
Biotype A classification of genes based on their function or characteristics (e.g., protein-coding, rRNA, pseudogene).
-
CDS (Coding Sequence) The portion of a gene that is translated into a protein.
-
Coverage The number of sequencing reads that align to a specific region of a gene or genome.
-
Deduplication The process of removing duplicate reads that originate from the same molecule, often using UMIs.
-
Gene body The full length of a gene from the start (5′ end) to the end (3′ end).
-
Genomic features Functional regions of the genome such as exons, introns, and regulatory regions.
-
Introns Non-coding regions within genes that are removed during RNA processing.
-
Library preparation The experimental process of preparing RNA or DNA samples for sequencing.
-
Mapping rate The percentage of reads that successfully align to the reference genome.
-
Mitochondrial genes Genes located in the mitochondrial genome, often used as indicators of cell quality or stress.
-
Normalization Adjusting data to account for differences (e.g., sequencing depth) so samples can be compared fairly.
-
Outlier A sample or value that differs significantly from others in the dataset.
-
Pseudogene A DNA sequence similar to a gene but typically non-functional.
-
Read (sequencing read) A short DNA or RNA sequence obtained from sequencing.
-
RNA degradation Breakdown of RNA molecules, which can affect data quality and lead to biased results.
-
Spike-in transcripts Artificial RNA sequences added to samples as controls to monitor experimental performance.
-
Transcript An RNA copy of a gene produced during gene expression.
-
UMI (Unique Molecular Identifier) A short sequence attached to reads to distinguish original molecules and remove duplicates.
-
UTR (Untranslated Region) Regions of a gene (5′ or 3′) that are not translated into protein but play regulatory roles.