Skip to contents

The MGI report MRK_GeneTrap.rpt provides associations between MGI genetic markers and gene trap cell lines. For more details about gene trapping, see MGI’s Gene Trap Questions and Answers.

To read this report using the key "gene_trap_ids", use the following code:

# To read all records (~ 17,000), use `read_report("gene_trap_ids")`.
(assoc_to_gene_trap_ids <- read_report(report_key = "gene_trap_ids", n_max = 150L))
## # A tibble: 150 × 8
##    marker_status marker_type marker_id   marker_symbol marker_name    chromosome
##    <fct>         <fct>       <chr>       <chr>         <chr>          <fct>     
##  1 O             Gene        MGI:3698435 0610009E02Rik RIKEN cDNA 06… 2         
##  2 O             Gene        MGI:1914088 0610009L18Rik RIKEN cDNA 06… 11        
##  3 O             Gene        MGI:1915647 0610039H22Rik RIKEN cDNA 06… 11        
##  4 O             Gene        MGI:1914922 0610040B10Rik RIKEN cDNA 06… 5         
##  5 O             Gene        MGI:1922644 0610040F04Rik RIKEN cDNA 06… 6         
##  6 O             Gene        MGI:1923511 0610040J01Rik RIKEN cDNA 06… 5         
##  7 O             Gene        MGI:1915650 0610043K17Rik RIKEN cDNA 06… 4         
##  8 O             Gene        MGI:1915738 1110002J07Rik RIKEN cDNA 11… 10        
##  9 O             Gene        MGI:1915821 1110002L01Rik RIKEN cDNA 11… 12        
## 10 O             Gene        MGI:1929274 1110004F10Rik RIKEN cDNA 11… 7         
## # ℹ 140 more rows
## # ℹ 2 more variables: genetic_map_pos <dbl>, cell_line_id <list>

Gene Trap Cell Lines

The list-column cell_line_id provides associated gene trap cell lines. To unnest cell_line_id use:

assoc_to_gene_trap_ids |>
  dplyr::select("marker_id", "marker_symbol","cell_line_id") |>
  dplyr::filter(marker_id == "MGI:3698435") |>
  tidyr::unnest("cell_line_id")
## # A tibble: 9 × 3
##   marker_id   marker_symbol cell_line_id
##   <chr>       <chr>         <chr>       
## 1 MGI:3698435 0610009E02Rik EUCJ0179g02 
## 2 MGI:3698435 0610009E02Rik IST11122B3  
## 3 MGI:3698435 0610009E02Rik IST12077C2  
## 4 MGI:3698435 0610009E02Rik IST12343D6  
## 5 MGI:3698435 0610009E02Rik IST15046F3  
## 6 MGI:3698435 0610009E02Rik OST106532   
## 7 MGI:3698435 0610009E02Rik OST106540   
## 8 MGI:3698435 0610009E02Rik OST206174   
## 9 MGI:3698435 0610009E02Rik OST412537

Variables

marker_status

marker_status: genetic marker status is a factor of two levels: 'O' for official, and 'W' for withdrawn. Official indicates a currently in-use genetic marker, whereas withdrawn means that the symbol or name was once approved but has since been replaced.

marker_id

marker_id: MGI accession identifier. A unique alphanumeric character string that is used to unambiguously identify a particular record in the Mouse Genome Informatics database. The format is MGI:nnnnnn, where n is a digit.

marker_symbol

marker_symbol: marker symbol is a unique abbreviation of the marker name.

marker_name

marker_name: marker name is a word or phrase that uniquely identifies the genetic marker, e.g. a gene or allele name.

chromosome

chromosome: mouse chromosome name. Possible values are names for the autosomal, sexual or mitochondrial chromosomes.

genetic_map_pos

genetic_map_pos: genetic map position in centiMorgan (cM): a unit of length in a genetic map. Two loci are 1 cM apart if recombination is detected between them in 1% of meioses.

cell_line_id

cell_line_id: a list-column of unique identifier(s) assigned to each gene trap cell line. In gene trapping experiments, this ID is crucial for tracking and cataloging the specific embryonic stem (ES) cell lines that have undergone genome-wide insertional mutations. Each cell_line_id corresponds to a mutant cell line in which a gene has been disrupted by the integration of a selection gene, potentially altering gene expression or protein production. The identifier facilitates the retrieval of information regarding the specific site of integration, which can be further characterized by cloning, cDNA extension, or direct sequencing of the insertion site.