Wahl, M.B.* ; Caldwell, R.B.* ; Kierzek, A.M.* ; Arakawa, H.* ; Eyras, E.* ; Hubner, N.* ; Jung, C.* ; Soeldenwagner, M.* ; Cervelli, M.* ; Wang, Y.-D.* ; Liebscher, V.*
Evaluation of the chicken transcriptome by SAGE of B cells and the DT40 cell line.
BMC Genomics 5:98 (2004)
BACKGROUND: The understanding of whole genome sequences in higher eukaryotes depends to a large degree on the reliable definition of transcription units, including exon/intron structures, translated open reading frames (ORFs) and flanking untranslated regions. The best currently available chicken transcript catalog is the Ensembl build, which is based on mappings of a relatively small number of full-length cDNAs and ESTs to the genome as well as on in silico gene predictions derived from the genome sequence.
RESULTS: We use Long Serial Analysis of Gene Expression (LongSAGE) in bursal lymphocytes and the DT40 cell line to verify the quality and completeness of the annotated transcripts. Of the more than 38,000 unique SAGE tags (unitags), 53.6% match full-length bursal cDNAs, the Ensembl transcript build or the genome sequence. The majority of matching unitags show single matches to the genome but no matches to the genome-derived Ensembl transcript build. Nevertheless, most of these tags map close to the 3' boundaries of annotated Ensembl transcripts.
CONCLUSIONS: These results suggest that rather few genes are missing from the current Ensembl chicken transcript build, but that the 3' ends of many transcripts may not have been accurately predicted. The tags with no match in the transcript sequences can now be used to improve gene predictions, pinpoint the genomic location of entirely missed transcripts and optimize the accuracy of gene finder software.
Publication type
Article: Journal article
Document type
Scientific article
Keywords
GENE-EXPRESSION; SERIAL ANALYSIS; IDENTIFICATION; DISCOVERY; FABRICIUS; GENOME; BURSA
Language
English
Publication year
2004
ISSN (print) / ISBN
1471-2164
e-ISSN
1471-2164
Source details
Volume: 5
Article number: 98
Publisher
BioMed Central
Review status
Peer reviewed
PSP element(s)
G-500400-001
FE 73823
Date of record entry
2004-12-31