Gene PICST_52223 details


Gene Information

Locus tag: PICST_52223
Symbol: TYR1
ID: 4851414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1759252
End bp: 1760601
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table:
GC content: 43%
IMG OID: 640393122
Product: prephenate dehydrogenase
Protein accession: XP_001387975
Protein GI: 126274542
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0287] Prephenate dehydrogenase
TIGRFAM ID:
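
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon for the stop codon.
    return gene_len, gene_len // 3 - 1


# Start bp and End bp as listed in the record.
gene_len, protein_len = feature_lengths(1759252, 1760601)
print(gene_len, protein_len)  # 1350 449
```

Under these assumptions the result matches the listed Gene Length (1350 bp) and Protein Length (449 aa).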


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.135513
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCCG TGTCGAAGTA CACAGCGGCT GAAACAGAGG CTCTCCAAAA ATCCAAGACC 
ATCGGGATCA TTGGGTTGGG AGACATGGGC TATCTCTATG CTAAAAGATT CTCTGAAGCT
GGGTGGAATG TCGTGGGTTG CGATCGTGAA GATCTCTATG AAGAAACAGT ACAAAAGTTT
GCTGATGACA AATTCAAAAT TCTTCGTAAT GGACATTTCG TTTCACGTAT CTCGGATTAT
ATCATCTATT CTGTAGAAGC CGAGAACATC CGGAAAATTA TTTCGATCTA CGGGCCGTCC
ACTAAGTTCG GAGCCATTGT GGGGGGACAG ACCTCATGTA AAGAGCCTGA AATTGCTGCA
TTTGAAGACA TTTTGCCGAA AGATGTGAAA ATCATCTCAG TCCACTCATT ACATGGTCCT
AAAGTTTCTA CTACTGGGCA GCCTTTGGTA CTTATAAATC ATCGTGGAGA TGACGAAAGC
TTTAGATTCG TAGAATGTTT GGTTTCATGT TTGAACTCAA AAATCGTATA TCTTTCAGCC
AAAGAACACG ACAAAATTAC AGCAGATACT CAAGCTGTGA CTCATGCTGC ATTCTTATCG
ATGGGTGTAG CTTGGATGAA CATCAACCAA TATCCGTGGG TTACACCTCG ATGGATCGGA
GGGTTGGAGA ATGCCAAAAT GAACATATCT TTGCGGATAT TCTCTAACAA ATGGCACGTC
TATGCTGGTT TAGCCATCAC AAACCCTTCT GCTCATGAAC AAGTGTTGCA GTACTCGAGT
TCCACGACAT CTTTGTTCAC TCTTATGATC CAGAAAAAAA AGGACGAGCT CAGAGAAAGA
ATGCTCAAGG CCAAGGAATT TGTGTTTGGC CACATCAACG ACCACGATCT ACTCTTGGAC
GACTACATTT TGCAGAAATT CTCGTTGAGT AAAAATCCAC CTGGTGGGAA GCAGCCCAAT
TCACATCTTT CGTTGCTAGC CATTGTAGAC TCGTGGTTCA CTTTGGGAAT AGTGCCTTAC
GACCATATAA TCTGCTCCAC ACCTTTGTTT AGAATCTTCT TGGGAGTCAC AGAGTACTTA
TTTTGCGCCC CTGGATTGCT AGAGGAATGT ATAGAAGATG CTGTTCATGA CGAATCCTTC
AGACAAGACG ACTTGGAATT CACTATCGCC GCCAGAAACT GGGCCAGTAT TATCTCCTTT
GGAAACTTCG AGTTGTACAA GCGAGAGTTT GAAAAGACGC AGAAATTTTT CCTGTCCAAG
TTTTCTGAGG CTAATGTTAT AGGTAACGAA ATGATCAAAA CCATTCTTGA AAGAGTCCAC
GAACGTGAAG CAGCAAACGA GTCTAAATAA
 
Protein sequence
MTAVSKYTAA ETEALQKSKT IGIIGLGDMG YLYAKRFSEA GWNVVGCDRE DLYEETVQKF 
ADDKFKILRN GHFVSRISDY IIYSVEAENI RKIISIYGPS TKFGAIVGGQ TSCKEPEIAA
FEDILPKDVK IISVHSLHGP KVSTTGQPLV LINHRGDDES FRFVECLVSC LNSKIVYLSA
KEHDKITADT QAVTHAAFLS MGVAWMNINQ YPWVTPRWIG GLENAKMNIS LRIFSNKWHV
YAGLAITNPS AHEQVLQYSS STTSLFTLMI QKKKDELRER MLKAKEFVFG HINDHDLLLD
DYILQKFSLS KNPPGGKQPN SHLSLLAIVD SWFTLGIVPY DHIICSTPLF RIFLGVTEYL
FCAPGLLEEC IEDAVHDESF RQDDLEFTIA ARNWASIISF GNFELYKREF EKTQKFFLSK
FSEANVIGNE MIKTILERVH EREAANESK
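
The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACGGCCG TGTCGAAGTA CACAGCGGCT GAAACAGAGG CTCTCCAAAA ATCCAAGACC"
dna = clean_sequence(first_line)
print(len(dna))           # 60 bases, i.e. 20 codons
print(gc_fraction(dna))   # GC fraction of this line only, not the whole gene
```

Applying `gc_fraction` to all lines of the gene sequence joined together would give a value to compare against the GC content listed in the record.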