Gene PICST_74631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_74631 
SymbolTYS1 
ID4851408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1745010 
End bp1746268 
Gene Length1259 bp 
Protein Length404 aa 
Translation table 
GC content47% 
IMG OID640393116 
Producttyrosyl-tRNA synthetase 
Protein accessionXP_001387562 
Protein GI126274525 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.158279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAAGCCCACA AGCCATGTCT GTCGCCACCG ACCCAGAAGA ACAATACAAG CTCATCACCA 
AGGGCCTTCA GGAGGTCTTA AATGGCCAAA TCATCAAGGA TGTCCTTGAA AAGGAGAAGA
GACCCGTAAA GATCTACTTG GGTACAGCTC CCACTGGGAA GCCCCACTGC GGTTACTTTG
TGCCCATGAT CAAGTTGGCC CATTTCTTGA AGGCTGGATG TGAAGTGACG GTCCTTTTGG
CCGACTTGCA CGCCTACTTA GACAACATGA AGGCTCCATT GGAAGTAGTC CAGTACAGAG
CCAAGTACTA CGAATATGTG ATCAAGGCCA TGTTGAGATC CATCAACGTT CCAATTGACA
AATTAAGATT TGTAGTAGGC TCTGAATACC AGTTGAGCGC ACAGTACACT ATGGATATCT
TCAAGTTGCT GAATGTTGTT TCCCAGAACG ATGCCAAGCG TGCTGGTGCT GATGTCGTCA
AGCAGGTTGC CAACCCATTG TTGTCCGGAT TGATTTACCC ATTGATGCAA GCTCTTGATG
AAGAACATTT GGGTGTTGAT GCCCAGTTTG GAGGTGTTGA CCAGAGAAAG ATTTTTGTGT
TGGCCGAAGA GAACTTGCCT TCCGTAGGCT ACAAGAAGAG AGCTCACTTG ATGAACCCTA
TGGTTCCAGG ATTGGGTCAG GGTGGTAAGA TGTCTGCTTC GGATCCAAAT TCCAAGATTG
ATATTATTGA AGACCCTAAG GTCGTCAAGA AGAAGGTCAA CAGCGCTTAT TGTGCTCCCG
GTGACATCAA AGACAACGGC TTGTTGTCGT TTGTAGAATA CGTAGTCCAA CCCATCCAAG
AATTGTTGGC AGAGCAAGAT GGAGTGTTCA AGTTCGACAT TGACCGTCCG GAAAAGTACG
GTGGTCCAAT CTCGTACACG TCTCTTGACC AGTTGAAAGC AGACTTCGCT TCTGAAAAGT
TGTCGCCAGT CGACTTCAAG GCCGGTGTTG CTGACAAGAT CAACGAGTTG TTGGCTCCTA
TCAAGGCTGA ATTCGATGCC AGCCCTGATT TCCAGGAATA CCAGCAAAAG GGCTACCACC
AGGAACAGCC AAAGGCTGAA AAGAAGACCA AGAAGGTCAA GAACAAGGGT ACCAGATACC
CTGGTGCCGG CAAACCAGAT GGTGCTTCTG CTCCAGAAGC TGAAGCTGAA GCTGTTACTG
CTAAGTTGGA AGAAGCTAAG TTAAATTAGG TATAGATACG TAATAAAGGT AATTTCTAG
 
Protein sequence
MSVATDPEEQ YKLITKGLQE VLNGQIIKDV LEKEKRPVKI YLGTAPTGKP HCGYFVPMIK 
LAHFLKAGCE VTVLLADLHA YLDNMKAPLE VVQYRAKYYE YVIKAMLRSI NVPIDKLRFV
VGSEYQLSAQ YTMDIFKLLN VVSQNDAKRA GADVVKQVAN PLLSGLIYPL MQALDEEHLG
VDAQFGGVDQ RKIFVLAEEN LPSVGYKKRA HLMNPMVPGL GQGGKMSASD PNSKIDIIED
PKVVKKKVNS AYCAPGDIKD NGLLSFVEYV VQPIQELLAE QDGVFKFDID RPEKYGGPIS
YTSLDQLKAD FASEKLSPVD FKAGVADKIN ELLAPIKAEF DASPDFQEYQ QKGYHQEQPK
AEKKTKKVKN KGTRYPGAGK PDGASAPEAE AEAVTAKLEE AKLN