Gene PICST_31908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31908 
SymbolMSY1 
ID4839467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp188251 
End bp189909 
Gene Length1659 bp 
Protein Length552 aa 
Translation table12 
GC content45% 
IMG OID640390782 
Producttyrosyl-tRNA synthetase 
Protein accessionXP_001384692 
Protein GI150865466 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.483177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.847897 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAGGG GAAGAATCGG CGTATTTGTC CAGTGTCGTT CTTTGAGCAG ACTTAGGGCG 
ACCGCAAGGC CCGTTTCTTT TGGCAAACTG TTGAGGTTCA ACTCTTCCAC TATCAAATCA
GATGTGAATA TTGAACAACT GAAAAATAGT GCTAGCCAGC CAGCTTCTTC AAATACAGAG
AAGATAGAGG CTCCTGTTGA AGAAGACACT CCTGATACCA TCACCATTGT GCCTGCTCTT
AAGGAATTGA CCGGCTCAGC AAATTACCAG CCCGATTTAG ATATGCCTCT TGTTGGCCAT
TTGCAATCAC GGTATCTTGT AGAATCGATA ACCGACGATG CTTTGTTCGA CTTGACTCTG
CCTGAAAGCA CCAAGAAGTT CAAATTGTAC TGTGGAGCCG ATCCAACTGC CGAATCGCTT
CATTTGGGTA ATCTCTTGCC ATTGATGGTT CTTCTTCATT TCAACTTGCG TGGCCACGAT
GTAGTAGGAC TTGTAGGAGG AGCTACTGGT GCTGTAGGAG ATCCCAGTGG CAGAACTACA
GAAAGATCCC AGATTGAAGA TAAAGAGAGA GAAGATAATG TATCCAAGAT CCAGAAACAA
TTGGTGACGT TTTTGGAGAA TGGTGTAGCC TATGCCAAAT CGCGTAACTA TCCTATAGCA
GGAGAAGGAA GGATTTTAAC GGCTAATAAC GCCAGCTGGT GGCTGTCGAT AGGCATGTTA
GAGTTCCTTG CCAAGTATGG CAGACATATC CGTGTATCTT CAATGCTTGC GCGTGATTCA
ATTCAGTCTA GATTGAAGGA CCAACATGGT TTAGGATTCA ACGAGTTCAC CTACCAGATC
TTGCAAGCCT ACGACTTTTG GCACATGTTC CGGGAAGATG GGGTCAACAT GCAGATAGGT
GGCAATGACC AGTGGGGTAA TATTACCGCT GGTATCGACT TGATCTCACG GTTACAGAGA
CATTTTGGAA AAGAAGGTGT AGAGCCACAA AGTGCCTATG GTATGACTGT GCCGTTGTTG
ACTTCTCCCA CGGGGGAAAA GTTTGGCAAA TCGGCTGGGA ATGCTGTTTT CATCGATGAA
AAGTACACCA CGCCGTACCA GATGTACCAA TACTTCATCA ACAGTCCAGA CGATATGGTT
GCAAAGTTGC TTAAAACGCT TACATTGTTG CCATTGAGTA TTATAGACGG TTATATCTTG
CCCAAACACG AATCCGATCC TGGTTTGAGA ATTGCCCAGC GTATCTTGGC TCGTGAAGTT
GTGGACTTGA TCCATGGTGA GGGTGTCGGT GAAGAGATGG CCTACATCAC CAGTTTCTTA
TTTCCTACAC CCGATCAGCC ATTCAACGAT ACTGTATCTG CAGATCGGTT GATCCAGAAT
TTTAGGAGAT CGGGCATTTT GGTGAACTTG AAGTTTTCGG AAATCGAGAA CATCGACGAC
TTACGTATGA GCAGCTTGTT AGCCCAGATC ACCAACAAGT CCCGTAGAGA AGTGAAGCAG
TTGATCAAGT CAGGAGGAAT CTACATGGGT TTGGAGAGAG ATCAGTTTGA GGATCCCGAA
GATGTAGTAT TGTTTGACCG TGATAACCAC TTGATCGACG GCAAGTTACT TCTTGTCAGA
GTGGGCAAGC AGAATTATTA TGTTGTTGAG TTCAGTTAA
 
Protein sequence
MLRGRIGVFV QCRSLSRLRA TARPVSFGKS LRFNSSTIKS DVNIEQSKNS ASQPASSNTE 
KIEAPVEEDT PDTITIVPAL KELTGSANYQ PDLDMPLVGH LQSRYLVESI TDDALFDLTS
PESTKKFKLY CGADPTAESL HLGNLLPLMV LLHFNLRGHD VVGLVGGATG AVGDPSGRTT
ERSQIEDKER EDNVSKIQKQ LVTFLENGVA YAKSRNYPIA GEGRILTANN ASWWSSIGML
EFLAKYGRHI RVSSMLARDS IQSRLKDQHG LGFNEFTYQI LQAYDFWHMF REDGVNMQIG
GNDQWGNITA GIDLISRLQR HFGKEGVEPQ SAYGMTVPLL TSPTGEKFGK SAGNAVFIDE
KYTTPYQMYQ YFINSPDDMV AKLLKTLTLL PLSIIDGYIL PKHESDPGLR IAQRILAREV
VDLIHGEGVG EEMAYITSFL FPTPDQPFND TVSADRLIQN FRRSGILVNL KFSEIENIDD
LRMSSLLAQI TNKSRREVKQ LIKSGGIYMG LERDQFEDPE DVVLFDRDNH LIDGKLLLVR
VGKQNYYVVE FS