Gene Cthe_2096 details


Gene Information

Locus tag: Cthe_2096
Symbol:
ID: 4810956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2491872
End bp: 2493836
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 45%
IMG OID: 640107503
Product: methionyl-tRNA synthetase
Protein accession: YP_001038496
Protein GI: 125974586
COG category: [J] Translation, ribosomal structure and biogenesis; [R] General function prediction only
COG ID: [COG0073] EMAP domain; [COG0143] Methionyl-tRNA synthetase
TIGRFAM ID: [TIGR00398] methionyl-tRNA synthetase; [TIGR00399] methionyl-tRNA synthetase C-terminal region/beta chain
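
The start, end, gene length, and protein length values in this record are consistent with each other. The short Python sketch below reproduces the arithmetic. It assumes the coordinates are 1-based and inclusive, and that the coding sequence includes its terminal stop codon; the record states neither convention explicitly.

```python
# Values copied from the gene record.
start_bp = 2491872
end_bp = 2493836

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1965 654"
```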


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAGAA AGACTTTTTA CATTACAACG CCCATTTATT ACCCGAGCGA CAAGTTACAT 
ATAGGCCACT CATACACTAC TGTTGCGGCG GATGCCGTGG CAAGATACAA AAGGCTGAAA
GGTTACGATG TAATGTTTCT TACGGGAACG GATGAGCATG GACAGAAGAT TGAACGCAAG
GCGAAGGAAA AGGGAGTTAC CCCGAAACAG TATGTGGATG AAATTGTGGC CGGAATCAAG
GAACTGTGGA AGCTTTTGAA AATAACCAAC GACAGGTTTA TCAGAACGAC GGATCCCCAC
CATGAGAAGA CTGTGCAGAA GATATTTAAA AAGCTCTACG ACCAGGGAGA TATTTACAAG
AGTGAATATG AAGGCTGGTA TTGCACTCCT TGTGAGTCTT TCTGGACGAA GACACAGCTG
GTTGACGGAA AATGTCCCGA CTGCGGAAGG GAAGTGGAAC TTACAAAAGA GGAGAGCTAT
TTCTTCAGGC TTTCCAAATA TCAGGACCGG CTGATAAAAC ATATCGAGGA GAATCCGGAT
TTTATACAGC CTGTTTCCAG ACAGAACGAG ATGCTGAACA ATTTCTTAAG GCCGGGGCTT
GAAGACCTTT GCGTGTCCAG AACCACCTTT GACTGGGGAA TACCGGTGTC CTTTGATGAC
AAGCACGTGG TATATGTTTG GATTGACGCC CTTTCCAACT ATATTACCGC CCTTAACTAC
ATGTCCGAGG ACGATTCGGA TTACCGCAAG TACTGGCCTG CGGATGTTCA TCTTGTGGGA
AAGGAAATAG TGCGTTTCCA CACCATCATA TGGCCTGCAA TGCTCATGGC TTTGGGTGAG
CCTTTGCCGA AGCAGGTATT TGGCCATGGC TGGCTCCTTC TTGAGGGCGG AAAAATGTCC
AAGTCCAAGG GAAATGTGGT CGATCCTGTT GTGCTTGTTG AAAAGTACGG TGTTGACGCG
ATAAGGTATT TCCTCTTAAG AGAGGTTCCT TTCGGTTCGG ACGGAGTATT TTCAAATGAA
GCATTGATAA ACAGGATTAA TTCAGACTTG GCAAACGACC TTGGAAATCT TGTCAGCAGG
ACCGTTGCCA TGATTGACAA ATATTTTGGA GGGAAGCTGC CGCAGGAAAG ACAGGCGGGA
GAATTTGACG ACGACCTTAT AAAGACTGTG ACGGATACAC CTCAAAAGGT TGAGGAACTG
CTTGACCGTT TGCAGTTCAG TACGGCGCTT ACTGAAATCT GGAAAGCCAT TTCCAGAACC
AACAAGTATA TTGACGAGAC AATGCCGTGG GCACTGGCAA AAAGTGAAGA AAACAAGGCA
AGACTTGCCG CTGTTTTATA CAATCTTGCG GAAAGTATCA GAATAGTTTC CATACTCATA
CAGCCGTTTA TGCCTGAAAC TCCTGAAAAG ATATGGCATC AGCTGGGTAT AAACGACAAA
AAGTATGTGG AGTGGGAAAC TGCAAAGAAA TGGGGAGTAT ATCCTGAAGG TGCCGCTGTG
AACAAAGGAG AACCCTTGTT CCCGAGAATT GATGTTAAGA AAGAGCTGGA GGAATTGGAA
AAGCTCACTT TGGCTGCGGC TGAAAATAAG GAAAAGCAAT CACCGAAGCA GGAAACGGAG
AAAAAGGATG CGGAAAAGAA TGAATATATC ACCATAGAGG ATTTTGAGAA GCTGGATTTG
AGGGTTGGCA AGGTTCTTGA GGCTCAGAAG GTTGAAAATG CCGACAAGTT GCTGAAACTA
AAGATTGAAG TTGGCAATGA AGTACGCCAG GTGGTGTCCG GTATTGCAAA GTACTACTCT
CCGGAGGAAT TAAAGGGTAA ATACGTTGTG CTGGTGGCAA ACTTAAAGCC GGTAAAACTC
AGGGGAATAG AGTCGCAGGG TATGATTCTT GCCGCTTCGG ATGACAAGGA CCTGGTACTG
GTGACGATTG ACAAAGAGAT AAACAGCGGA ACCAAGGTTC AGTAA
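
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal Python sketch of how the whitespace could be stripped and a GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the usage example covers only the first printed line rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence, used here only as sample input.
first_line = "ATGGATAGAA AGACTTTTTA CATTACAACG CCCATTTATT ACCCGAGCGA CAAGTTACAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence to the same two functions gives a value to compare against the GC content listed in the record.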
 
Protein sequence
MDRKTFYITT PIYYPSDKLH IGHSYTTVAA DAVARYKRLK GYDVMFLTGT DEHGQKIERK 
AKEKGVTPKQ YVDEIVAGIK ELWKLLKITN DRFIRTTDPH HEKTVQKIFK KLYDQGDIYK
SEYEGWYCTP CESFWTKTQL VDGKCPDCGR EVELTKEESY FFRLSKYQDR LIKHIEENPD
FIQPVSRQNE MLNNFLRPGL EDLCVSRTTF DWGIPVSFDD KHVVYVWIDA LSNYITALNY
MSEDDSDYRK YWPADVHLVG KEIVRFHTII WPAMLMALGE PLPKQVFGHG WLLLEGGKMS
KSKGNVVDPV VLVEKYGVDA IRYFLLREVP FGSDGVFSNE ALINRINSDL ANDLGNLVSR
TVAMIDKYFG GKLPQERQAG EFDDDLIKTV TDTPQKVEEL LDRLQFSTAL TEIWKAISRT
NKYIDETMPW ALAKSEENKA RLAAVLYNLA ESIRIVSILI QPFMPETPEK IWHQLGINDK
KYVEWETAKK WGVYPEGAAV NKGEPLFPRI DVKKELEELE KLTLAAAENK EKQSPKQETE
KKDAEKNEYI TIEDFEKLDL RVGKVLEAQK VENADKLLKL KIEVGNEVRQ VVSGIAKYYS
PEELKGKYVV LVANLKPVKL RGIESQGMIL AASDDKDLVL VTIDKEINSG TKVQ
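
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. It builds the codon table from the compact NCBI layout, on the assumption that translation table 11 assigns amino acids the same way as the standard code for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI's compact TCAG ordering.
# Assumption: table 11 matches the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the gene sequence.
print(translate("ATGGATAGAA AGACTTTTTA C"))  # prints "MDRKTFY"
print(translate("AAGGTTCAGTAA"))             # prints "KVQ"
```

Both fragments agree with the start (MDRKTFY) and end (KVQ) of the listed protein sequence.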