Gene GWCH70_2661 details


Gene Information

Locus tag: GWCH70_2661
Symbol: thrS
ID: 7978320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start (bp): 2691249
End (bp): 2693183
Gene length: 1935 bp
Protein length: 644 aa
Translation table: 11
GC content: 44%
IMG OID: 644799462
Product: threonyl-tRNA synthetase
Protein accession: YP_002950621
Protein GI: 239827997
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
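
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not contribute a residue. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end must not precede start")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2691249, 2693183)
print(length_bp)                  # 1935
print(protein_length(length_bp))  # 644
```

Both values agree with the gene length and protein length fields listed above.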


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000386283
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGAAA TGATTCGCAT TACATTCCCT GATGGAGCGG TAAAGGAGTT TCCAAAAGGA 
ACAACGACAG AACAAATCGC TGCATCGATT AGCCCGGGAT TAAAGAAAAA AGCGATTGCC
GGCAAACTAA ACGATCGTTT TATTGATTTG CGCACACCGA TTCAGGAAGA TGGATCGATT
TCGATCATTA CGCAAGATAT GCCAGAAGCG CTGGACATTT TGCGCCATAG TACTGCCCAT
TTAATGGCCC AAGCGATTAA GCGTCTGTAT AAAAATGTAA AACTTGGCGT CGGTCCGGTC
ATTGAAAATG GTTTCTATTA CGATATCGAT ATGGAAGAAT CATTAACGCC GGAAGACTTG
CCGAAAATTG AACAAGAAAT GCGGAAAATT GTGAAAGAAA ACTTGGAAAT TGTCCGGAAA
GAAGTGAGCC GCGAAGAAGC GATTCGCCTT TACGAAGAAA TTGGCGATGA TTTAAAACTG
GAATTAATTA ACGATATTCC AGAAGGAGAA ACGATCTCCA TTTACGAACA AGGCGAATTT
TTTGACCTTT GCCGCGGTGT CCACGTTCCA TCGACAGGAA AAATTAAAGA GTTTAAGTTG
TTGAACATCT CAGGAGCGTA CTGGCGCGGA GACAGCAATA ATAAAATGCT GCAGCGCATT
TACGGAACGG CGTTCTTCAA AAAAGAAGAT TTAGATGAAT ATCTTCGCCA GTTGCAAGAA
GCAAAAGAAC GCGATCATCG CAAATTAGGA AAAGAGCTTG AATTGTTTAT GACTTCGCAA
AAAGTCGGAC AAGGGCTGCC GCTTTGGCTG CCAAAAGGGG CAACGATTCG CCGCATTATC
GAGCGGTATA TTGTGGACAA AGAAATTGAA TTAGGTTATC AACATGTTTA TACACCAGTG
CTTGGTAGTG TCGAATTATA TAAAACTTCC GGCCACTGGG ATCATTACAA AGACAACATG
TTCCCGCCGA TGGAAATGGA CAATGAACAG CTTGTGCTGC GCCCAATGAA CTGTCCGCAT
CATATGATGA TTTATAAAAG CAAAATCCAT AGCTATCGGG AGCTTCCGAT TCGTATCGCA
GAGCTCGGCA CGATGCATCG CTACGAAATG TCCGGAGCGC TTTCCGGCTT GCAGCGCGTC
CGCGGCATGA CATTAAACGA CGCTCACATT TTTGTGCGTC CAGACCAAAT TAAAGATGAG
TTCAAACGCG TCGTTAACTT AATTTTAGAA GTATACAAAG ACTTTGGCTT GGATGAATAT
TCGTTCCGGC TTTCTTACCG CGATCCACAT GATAAAGAAA AATATTACGA TGATGATGAA
ATGTGGGAAA AAGCGCAAAA CATGCTGCGT GAAGCAATGG ATGAATTGGG ATTAGAGTAT
TATGAAGCCG AAGGGGAAGC GGCGTTTTAT GGTCCGAAAT TAGACGTGCA AGTGCGCACA
GCGCTTGGAA AAGACGAAAC ATTGTCAACG GTGCAGCTTG ATTTCTTATT GCCGGAACGC
TTTGATTTAA CATACATCGG CGAAGACGGC AAACCGCATC GCCCGGTTGT CATCCATCGC
GGGGTTGTTT CCACGATGGA ACGTTTCGTT GCGTTTCTGA TTGAAGAATA TAAAGGCGCG
TTCCCAACTT GGCTTGCCCC AGTGCAAGTC GAAGTGATCC CTGTGTCGCC AGCAGCGCAT
CTCGACTATG CGTATAAAGT GAAAGAAGCG TTGCAATCGC AAGGATTCCG CGTCGAAGTC
GACGAACGCG ATGAAAAAAT TGGCTACAAA ATTCGTGAAG CGCAAATTCA AAAAATCCCT
TACATGCTTG TCGTTGGTGA CAAAGAAATG GCAGAAAATG CCGTCAACGT CCGTAAATAC
GGCGAACAAA AAAGCGAAAC GATGTCTCTC GACGATTTTA TTGCCGCTCT GAAAGCGGAA
GTGCGTCGAA ACTAG
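
The gene sequence is displayed in blocks of 10 bases, six blocks per line. A minimal sketch of how such a listing could be joined into one contiguous string and its GC fraction computed is shown below. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed fraction describes that line and not the whole gene.

```python
def join_blocks(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGTCGGAAA TGATTCGCAT TACATTCCCT GATGGAGCGG TAAAGGAGTT TCCAAAAGGA"
seq = join_blocks(first_line)
print(len(seq))                   # 60
print(round(gc_fraction(seq), 3))
```

Running the same two functions over all of the sequence lines would give a value to compare against the GC content field of the record.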
 
Protein sequence
MSEMIRITFP DGAVKEFPKG TTTEQIAASI SPGLKKKAIA GKLNDRFIDL RTPIQEDGSI 
SIITQDMPEA LDILRHSTAH LMAQAIKRLY KNVKLGVGPV IENGFYYDID MEESLTPEDL
PKIEQEMRKI VKENLEIVRK EVSREEAIRL YEEIGDDLKL ELINDIPEGE TISIYEQGEF
FDLCRGVHVP STGKIKEFKL LNISGAYWRG DSNNKMLQRI YGTAFFKKED LDEYLRQLQE
AKERDHRKLG KELELFMTSQ KVGQGLPLWL PKGATIRRII ERYIVDKEIE LGYQHVYTPV
LGSVELYKTS GHWDHYKDNM FPPMEMDNEQ LVLRPMNCPH HMMIYKSKIH SYRELPIRIA
ELGTMHRYEM SGALSGLQRV RGMTLNDAHI FVRPDQIKDE FKRVVNLILE VYKDFGLDEY
SFRLSYRDPH DKEKYYDDDE MWEKAQNMLR EAMDELGLEY YEAEGEAAFY GPKLDVQVRT
ALGKDETLST VQLDFLLPER FDLTYIGEDG KPHRPVVIHR GVVSTMERFV AFLIEEYKGA
FPTWLAPVQV EVIPVSPAAH LDYAYKVKEA LQSQGFRVEV DERDEKIGYK IREAQIQKIP
YMLVVGDKEM AENAVNVRKY GEQKSETMSL DDFIAALKAE VRRN
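
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is an illustrative translator, not the database's own tooling. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence from the listing above.
print(translate("ATGTCGGAAA TGATTCGCAT TACATTCCCT GATGGAGCGG TAAAGGAGTT TCCAAAAGGA"))
# MSEMIRITFPDGAVKEFPKG
print(translate("GTGCGTCGAA ACTAG"))
# VRRN
```

The two outputs match the first 20 residues and the last 4 residues of the protein sequence shown above.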