Gene BAS4472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4472 
SymbolthrS 
ID2851578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4382221 
End bp4384158 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content39% 
IMG OID637507709 
Productthreonyl-tRNA synthetase 
Protein accessionYP_030719 
Protein GI49187467 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.419718 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGATG TAGTTAAAAT TACTTTCCCT GATGGAGCTG TGAAGGAGTT TCCAAAAGGC 
GTAACAACTG AAGAAATCGC AGCTTCTATT AGCCCAGGCT TAAAGAAAAA AGCTGTGGCT
GGAAAATTAA ATGATGAGAT GATTGATCTT GTTACACCAA TCGAAGAAGA TGGTGCAGTT
TCTATCATTA CATTAGATTC TGAAGATGGC TTATATATTT TACGCCACTC AACAGCTCAC
CTTTTAGCGC AAGCGTTAAA ACGTTTATAT AAAGATGTTA AGGTTGAGCT TGGTATTGGT
CCAGTAATTG AAAATGGCTT CTACTACGAT ATTGATATGG AAGAAGCAAT TACAGTTGAA
GACTTCAAGA AAATCGAAAA AGAAATGCAA AAGATTGTGA ACGAGAACTT AGAAATCGTT
CGTCATGAAG TACCACGTGC AGAAGCAATT CGTCGCTTTG AAGAAATCGG CGATGAGTTA
AAATTAGATT TAATTAATGA TCTTCCAGAA GATGCAGTTA TTTCAATTTA TGAGCAAGGC
GAATTCTTCG ACCTTTGCCG TGGTGTTCAC CTTCCATCTA CAGGAAAAAT TAAAGTATTT
AAATTATTAA GCGTTGCGGG TGCTTACTGG CGTGGCGATA GCAATAATAA AATGCTACAA
CGTATTTACG GTACTGCATT TGTTAAGAAA GCAGAATTAG ATGAGCACTT ACGTATGCTT
GAAGAAGCGA AAGAGCGCGA TCACCGTAAA TTAGGTAAAG AATTAAAACT ATTTACTAAT
AGCCAAAAAG TAGGACAAGG TTTACCACTT TGGTTACCAA AAGGTGCAAC AATCCGCCGC
ATTATCGAGC GTTACATCGT TGATAAAGAA GCAAGCTTAG GCTATGATCA CGTATATACT
CCAGTACTAG GAAGCAGAGA GCTTTATGAA ACTTCTGGTC ACTGGAATCA CTACCGTGAT
GGTATGTTCC CATCAATGGA AATGGATAAT GAAGAGTTAG TTCTTCGTCC AATGAACTGC
CCTCACCACA TGATGGTTTA TAAAAACGAT ATTCACAGCT ACCGTGAATT ACCAATCCGT
ATTGCGGAAC TTGGAACAAT GCACCGCTAT GAAATGTCAG GTGCGTTATC TGGGTTACAA
CGTGTACGCG GAATGACTTT AAACGATGCG CACATTTTCG TTCGCCCAGA TCAAATTAAA
GAAGAGTTAA AACGTGTTGT AAACTTAACT CTAGAAGTGT ACAAAGATTT CGGTTTAGAG
AACTACTCAT TCCGTCTATC TTATCGTGAC CCAGAAGATA CTAAAAAGTA CTATGCTGAT
GATGAGATGT GGGAAAAAGC ACAAGGTATG TTAAAAGAAG CTATGGATGA AATGGGTCTT
GATTACTACG AAGCTGAAGG TGAAGCGGCA TTCTACGGTC CAAAACTTGA CGTTCAAGTT
CGTACTGCTC TTGGAAAAGA CGAAACACTT TCAACTGTAC AATTAGACTT CTTACTTCCA
GAACGCTTTG AACTATCTTA CGTTGGTGAA GACGGTAAAC AACACCGTCC AGTTGTAATT
CACCGTGGTG TTGTATCAAC TATGGAACGT TTCGTAGCAT TCTTAATTGA AGAATACAAA
GGTGCATTCC CAACTTGGTT AGCTCCAGTT CAAGTACAAG TAATTCCAGT TTCTCCGCAA
GTACATTTAG ACTATGCGAA GAAAGTACAA GACGAATTGC GCCGTGCTGG TATCCGTGTT
GAATTAGATA CTCGTGAAGA GAAAATTGGT TACAAAATCC GTGAAGCACA AATGCAAAAA
ATTCCGTACA TGCTTGTAGT AGGTGACAAT GAAGTAACTG AAAACGGCGT AAACGTACGT
AAATACGGTG AACAAAAATC AGAAACAATC GCATTAGATG CTTTTGTTGA TATGATTAAA
GTAGAAGGAA AACGATAA
 
Protein sequence
MADVVKITFP DGAVKEFPKG VTTEEIAASI SPGLKKKAVA GKLNDEMIDL VTPIEEDGAV 
SIITLDSEDG LYILRHSTAH LLAQALKRLY KDVKVELGIG PVIENGFYYD IDMEEAITVE
DFKKIEKEMQ KIVNENLEIV RHEVPRAEAI RRFEEIGDEL KLDLINDLPE DAVISIYEQG
EFFDLCRGVH LPSTGKIKVF KLLSVAGAYW RGDSNNKMLQ RIYGTAFVKK AELDEHLRML
EEAKERDHRK LGKELKLFTN SQKVGQGLPL WLPKGATIRR IIERYIVDKE ASLGYDHVYT
PVLGSRELYE TSGHWNHYRD GMFPSMEMDN EELVLRPMNC PHHMMVYKND IHSYRELPIR
IAELGTMHRY EMSGALSGLQ RVRGMTLNDA HIFVRPDQIK EELKRVVNLT LEVYKDFGLE
NYSFRLSYRD PEDTKKYYAD DEMWEKAQGM LKEAMDEMGL DYYEAEGEAA FYGPKLDVQV
RTALGKDETL STVQLDFLLP ERFELSYVGE DGKQHRPVVI HRGVVSTMER FVAFLIEEYK
GAFPTWLAPV QVQVIPVSPQ VHLDYAKKVQ DELRRAGIRV ELDTREEKIG YKIREAQMQK
IPYMLVVGDN EVTENGVNVR KYGEQKSETI ALDAFVDMIK VEGKR