Gene BAS5037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5037 
Symbol 
ID2852000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4908448 
End bp4909428 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content40% 
IMG OID637508292 
Productpeptide chain release factor 2 
Protein accessionYP_031276 
Protein GI49188023 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1186] Protein chain release factor B 
TIGRFAM ID[TIGR00020] peptide chain release factor 2 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGGCG CAGGATTTTG GGATGACCAA CAAGGCGCAC AAGCTGTAAT TAATGAAGCG 
AATGCACTGA AAGATATGGT TGGAAAGTTC CGTCAGCTAG ATGAGACGTT CGAGAATCTA
GAAATTACGC ATGAGCTTTT AAAAGAAGAG TATGATGAAG ATTTACATGA GGAGCTTGAA
TCAGAAGTAA AAGGTTTAAT TCAAGAAATG AATGAGTATG AACTTCAGTT ACTACTTAGC
GATCCTTATG ATAAAAATAA AGCGATTTTA GAATTACACC CAGGTGCTGG TGGAACAGAG
TCACAAGACT GGGGCTCTAT GTTACTACGT ATGTACACAC GTTGGGCTGA AAAACGTGGA
TTTAAAGTAG AAACAGTTGA CTACTTACCA GGTGATGAAG CTGGTATTAA GAGTGTTACG
TTATTAATTA AAGGTCATAA CGCTTACGGT TACTTAAAGG CAGAGAAAGG TGTACATCGT
CTTGTACGTA TTTCTCCATT CGATTCTTCA GGCCGTCGCC ATACATCGTT CGTATCTTGT
GAAGTTGTAC CTGAGTTCAA TGATGAAGTT GAAATTGAAG TGCGTACAGA AGACTTGAAA
ATTGATACGT ATCGTGCAAG TGGAGCTGGT GGACAGCACG TTAATACGAC AGATTCAGCA
GTTCGTATTA CGCATACGCC GACAAATACG GTTGTAACGT GTCAGTCAGA GCGTTCTCAA
ATTAAAAACC GTGAGCATGC GATGAAGATG TTAAAAGCGA AATTATATCA AAAGAAATTA
GAAGAGCAAC AAGCGGAGTT AGATGAAATT CGCGGAGAAC AAAAGGAAAT TGGATGGGGT
AGTCAAATCC GTTCTTACGT ATTCCACCCG TATTCTCTTG TGAAAGACCA CCGTACAAAT
ACAGAGGTCG GTAACGTGCA AGCAGTTATG GATGGAGAAA TTGACCCATT CATTGATGCT
TACTTACGTT CTCGCATCTA A
 
Protein sequence
MMGAGFWDDQ QGAQAVINEA NALKDMVGKF RQLDETFENL EITHELLKEE YDEDLHEELE 
SEVKGLIQEM NEYELQLLLS DPYDKNKAIL ELHPGAGGTE SQDWGSMLLR MYTRWAEKRG
FKVETVDYLP GDEAGIKSVT LLIKGHNAYG YLKAEKGVHR LVRISPFDSS GRRHTSFVSC
EVVPEFNDEV EIEVRTEDLK IDTYRASGAG GQHVNTTDSA VRITHTPTNT VVTCQSERSQ
IKNREHAMKM LKAKLYQKKL EEQQAELDEI RGEQKEIGWG SQIRSYVFHP YSLVKDHRTN
TEVGNVQAVM DGEIDPFIDA YLRSRI