Gene BAS2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2037 
Symbol 
ID2849061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2047477 
End bp2048883 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content36% 
IMG OID637505287 
Producthypothetical protein 
Protein accessionYP_028300 
Protein GI49185048 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.107274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTCC GACCAGAAGT AGGAGAAAAA ATTAGTCTGA ATAAAGATGT TTATCGTTTT 
GAGAAACACC CAGCTGTAAT TGGTATTGAA ATGCCGTATG GGCAAGAAGG TAGACAAGGA
ACAGTCTATC AACTGCAACA TGAAAATGGC ATGGAACGAA TTGCATTAAA AGTTTTTAAG
GAACGCTATC GCGAGGAAAA ACATCAACTA GCATTTTTGA AACCACTTTC TTCCATAGCG
GGTCTTAAAG TATGTTCACG CTATATCGTT ACTAAAGAAG AACATATATC TGCTATCGAA
AAATCAGAAG ACCTTGCTAA TAGTATTGTA ATGCCTTGGG TTGAAGGACC AACTTGGGCT
GATATTTTAC AAGAACAACG AATGTTATCA AAAGAACAAT GCTTCTTTAT TGCAGAAGCA
TTTCTTACAA CACTTAAAAT GATGGAGGAA AATGAAGTTG CTCATAATGA TTTATCGTCT
AGCAACGTAC TCATACCTTT CTTAAGTGAA AATCCAATTG AAGGCCAACA CTATATCGAA
CTTGTTGATG TTGAGCAAAT GTATGGTCCA AAAACGAAAA GACCTTCGTT ATTGCCAGCA
GGTTCAGCGG GTTACGCACC AATGTATTTA AAAAGTGGAG TATGGCAAAA AGAAGCTGAT
CGATTTGCAG GTGCCATTTT ATTAGGAGAA ATATTGAGTT GGTGTAGTGA AGAAGTTCGA
AATAAAAAAT GGACGGATGC AAGTTACTTT AAAACGGAAG AAATGCAGAA AGAATGTGAA
AGATATACGT TACTTCAGCA AGTATTACAT AATCAATGGA ATGGGGAAAT TGCAAAATTA
TTTAAGCAGG CATGGAGTAG TAACTCTTTC GCAGAATGCC CGAGCTTTGC ACAGTGGTAC
GATGTATTTC ATAGCGTGAG AGAAAGAATA AAAATTGATG CGGAAAGGCA GTTAGCAGAA
GAACACTCTC TTTTTGTATC AAAATGTTTG GAAATTGCAA GATTATTAGA AGAGAGAGGA
TTTAAACAAG CGGCATTATA TGAGTATAAA ATAATTTTCA ATTCACTCAA TCCATCAACA
GCTCTGCAAA AAGAACTCGC ATATATCATT CAAACTATGG AGAGTCAAGA GCCTGAAATA
AATAAAAAAA TGGTCCTACA ACATTATTTG GAATTAGCTA CTGAATTGGA ACGAGAAAAC
AATGCAGCAT TTGCTTGTTT CGTCTATTCA CGAATCGTAC AATTTCCAAA CATTGATCAG
GCGTTAAAAC AGGAAATTGC AAGCATTATT GAAGAGATAA AAGAAGGGCA AGGAACAGAG
ACGCAGCAAG AAGTAGCAGC TACAATTACA GTTCCAAATA GTATTCTACA GAGCCGGAAA
AAAAACGAAA AAACAAGTGG AATATGA
 
Protein sequence
MGFRPEVGEK ISLNKDVYRF EKHPAVIGIE MPYGQEGRQG TVYQLQHENG MERIALKVFK 
ERYREEKHQL AFLKPLSSIA GLKVCSRYIV TKEEHISAIE KSEDLANSIV MPWVEGPTWA
DILQEQRMLS KEQCFFIAEA FLTTLKMMEE NEVAHNDLSS SNVLIPFLSE NPIEGQHYIE
LVDVEQMYGP KTKRPSLLPA GSAGYAPMYL KSGVWQKEAD RFAGAILLGE ILSWCSEEVR
NKKWTDASYF KTEEMQKECE RYTLLQQVLH NQWNGEIAKL FKQAWSSNSF AECPSFAQWY
DVFHSVRERI KIDAERQLAE EHSLFVSKCL EIARLLEERG FKQAALYEYK IIFNSLNPST
ALQKELAYII QTMESQEPEI NKKMVLQHYL ELATELEREN NAAFACFVYS RIVQFPNIDQ
ALKQEIASII EEIKEGQGTE TQQEVAATIT VPNSILQSRK KNEKTSGI