Gene BAS1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1750 
Symbol 
ID2851725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1772070 
End bp1773278 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content36% 
IMG OID637505001 
Productenterotoxin 
Protein accessionYP_028014 
Protein GI49184762 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.385286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAA AACCTTATAA AGTAATGGCT CTTTCAGCAC TTATGGCAGT ATTTGCAGCA 
GGGAATATTA TGCCGGCCCA TACGTATGCA GCTGAAAGTA CTGTGAAACA AGCTCCAGTT
CATGCGGTAG CAAAAGCTTA TAATGACTAT GAAGAATACT CATTAGGACC AGAAGGCTTA
AAAGATGCAA TGGAAAGAAC AGGTTCAAAC GCTTTAGTAA TGGATCTGTA TGCTTTAACA
ATCATTAAAC AAGGTAATGT TAACTTTGGA AATGTATCGA CTGTTGATGC TGCTTTAAAA
GGAAAAGTGA TTCAGCACAA GGATACAGCT AGAGGAAATG CGAAGCAATG GTTAGATGTA
TTAAAGCCAC AGCTTATTTC AACGAATCAA AATATCATTA ACTATAATAC GAAATTCCAA
AACTATTATG ATACTTTAGT TGCTGCGGTT GATGCAAAAG ATAAAGCGAT ACTTACGAAA
GGGTTAACTA GATTATCAAG TAGTATTAAT GAAAATAAAG CGCAAGTAGA TCAGTTAGTA
GAAGACTTGA AGAAATTCCG AAATAAAATG ACGTCGGATA CGCAAAACTT CAAGGGTGAT
GCAAATCAAA TTACATCTAT TTTAGCTAGT CAAGACGCTG GAATCCCGCT TCTGCAAAAT
CAAATTACAA CGTACAATGA AGCAATTAGT AAATATAATG CAATTATTAT CGGTTCATCA
GTTGCGACAG CTCTAGGGCC AATTGCAATT ATCGGTGGTG CAGTAGTTAT TGCTACAGGT
GCAGGAACGC CACTAGGAGT CGCATTAATT GCAGGAGGCG CAGCGGCTGT AGGCGGTGGT
ACAGCTGGAA TCGTATTAGC GAAGAAAGAG CTTGATAATG CACAAGCTGA AATTCAAAAA
ATAACTGGAC AAATTACAAC TGCTCAATTA GAAGTAGCTG GGTTAACGAA CATTAAGACA
CAAACGGAGT ATTTAACAAA TACAATTGAT ACTGCAATTA CAGCGTTGCA AAACATTTCA
AACCAATGGT ACACAATGGG ATCAAAATAC AATTCTTTAC TTCAAAATGT AGATTCAATT
AGTCCAAACG ACCTTGTTTT CATTAAAGAA GATTTAAACA TTGCGAAAGA TAGCTGGAAA
AACATTAAAG ACTATGCAGA AAAGATTTAT GCTGAAGATA TTAAAGTAGT AGATACGAAA
AAAGCATAA
 
Protein sequence
MTKKPYKVMA LSALMAVFAA GNIMPAHTYA AESTVKQAPV HAVAKAYNDY EEYSLGPEGL 
KDAMERTGSN ALVMDLYALT IIKQGNVNFG NVSTVDAALK GKVIQHKDTA RGNAKQWLDV
LKPQLISTNQ NIINYNTKFQ NYYDTLVAAV DAKDKAILTK GLTRLSSSIN ENKAQVDQLV
EDLKKFRNKM TSDTQNFKGD ANQITSILAS QDAGIPLLQN QITTYNEAIS KYNAIIIGSS
VATALGPIAI IGGAVVIATG AGTPLGVALI AGGAAAVGGG TAGIVLAKKE LDNAQAEIQK
ITGQITTAQL EVAGLTNIKT QTEYLTNTID TAITALQNIS NQWYTMGSKY NSLLQNVDSI
SPNDLVFIKE DLNIAKDSWK NIKDYAEKIY AEDIKVVDTK KA