Gene BAS5118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5118 
SymboltagH 
ID2848860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4998667 
End bp5000316 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content32% 
IMG OID637508373 
Productteichoic acids export protein ATP-binding subunit 
Protein accessionYP_031357 
Protein GI49188104 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATA AAGTTAAGTT TGAGCACGTT ACAAAAAAGT ATAAGCTGTA CAACAAGCCT 
TTTGATAAGC TTAAGGATTT ATTTTTTAGG AGCAAAGATG GAGAATATCA TTATGCTTTG
AATAATATTT CTTTTGAAGT TCCAGAGGGT GAAATTGTTG GAATTGTAGG TTTAAATGGT
TCTGGGAAGA GTACATTGTC GAATTTAATT GCTGGTGTTA CGATGCCTAA TAAAGGAACA
GTGGATATTA AAGGTTCAGC TGCATTGATA GCAATTTCTT CAGGGCTTAA TGGTCAATTA
ACAGGGATTG AAAATATCGA ATTAAAAGGT TTAATGATGG GAATTACAAA AGAGAAGATT
AAGGAAATTA TTCCTGAAAT CATTGACTTT GCTGATATTG GAAAGTTTAT GTATCAACCT
GTTAAAACTT ATTCAAGTGG AATGAAATCT AGACTAGGAT TTGCGATATC AGTTCATATT
AATCCTGATA TTTTAGTGAT AGATGAGGCA CTTTCTGTTG GTGATCAAAC GTTTACGAAA
AAGTGCCTAG ATAAGATGAA TGAATTTAAA GAGCAAGGGA AAACAATCTT CTTTATTAGT
CATTCACTTT CTCAAGTGAA AAGTTTCTGT ACAAAAGCGT TATGGCTTCA TTATGGTCAA
GTAAAAGAAT ACGGGGATAT AAAAGAGATT GTCGATCATT ATGATGAATT TCTGAAGAAA
TATAATCAAA TGAGTGTTGA GGAAAGAAAA GACTTAAGAA AAGAACAAAT ATCTCAATTT
CAACATGGCT TACTACAAGA GGATCAAACC GGTAGGGAGA GAAAACGTAA AAAAGGAAAA
AAAACAAGCC GAAAGTTTAA AAAGAAGAGG GTTCTGATTA CAGGAGTTTG TATAGCTCTA
TTAACAGGTA TAATTTCAAC AGGCTATTAT TATAAAAATT TACTACCATT CAACAGTGAA
AATAAATATG CCGAAAAAGT TGCTTCAAAA GAGAATGTGA CTGAATCTAA GCAAATGGTA
AAGAAGGAAA AAGGAGCAGC AAAGTATATT GTAAATAGTA ATGGAATCAG TATTCGTGAA
GAAGCAGATG CAAGCAGTAA ACGATTAGCT ATAGCAAACT TCGGGGATAT TTTTACTATA
TCTGATAGCA ATAAAAATGA GAAAAAAGAT GTTGAATGGA TACAAATTAC ATTATCAAAT
GGTGAAATTG GATGGATAAG CACAAAGTTT ATTGAACCGT TTAAATCGAA TAATAACATA
ATCGAGGATG CTAAGTTAGC AGATGTAACT GCGTTGTTAA AACGTGTATA TGGAGGGAAT
ATGGTAAGTG CTCCTACTTA TTTTGGTAAA ACACTAAATG AGTTAGAGAC AACTTATCCT
CAACCTTTAA ATCCATTACC AAGTATGACG GGAAAAACGA TTGTTAAAGA TGGCAATATT
CAATTTGGGA TTTCACAAGA TAAGGTAGTG GAGGTTGTAT TCCAAGATAT TTCAATGTCG
ATTGCAAAGT TACATGAATT ATTAGGAAAA GAAAGCTTAA GTAATGATGC AGAGAAAAAC
TACTTCTATG AAACAAAAAG TTACTATATT GCAGCCCGTT CAGATCAGAC GCATAAAGAA
ATTCAATCAA TATCGATTGT AAAGAAATAA
 
Protein sequence
MNYKVKFEHV TKKYKLYNKP FDKLKDLFFR SKDGEYHYAL NNISFEVPEG EIVGIVGLNG 
SGKSTLSNLI AGVTMPNKGT VDIKGSAALI AISSGLNGQL TGIENIELKG LMMGITKEKI
KEIIPEIIDF ADIGKFMYQP VKTYSSGMKS RLGFAISVHI NPDILVIDEA LSVGDQTFTK
KCLDKMNEFK EQGKTIFFIS HSLSQVKSFC TKALWLHYGQ VKEYGDIKEI VDHYDEFLKK
YNQMSVEERK DLRKEQISQF QHGLLQEDQT GRERKRKKGK KTSRKFKKKR VLITGVCIAL
LTGIISTGYY YKNLLPFNSE NKYAEKVASK ENVTESKQMV KKEKGAAKYI VNSNGISIRE
EADASSKRLA IANFGDIFTI SDSNKNEKKD VEWIQITLSN GEIGWISTKF IEPFKSNNNI
IEDAKLADVT ALLKRVYGGN MVSAPTYFGK TLNELETTYP QPLNPLPSMT GKTIVKDGNI
QFGISQDKVV EVVFQDISMS IAKLHELLGK ESLSNDAEKN YFYETKSYYI AARSDQTHKE
IQSISIVKK