Gene BAS5298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5298 
Symbol 
ID2851143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp5184084 
End bp5185652 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content38% 
IMG OID637508551 
ProductBCCT family osmoprotectant transporter 
Protein accessionYP_031535 
Protein GI49188282 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGATGA AGAGTCGGAA AACGGATTGG CCTGTATTTC TTATTAGTGG TGGCTCACTT 
TTATTATTTG TAATTGCGGT TTTTTTAAAT AAAAGTTACG TAGAGGGAGT CATTAATAGC
AGTTTTGCAG CTTCAATTAA ATACTTTGGT GCTTTTTGGC AGATTTTATT AATTGGTACA
TTTGTTGTTG CAATGTGTAT GGCGTTCTCA AAGTATGGGA GAGTTAAACT TGGGGGATTA
GAAAAACCTG AGATTAGTAC GACGAAATGG CTCGCTATTA TAATGTCTAC ATTACTGGCC
GGAGGTGGTG TTTTCTGGGC AGCGGCAGAG CCGATGTACC ATTTAATGAC GGTGCCACCA
ATACATGAAG GTATAACTGC TGGAACGAAA GAAGCAGTAA TGCCTGCTTT AGCACAAAGT
TATATGCACT GGGGTTTCCT AGCTTGGACG ATTTTAGGGA CAATTAGTGC GGTAGTTATG
ATGTATGGGC ATTATCATAA AGGTATGCCG TTAAAACCTC GAACGCTTTT ATATCCTATT
TTTGGAGAGA AATTGCGAAA GAGTTGGCTT GGAACACTAA TTGATGCATT TGCCATTATT
GCAGTGGCTG CAGGGACGAT TGGTCCAATC GGATTTTTAG GTCTACAAGC AAGTTATGGC
CTACAAGCAT TGTTTAACAT CCCGGATGTG TTTACAACTC AATTAGCCAT TATTGTTTGT
GTAGTAGCTG TTTCTACTAT ATCTGCGGTG ACTGGTATTG ATAAAGGGAT TCAAATTATA
AGTAATTTAA ATGTTAGATT GGCAATTTTA TTAATGGTAT TCGTATTACT ATTTGGACCA
GGTGGATTTA TTATTGATTC ATTTGTTTCT TCGTTTGGAT TTTATATAAA TGAATTTATT
CCAATGAGCA CATATCGTGG TGATACAACT TGGTTAGGGT CGTGGACAAT CTTTTTCTGG
GGATGGTTTA TTGGGTATGG ACCGATGATG GCAATTTTAG TGAGTCGTAT TTCAAGAGGA
AGAACAATTC GAGAAATCAT CGTTGCAATT GGAATTATCG CACCGATTAT TACAACGTTT
TGGTTTACGA TTCTGGGAGG ATCAGGTGTG TTTTATGAGT TAATGAAGCC TGGTTCTATC
TCAAGCGCAC TAAGTGAATC GGGTATGCCA GCTGCTATGA TTGCAATTAC AGAGCAACTG
CCACTCTCTC ATATTATTGG ACCTGCATTT CTTTTATTAA CAATTTTATT TGTAGTGACA
ACAGGAGATT CAATGGCGTA TTCGATTTCA ATGGCAGTAA CTGGAGATGG GGATCCTAGA
ATTAGTTTGC GAGTTTTTTG GTCGCTTATT ATGGGAACCG TTGCAGCGAT TCTTTTATAC
ATGGGTGAGG GTAGTATTAA TGCATTGCAA TCATTCATCG TAGTAACAGC TGTCCCAGTA
TCCATTCTGT TATTCCCAAT GCTATGGCTA GCGCCAAAAG TTGCGGGGGA ATTAGCTTTA
AAGCAAGGTA TTGTAAAAGA AGAAGAGAAA ACCTCCTTCT TGTTCCAAAA AGCTAGTAAA
TCAAAGTAA
 
Protein sequence
MRMKSRKTDW PVFLISGGSL LLFVIAVFLN KSYVEGVINS SFAASIKYFG AFWQILLIGT 
FVVAMCMAFS KYGRVKLGGL EKPEISTTKW LAIIMSTLLA GGGVFWAAAE PMYHLMTVPP
IHEGITAGTK EAVMPALAQS YMHWGFLAWT ILGTISAVVM MYGHYHKGMP LKPRTLLYPI
FGEKLRKSWL GTLIDAFAII AVAAGTIGPI GFLGLQASYG LQALFNIPDV FTTQLAIIVC
VVAVSTISAV TGIDKGIQII SNLNVRLAIL LMVFVLLFGP GGFIIDSFVS SFGFYINEFI
PMSTYRGDTT WLGSWTIFFW GWFIGYGPMM AILVSRISRG RTIREIIVAI GIIAPIITTF
WFTILGGSGV FYELMKPGSI SSALSESGMP AAMIAITEQL PLSHIIGPAF LLLTILFVVT
TGDSMAYSIS MAVTGDGDPR ISLRVFWSLI MGTVAAILLY MGEGSINALQ SFIVVTAVPV
SILLFPMLWL APKVAGELAL KQGIVKEEEK TSFLFQKASK SK