Gene BAS5271 details

Gene Information

Locus tag: BAS5271
Symbol:
ID: 2852414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 5153438
End bp: 5154637
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 38%
IMG OID: 637508525
Product: major facilitator family transporter
Protein accession: YP_031509
Protein GI: 49188256
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
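
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
START_BP = 5153438
END_BP = 5154637
GENE_LENGTH_BP = 1200
PROTEIN_LENGTH_AA = 399


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
coordinates_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```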


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000222558
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGCAAGG TGAAAGAAAT TTCGAAGCGA AAGCTACTTG GTATAGCGGG GCTTGGATGG 
TTATTTGATG CAATGGATGT TGGAATGCTT TCATTTGTAA TGGTGGCATT GCAAAAAGAT
TGGGGATTAA GTACGCAAGA AATGGGCTGG ATAGGCAGCA TTAATTCAAT TGGTATGGCA
GTTGGAGCGC TCGTTTTTGG AATACTATCA GATAAAATAG GGCGGAAATC AGTCTTTATT
ATTACATTAT TATTATTTTC TATCGGTAGT GGTTTAACGG CTTTAACGAC AACACTTGCT
ATGTTCCTTG TTTTAAGATT TTTAATCGGT ATGGGGCTAG GGGGAGAGCT TCCAGTTGCC
TCTACATTAG TATCAGAGAG TGTTGAAGCA CATGAACGCG GCAAAATAGT TGTGTTATTA
GAAAGTTTTT GGGCAGGTGG ATGGTTAATT GCGGCTCTTA TCTCGTATTT TGTTATACCG
AAATATGGTT GGGAAGTTGC GATGATATTA AGTGCGATTC CGGCGCTATA TGCTTTATAT
TTAAGATGGA ATTTACCGGA TTCTCCGAGA TTCCAAAAGG TTGAAAAAAG GCCATCTGTT
ATCGAAAATA TAAAGTCAGT TTGGTCTGGA GAATACCGTA AGGCAACAAT TATGTTATGG
ATTTTATGGT TTTCTGTTGT CTTTTCCTAT TATGGAATGT TCCTTTGGTT ACCTAGTGTA
ATGGTATTAA AAGGATTTAG TTTAATAAAA AGTTTCCAAT ACGTACTTAT TATGACGTTA
GCTCAATTGC CGGGTTATTT CACAGCTGCT TGGTTTATTG AACGTCTTGG TCGTAAGTTT
GTTTTAGTTA CGTATTTAAT TGGTACAGCA TGCAGTGCTT ACTTATTTGG AGTAGCAGAG
TCATTAACAG TATTAATCGT AGCAGGCATG TTACTATCCT TCTTTAATTT AGGTGCTTGG
GGTGCATTAT ATGCCTACAC ACCTGAACAA TATCCAACAG TTATTCGTGG TACAGGTGCA
GGGATGGCAG CAGCATTTGG TCGTATTGGT GGTATTCTTG GACCGCTATT AGTAGGATAT
TTAGTTGCTT CACAGGCTTC ACTATCACTA ATATTTACGA TTTTCTGTGG ATCCATTTTA
ATAGGCGTAT TTGCTGTAAT TATACTTGGG CAAGAAACGA AACAACGAGA ATTAGTATAA
 
Protein sequence
MGKVKEISKR KLLGIAGLGW LFDAMDVGML SFVMVALQKD WGLSTQEMGW IGSINSIGMA 
VGALVFGILS DKIGRKSVFI ITLLLFSIGS GLTALTTTLA MFLVLRFLIG MGLGGELPVA
STLVSESVEA HERGKIVVLL ESFWAGGWLI AALISYFVIP KYGWEVAMIL SAIPALYALY
LRWNLPDSPR FQKVEKRPSV IENIKSVWSG EYRKATIMLW ILWFSVVFSY YGMFLWLPSV
MVLKGFSLIK SFQYVLIMTL AQLPGYFTAA WFIERLGRKF VLVTYLIGTA CSAYLFGVAE
SLTVLIVAGM LLSFFNLGAW GALYAYTPEQ YPTVIRGTGA GMAAAFGRIG GILGPLLVGY
LVASQASLSL IFTIFCGSIL IGVFAVIILG QETKQRELV
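
The relationship between the gene sequence, the GC content, and the protein sequence can be reproduced with a short script. This is a sketch under stated assumptions, not the database's own pipeline: it hard-codes the amino acid assignments of NCBI translation table 11, treats an initiating GTG (or another table 11 start codon) as methionine, and strips the 10-character spacing used in the listing. The helper names `clean`, `gc_content`, and `translate_cds` are hypothetical.

```python
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by table 11; as initiators they encode Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, dropping a terminal stop codon if present."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"  # initiator codon is read as methionine
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First row of the gene sequence listing above.
first_row = "GTGGGCAAGG TGAAAGAAAT TTCGAAGCGA AAGCTACTTG GTATAGCGGG GCTTGGATGG"
print(translate_cds(first_row))  # MGKVKEISKRKLLGIAGLGW
```

Passing the full gene sequence to `translate_cds` and `gc_content` in the same way allows the listed protein sequence and GC content to be checked against the nucleotide listing.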