Gene BAS0800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS0800 
Symbol 
ID2853080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp847854 
End bp849110 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content36% 
IMG OID637504062 
Productmajor facilitator family transporter 
Protein accessionYP_027076 
Protein GI49183824 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTTT GGAGTATGCA TCGAAATATA AAAATTAGAA TTATAACTTC GTTTTTAACA 
CGAACTGTAT CCACGATGAT TTTTCCATTT ATGGCGATTT ATTTTTCAAT AAAGTTAGGT
AGTGCGATTG CTGGTGCGTT ACTACTCATT AATGTCATGG CTTCATTAGT AATTGGTTTA
TACGGTGGAT ATGTTGGAGA TCGGCTTGGT CGTAAAAAGG TGATGATTAT TGGTCAAAGT
ATACAAGTCA TTTCCATTGC TTGTATGGGG ATTGCAAATT CGGATTACGT AGATTCACCG
TGGCTCACAT TTGTATTTAT GCTAGTGAAT AGCCTAGGAT CTGGACTTAT GAATCCTGCG
ACAGAGGCGA TGTTAATTGA TGTGAGTACA CCGGAAAATC GAAAAGTGAT GTACAGCATT
AACTACTGGG CGATTAATTT ATCCATTGCA ATTGGAGCGA TATTTGGCGG ATTATTGTTT
GAAAACTATA GATTACAATT GTTTATCGTA TTAACGCTTG TTGCGATTAT TACTTTATAT
GTGATGGCTG TATATATGGA AGAAGTGTAC GTAGCTAGAA AAACAGTAGA GAAGAAAAAT
GTATTAAAAG ATATGGCGGA TAGTTATAAA GTTGTGATGA AAGATAGAGC GTTTTTAATT
TTTTGTGCAG CGAGTATATG TACGTTATCG TTAGAGTTTC AAATTAATAA TTATTTAGGA
GTACGCTTGC AGAAGGAATT TGAAACGGTG CACTTTTTCT TCGGGAATGG TTTTACGTTT
GATTTAACAG GTATTCGCAT GCTGAGCTGG ATTTCAGCAG AGAATACAAT TTTAGTTGTG
TTATGTTCGG CGCTTCTTAT TAAAATGTTG AAACGCTTCA ATGATTTGAA AATCTTATAT
GTCGGCTTAT TCATTTATAC AATTGGATTT ACAATACTCG GAACGAGCAA TAGCTTATGG
ATTTTATTAA TTGCAGGGCT TTTCCAAACG GTAGGCGAGA TGATGTATGT GCCAGTGCGT
CAATCTATTA TGGCAGATAT GGTGCCGAAT GAGGCGAGAG GTTCATATAT GGCGATTAAC
GGAATGGTTT TTCAAGTGGC AAAAATGAAC GGGGCATTAG GTGTTATGCT AGGTTCATTT
ATCGCATCTT GGGGCATGAG TGCTCTGTAC TTTATCGTTG GTATGAGCAG TATTTTATTA
TTTATGAAGG CGATAGGGAA AGAGAAGTAT GAGAGGCAGA TTTCTCAGAT TGGATAA
 
Protein sequence
MGFWSMHRNI KIRIITSFLT RTVSTMIFPF MAIYFSIKLG SAIAGALLLI NVMASLVIGL 
YGGYVGDRLG RKKVMIIGQS IQVISIACMG IANSDYVDSP WLTFVFMLVN SLGSGLMNPA
TEAMLIDVST PENRKVMYSI NYWAINLSIA IGAIFGGLLF ENYRLQLFIV LTLVAIITLY
VMAVYMEEVY VARKTVEKKN VLKDMADSYK VVMKDRAFLI FCAASICTLS LEFQINNYLG
VRLQKEFETV HFFFGNGFTF DLTGIRMLSW ISAENTILVV LCSALLIKML KRFNDLKILY
VGLFIYTIGF TILGTSNSLW ILLIAGLFQT VGEMMYVPVR QSIMADMVPN EARGSYMAIN
GMVFQVAKMN GALGVMLGSF IASWGMSALY FIVGMSSILL FMKAIGKEKY ERQISQIG