Gene BAS0949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS0949 
Symbol 
ID2849486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1005397 
End bp1006617 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content31% 
IMG OID637504209 
Producttransporter 
Protein accessionYP_027223 
Protein GI49183971 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.800881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAATT TGACTAAAAA GACAAATTTT CTAATATTCA TTTTAGCAAT TAGTTGTGGC 
TCACTTGTTG CGAATATTTA TTATGCACAG CCAATTGTAC AATTCATTGC AAAAGACTTG
AATATCGCTT CGGATTTATC TGGATTGCTC ACTACTTTGA CGCAAATTGG ATATGGATTG
GGCTTGTTTT TTATCGTACC AATGGCAGAT TTATTCAAAA GTAAGAAAAT AATAGGTATT
CTTATCGGAC TCACTATTAT TTCATTGATT GGTACGCTAA TTTCGACAAA TGGAATTGTT
TTTTTAATAC TAACAACTGT AATTGGTATT GGAGCCTGTG CAGCTCAAAT GTTAGTTCCG
CTAACAATGA GGATTGTACC TATTGAAGAG ATGGGTAAAT ATGTGGGTAA AGTAATGAGT
GGTTTATTAA TTGGGATTAT GATTGCTCGC CCATTATCTA TCGGAATAAC TGAATGGTTC
GGCTGGAGAA TGGTATTTCT TTTTTCACTA ATCATTCTAG TTGCTGTATT ACTTTTACTT
ATAAAATTTT TGCCCAACTA TGAAGTAGTA TCAAATAGTA ACATGTCATA TTCAAATTTA
ATAGCTTCTA TGGTAAAACT GCTACTACAT ACTTCTCCGT TACAACAAAG AGCTTTTTAT
CACGCATGTT TATTTGCAAC ATTTAGTCTT TATTGGACAG TTATTCCAAT CTTATTACGG
TCAGAACCAT TACATTTCTC AAATAATGAA ATTGCATTGT TTGGATTTGC TGCAATAGCT
GGAGCTTTAT TAACTCCTAC TATTGGTAAA ATCGCAGATA AAGGCTATAT TTTTACAATG
ACTAATGTAT CAATGGCGCT CGTACTATTA TCTATCGTAC TATTATTTTT TGTTCAAGAT
CATTCACTTT TTAGTGTGAT TGTAATACTT ATTTCAGGTA TTAGCATCGA TATTGGTGTA
GCAGGAAATT TATTATTAGG TCAAAAAGTT ATCTTTAGTT TGAATCCTGA GATAAGAAAC
AGACTGAATG GATTATATAT GACCATTTTC TTTTTGGGAG GAGCCTTTGG TTCATGTATT
GGAAGTTATA CGTACTATAA ATTTAATAGC GAAGTACCGT TACTCATTGG AGCGGCTTTA
CCTTTAATCG CCTTATTTGT GCATTTAATA AAAAATAATG CGATACATTT ATCAAAAACG
AAAAATAAAT ATATGTCTTA A
 
Protein sequence
MINLTKKTNF LIFILAISCG SLVANIYYAQ PIVQFIAKDL NIASDLSGLL TTLTQIGYGL 
GLFFIVPMAD LFKSKKIIGI LIGLTIISLI GTLISTNGIV FLILTTVIGI GACAAQMLVP
LTMRIVPIEE MGKYVGKVMS GLLIGIMIAR PLSIGITEWF GWRMVFLFSL IILVAVLLLL
IKFLPNYEVV SNSNMSYSNL IASMVKLLLH TSPLQQRAFY HACLFATFSL YWTVIPILLR
SEPLHFSNNE IALFGFAAIA GALLTPTIGK IADKGYIFTM TNVSMALVLL SIVLLFFVQD
HSLFSVIVIL ISGISIDIGV AGNLLLGQKV IFSLNPEIRN RLNGLYMTIF FLGGAFGSCI
GSYTYYKFNS EVPLLIGAAL PLIALFVHLI KNNAIHLSKT KNKYMS