Gene GBAA_1014 details

Gene Information

Locus tag: GBAA_1014
Symbol:
ID: 2814814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 1005503
End bp: 1006723
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 31%
IMG OID: 637787987
Product: transporter
Protein accession: YP_017642
Protein GI: 47526293
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
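
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 1005503
end_bp = 1006723
reported_gene_length = 1221      # bp
reported_protein_length = 406    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1221 406
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # True True
```

Under those assumptions both reported lengths agree with the coordinates, and the gene length is a whole number of codons (407).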


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.32709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAATT TGACTAAAAA GACAAATTTT CTAATATTCA TTTTAGCAAT TAGTTGTGGC 
TCACTTGTTG CGAATATTTA TTATGCACAG CCAATTGTAC AATTCATTGC AAAAGACTTG
AATATCGCTT CGGATTTATC TGGATTGCTC ACTACTTTGA CGCAAATTGG ATATGGATTG
GGCTTGTTTT TTATCGTACC AATGGCAGAT TTATTCAAAA GTAAGAAAAT AATAGGTATT
CTTATCGGAC TCACTATTAT TTCATTGATT GGTACGCTAA TTTCGACAAA TGGAATTGTT
TTTTTAATAC TAACAACTGT AATTGGTATT GGAGCCTGTG CAGCTCAAAT GTTAGTTCCG
CTAACAATGA GGATTGTACC TATTGAAGAG ATGGGTAAAT ATGTGGGTAA AGTAATGAGT
GGTTTATTAA TTGGGATTAT GATTGCTCGC CCATTATCTA TCGGAATAAC TGAATGGTTC
GGCTGGAGAA TGGTATTTCT TTTTTCACTA ATCATTCTAG TTGCTGTATT ACTTTTACTT
ATAAAATTTT TGCCCAACTA TGAAGTAGTA TCAAATAGTA ACATGTCATA TTCAAATTTA
ATAGCTTCTA TGGTAAAACT GCTACTACAT ACTTCTCCGT TACAACAAAG AGCTTTTTAT
CACGCATGTT TATTTGCAAC ATTTAGTCTT TATTGGACAG TTATTCCAAT CTTATTACGG
TCAGAACCAT TACATTTCTC AAATAATGAA ATTGCATTGT TTGGATTTGC TGCAATAGCT
GGAGCTTTAT TAACTCCTAC TATTGGTAAA ATCGCAGATA AAGGCTATAT TTTTACAATG
ACTAATGTAT CAATGGCGCT CGTACTATTA TCTATCGTAC TATTATTTTT TGTTCAAGAT
CATTCACTTT TTAGTGTGAT TGTAATACTT ATTTCAGGTA TTAGCATCGA TATTGGTGTA
GCAGGAAATT TATTATTAGG TCAAAAAGTT ATCTTTAGTT TGAATCCTGA GATAAGAAAC
AGACTGAATG GATTATATAT GACCATTTTC TTTTTGGGAG GAGCCTTTGG TTCATGTATT
GGAAGTTATA CGTACTATAA ATTTAATAGC GAAGTACCGT TACTCATTGG AGCGGCTTTA
CCTTTAATCG CCTTATTTGT GCATTTAATA AAAAATAATG CGATACATTT ATCAAAAACG
AAAAATAAAT ATATGTCTTA A
 
Protein sequence
MINLTKKTNF LIFILAISCG SLVANIYYAQ PIVQFIAKDL NIASDLSGLL TTLTQIGYGL 
GLFFIVPMAD LFKSKKIIGI LIGLTIISLI GTLISTNGIV FLILTTVIGI GACAAQMLVP
LTMRIVPIEE MGKYVGKVMS GLLIGIMIAR PLSIGITEWF GWRMVFLFSL IILVAVLLLL
IKFLPNYEVV SNSNMSYSNL IASMVKLLLH TSPLQQRAFY HACLFATFSL YWTVIPILLR
SEPLHFSNNE IALFGFAAIA GALLTPTIGK IADKGYIFTM TNVSMALVLL SIVLLFFVQD
HSLFSVIVIL ISGISIDIGV AGNLLLGQKV IFSLNPEIRN RLNGLYMTIF FLGGAFGSCI
GSYTYYKFNS EVPLLIGAAL PLIALFVHLI KNNAIHLSKT KNKYMS
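
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they could be processed, the Python below joins the blocks, computes a GC fraction, and translates codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first line of the gene sequence is used so that the example stays short. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so a standard codon table is used here.

```python
# Build the codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate whole codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
first_line = clean(
    "ATGATAAATT TGACTAAAAA GACAAATTTT CTAATATTCA TTTTAGCAAT TAGTTGTGGC")

print(translate(first_line))  # MINLTKKTNFLIFILAISCG
print(round(gc_fraction(first_line), 3))
```

The translated fragment matches the first 20 residues of the protein sequence. Running the same functions on the full gene sequence would give the GC content for the whole gene rather than for this 60-base fragment.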