Gene BAS2698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2698 
Symbol 
ID2848734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2678385 
End bp2679587 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content34% 
IMG OID637505943 
Producttransporter 
Protein accessionYP_028956 
Protein GI49185704 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0119917 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGCGTA ATAAAAATGT TTGGATTGTT TTAATTGGGG AGTTTATTGC TGGTTTAGGG 
TTATGGCTTG GAATTCTTGG CAACCTGGAA TTTATGCAAA AATATGTCCC TTCTGATTTC
ATGAAATCAG TTATATTGTT TATCGGACTA TTAGCAGGTG TTCTAGTGGG ACCTATGGCT
GGTCGTATCA TCGATCAATA TGAAAAGAAA AAAGTCCATC TTTATGCTGG TTTTGGTCGT
GTTATTAGTG TTATTTTTAT GTTTTTCGCT ATCCAATTTG AAAGTATCGC CTTTATGATT
GCATTTATGG TTGCACTTCA AATTTCAGCA GCATTTTATT TCCCTGCATT ACAATCTGTA
ATTCCACTCA TCGTACGTGA GCATGAGTTA TTACAAATGA ACGGTGTACA TATGAATGTA
GGTACAATCG CTCGTATTGC AGGTACTTCA CTAGGTGGAA TTCTTTTAGT TGTAATGAGT
TTACAATATA TGTACGCCTT CTCAATGGCA GCATATGCTT TATTATTCCT CTCAACTTTC
TTCCTACAAT TCGAAGATAA GAAATCAACA ACACCAAGTA AACAAGCTGC AAAAGATAAT
AGCTTTATGG AAGTATTTCG TATTTTAAGA GGAATTCCGA TTGCTTTCAC AGCACTTATA
TTAAGTATTA TCCCTCTATT ATTTATAGCT GGATTTAATT TAATGGTAAT TAATATTAGC
GAAATGCAAC ATGATCCAAC GATTAAAGGC TTTATATATA CGATTGAAGG TATCGCATTT
ATGTTAGGCG CCTTCGTTAT TAAACGTTTA TCTGATCATT TCAAACCTGA AAAGTTACTA
TATTTCTTCG CTGTTTGTAC CGCTTTTGCA CATCTATCAT TGTTCTTTAG CGATATAAAA
TGGATGTCTC TTACATCATT TGGATTGTTT GGTTTTAGTG TTGGTTGTTT CTTCCCTATT
ATGTCGACAA TTTTCCAAAC GAAAGTGGAA AAGAGCTATC ACGGCCGACT CTTCTCATTC
CGTAATATGT TTGAAAGAGT GATGTTCCAA ATTGTCTTAC TTGGCACAGG CTTCTTCTTA
GATACGATTG GATTGCAATA TATGGTTCTT ATTTTCGGTG TTATTTCATT ATTCATTATT
TTCATATCGC TTTCTAAACA GAAACAGTAC GAAAAACAAC CATCGCAATC TGCGAATTTA
TAA
 
Protein sequence
MWRNKNVWIV LIGEFIAGLG LWLGILGNLE FMQKYVPSDF MKSVILFIGL LAGVLVGPMA 
GRIIDQYEKK KVHLYAGFGR VISVIFMFFA IQFESIAFMI AFMVALQISA AFYFPALQSV
IPLIVREHEL LQMNGVHMNV GTIARIAGTS LGGILLVVMS LQYMYAFSMA AYALLFLSTF
FLQFEDKKST TPSKQAAKDN SFMEVFRILR GIPIAFTALI LSIIPLLFIA GFNLMVINIS
EMQHDPTIKG FIYTIEGIAF MLGAFVIKRL SDHFKPEKLL YFFAVCTAFA HLSLFFSDIK
WMSLTSFGLF GFSVGCFFPI MSTIFQTKVE KSYHGRLFSF RNMFERVMFQ IVLLGTGFFL
DTIGLQYMVL IFGVISLFII FISLSKQKQY EKQPSQSANL