Gene BAS3194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3194 
Symbol 
ID2851828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3171994 
End bp3173241 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content37% 
IMG OID637506438 
Productmajor facilitator family transporter 
Protein accessionYP_029451 
Protein GI49186199 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGGAA TTATAGGGAA GAGGGGAAAT CAATTGAATT CATATACAGC ATCGTCTTCA 
GAAGTTCAGA CGAATCGAAG AAGTATATTT GCGTTATTAG CGCTAGCAAT TAGTGCGTTC
GGGATTGGGA CAACTGAATT TGTTAGTGTC GGTTTATTAC CATCTATTTC GAAAGATTTA
CATGTGTCGG TGACAACAGC TGGTTTAACA GTTTCTTTAT ATGCGTTAGG AGTAGCATTT
GGTGCTCCAG TATTAACGTC GTTAACAGCT AATATGTCAC GAAAAACGTT ATTAATGTGG
ATTATGATTA TTTTCATTAT TGGTAACGGA ATTGCGGCTG TCGCAACAAG CTTCACTGTA
TTACTTATTG CGCGAATTGT GTCTGCACTT TCGCATGGTG TGTTTATGTC AATTGGTTCA
ACGATTGCTG CGGCACTCGT ACCAGAAAAT AAACGTGCTA GCGCGATTGC GATTATGTTT
ACTGGCGTAA CAGTCGCAAC TATTACAGGT GCACCAATTG GAACATTTAT CGGTCAACAA
TTTGGCTGGA GAACATCATT TTTAGCAATT GTAGTCATTG GAATTATTGC TTTAATCGCA
AATAGTATTC TCATTCCATC TAATATGAAA AAAGGTACGT CTGTATCATT CCGCGATCAA
TTTAAACTGG TTACGAACGG AAGACTGTTA CTTGTTTTCA TTATTACTGC ACTTGGATAC
GGCGGTACAT TCGTAACATT TACGTATTTA TCTCCGTTAT TACAAGAAGT AACAGGATTT
AAAGCTAATA CGGTTACGAT CATTTTATTA GTATATGGAA TCGCTATTGC AATAGGGAAT
GTGATTGGCG GGAAATTATC GAATCATAAT CCAATTCGAG CGCTATTTTA CATGTTCTTT
ATTCAAGCGA TTATATTATT TGTTTTAACA TTTACAGCGC CATTTAAAGT AGCTGGGTTA
ATTACAATTA TTTTCATGGG ACTATTCGCA TTTATGAATG TTCCAGGGTT ACAAGTATAT
GTCGTAATGT TAGCTGAACG ATTTGTACCG AGTGCTGTCG ATGTTGCATC GGCAATTAAT
ATTGCGGCTT TTAATGCTGG GATTGCTCTT GGTGCTTATT TAGGTGGTAT TGTAACGAAT
TCGTTAGGGT TAATTCATAC GGCTTGGGTA GGCGGCATTA TGGTAGTAGG TGCTGTTATT
TTAACAGCAT GGAGTATGTC ATTAGAAAAA CGAGATCAAG TAAAATAA
 
Protein sequence
MIGIIGKRGN QLNSYTASSS EVQTNRRSIF ALLALAISAF GIGTTEFVSV GLLPSISKDL 
HVSVTTAGLT VSLYALGVAF GAPVLTSLTA NMSRKTLLMW IMIIFIIGNG IAAVATSFTV
LLIARIVSAL SHGVFMSIGS TIAAALVPEN KRASAIAIMF TGVTVATITG APIGTFIGQQ
FGWRTSFLAI VVIGIIALIA NSILIPSNMK KGTSVSFRDQ FKLVTNGRLL LVFIITALGY
GGTFVTFTYL SPLLQEVTGF KANTVTIILL VYGIAIAIGN VIGGKLSNHN PIRALFYMFF
IQAIILFVLT FTAPFKVAGL ITIIFMGLFA FMNVPGLQVY VVMLAERFVP SAVDVASAIN
IAAFNAGIAL GAYLGGIVTN SLGLIHTAWV GGIMVVGAVI LTAWSMSLEK RDQVK