Gene BAS3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3100 
Symbol 
ID2851204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3079565 
End bp3080764 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content36% 
IMG OID637506344 
Productmajor facilitator family transporter 
Protein accessionYP_029357 
Protein GI49186105 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.085117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAGTG AAAAACTTTG GACGAAGGAT TTCCTCGGAA CTTGTTTTAG TAGTCTCTTT 
CTCTTTTTAA CATTTTACAT GCTAATGACT ACTCTGCCTG TCTATGTAAT AGACGGGCTA
AAAGGAAAAC CAGAGGAAAT TGGTTTAGTT GCAACTGTTT TTCTTATTTC ATCTGTTTTA
TGTAGACCAT TCACAGGAAA ATGGCTAGAT GATTTAGGAA GAAAGAAAAT ATTATTTATT
TCACTTTCAT TATTTTTAGC CGCTACTGTT ATGTATTTCG GTGCGCAAAG TTTATTTTTA
TTACTTGCTC TTCGCTTCTT ACATGGTATT GGGTTTGGGA TGGCAACTAC TGCAACTGGT
ACGATTGTAA CTGATGTTGC ACCAGCTCAT AGACGAGGCG AAGCACTTGC CTATTTCGGC
GTATTTATGA GTCTGCCGAT GGTAATTGGT CCTTTTTTAG GTTTAACAAT TATTTCTCAT
TTTTCGTTTA CTGTATTATT TATCGTTTGT TCCGTATTTT CATTACTGGC ATTTTTATTA
GGACTACTTG TAAATATTCC ACATGAAGCA CCTGTAAGCA AACAAAAACA AGAAAAAATG
AAATGGAAAG ACTTACTTGA ACCATCTTCT ATTCCAATCG CTCTTACAGG ATTTGTTTTA
GCCTTTTCTT ATAGTGGTAT TTTATCCTTT ATTCCTATTT ATGCAAAAGA GCTCGGTTTA
GCTGATATTG CAAGTTACTT CTTTATTTTA TATGCACTTG TTGTTGTCAT TTCTCGTCCA
TTTACAGGTA AAATTTTCGA TCGCTTCGGT GAAAACGTAC TTGTTTATCC TGCTATTATT
ATTTTCACAA TTGGGATGTT TATTTTAAGT CAGGCGCAAA CGCCATTTTG GTTCCTTGGC
GCAGGTATGC TAATTGGTTT AGGTTATGGA ACATTAATTC CTAGCTTCCA AACGATTGCG
ATTTCTGCCG CTCCAAACCA TAGACGTGGT TCTGCGACAG CTACGTACTT CTCATTCTTT
GATAGTGGTA TTGGATTTGG TTCTTTCATT TTAGGTATAG TCGCAGCGAA ATCAAGTTAC
CATAATATGT ATTTTATCGC GGCTATTATC GTTGCTTTCA CTTTACTTCT ATATTATGGA
TTACACGGCC GCAAACAAAA ATTCAAGAAA CAACGTACAG ATGGACAAAT ATCCGCTTAG
 
Protein sequence
MQSEKLWTKD FLGTCFSSLF LFLTFYMLMT TLPVYVIDGL KGKPEEIGLV ATVFLISSVL 
CRPFTGKWLD DLGRKKILFI SLSLFLAATV MYFGAQSLFL LLALRFLHGI GFGMATTATG
TIVTDVAPAH RRGEALAYFG VFMSLPMVIG PFLGLTIISH FSFTVLFIVC SVFSLLAFLL
GLLVNIPHEA PVSKQKQEKM KWKDLLEPSS IPIALTGFVL AFSYSGILSF IPIYAKELGL
ADIASYFFIL YALVVVISRP FTGKIFDRFG ENVLVYPAII IFTIGMFILS QAQTPFWFLG
AGMLIGLGYG TLIPSFQTIA ISAAPNHRRG SATATYFSFF DSGIGFGSFI LGIVAAKSSY
HNMYFIAAII VAFTLLLYYG LHGRKQKFKK QRTDGQISA