Gene BAS3097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3097 
Symbol 
ID2851359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3076704 
End bp3077900 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content39% 
IMG OID637506341 
Productmajor facilitator family transporter 
Protein accessionYP_029354 
Protein GI49186102 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.399635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATAC GTTTTACGTT TTGGATTATG GTTGGAATTG TAGCAATCTC TGGTTTGTCA 
CAAGGGATGC TTTTACCGGC CATTGCAATG ATTTTTGAAC AAGAAGGGGT TAGTTCAAGT
ATTAATGGTA TTCATGCGAC GGCACTATAT ATTGGGATAT TAGTTATTTC CCCGTTTCTT
GAAAAACCGA TGCAAAAGTT TGGAATGAAG CCAATTATTG TGATTGGTGG GTTTCTCGTT
ATCATTTCAT TATTCTTTTT TACACAAACT TTTTCATTCT GGGTATGGTT TATCCTTAGA
TTTCTAGTTG GAGTCGGAGA TCATATGCTG CATGTCGGAA CACAAACATG GATTACGACA
ACAGCAGATC CAAGTAAAAT AGGGAGACAG GTATCGATAT ACGGTGTATT CTTCGGAATT
GGTTTTGCCG TTGGCCCGTA TTTAGCAAGC ACTGTTCAGT ACGGTCTTGC AACGCCATTT
ATTATATCTA CTATACTTTG TTTAATAGGT TGGCTGTTAC TATTACCAAC AAAAAATGCA
TTCCCAGCGC AAGATGAAAG AGAAGTGAAG AGTGAATCAT CATTTTCTCG TTATAAACAA
GTTGTTGGAT TAGGATGGAT TGCACTGCTG GGACCACTTG CATATGGCGT ACTGGAAGCG
ATGTTAAATA GTAACTTACC AGTATACGCG CTTCGAAAAG GATGGTCCGT CTCAGAAGTA
TCCTTCTTAT TGCCAGCGTT TGCAGTTGGG GGCATTATTA CACAAATTCC GCTCGGTATA
TTAAGTGACA AATACGGAAG GGACCGTATT TTAACGTGGA CGTTCTGCAT AAGTACGGGC
ATTTTCTTAC TGGCAGCCGT ATTTGATCAC TATTACTGGA TCGTCTTTGC CTGCATGTTG
TTAGCGGGTA TGGTCATTGG ATCGTGCTTC TCGTTAGGAC TTGGATTTAT GACAGATTTA
TTACCGAGAC ACTTATTGCC AGCAGGAAAT ATATTATCTG GAATCGCCTT TAGTTTAGGA
AGTATACTCG GACCTGTATT AGGAGGCGTA TTTATAGAAA AAATACAGTA TACAAGCTTT
TTTGTTGCGG TTATGATTAT AATAGGAACT CTCGCAATAT TATATATGAT CTACATGAAG
AATCAATTTG CATCAAGAAA AATAGAAAGT AGGTTAGGGC ATGACAAAAC AACCTAA
 
Protein sequence
MSIRFTFWIM VGIVAISGLS QGMLLPAIAM IFEQEGVSSS INGIHATALY IGILVISPFL 
EKPMQKFGMK PIIVIGGFLV IISLFFFTQT FSFWVWFILR FLVGVGDHML HVGTQTWITT
TADPSKIGRQ VSIYGVFFGI GFAVGPYLAS TVQYGLATPF IISTILCLIG WLLLLPTKNA
FPAQDEREVK SESSFSRYKQ VVGLGWIALL GPLAYGVLEA MLNSNLPVYA LRKGWSVSEV
SFLLPAFAVG GIITQIPLGI LSDKYGRDRI LTWTFCISTG IFLLAAVFDH YYWIVFACML
LAGMVIGSCF SLGLGFMTDL LPRHLLPAGN ILSGIAFSLG SILGPVLGGV FIEKIQYTSF
FVAVMIIIGT LAILYMIYMK NQFASRKIES RLGHDKTT