Gene BAS0749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS0749 
Symbol 
ID2850549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp803082 
End bp804599 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content37% 
IMG OID637504011 
Productmajor facilitator family transporter 
Protein accessionYP_027025 
Protein GI49183773 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000682732 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACACGC CTCTTTCTTC ATCATATTGT AAACCTTTTC AAATATATGT AGTGAAAAAG 
TATATATCCC CTTCTCATTC ACTACTTTTT GAATCCGCTT TCGTTAAAGA AAAAACAATT
TCTATCATTC ATAACAACTA CCAGTCTATA CATCTATACT TTACATCTTT TTTTCATAGT
TGTGTAGTCT TTCCACAAAT TTGTCAGTTT TTTACCTCGA AAATTGTTGT ACGTTTTCAA
GGCACCTTCC TTGACTGTTA CATTGGCCAA TACATATACT CGAAAGAAAA GAGAAATTTT
TATATAGGGG ATGAATATAT GGGAGAAGCA ATACTCGTAA AACGAGAACC GTTATGGACG
AAAGAATTTG TCGCGCTAAT TCTAGCAAAC TTATGTATGT TTTTAGGATT TCAAATGTTA
ATTCCGACTT TACCTGTTTA TGTGAAAGAA ATTGGAGGTA CAAGTTCCAA TATTGGATTT
GTTGTCGGTA TGTTTACTGT TGCAGCACTT TTTGTTAGAC CGCTAACTGG AAATGCCTTG
CAAAAATTCA GCAAAAAAAT CATTTTAATG ATTGGCACTG CTATTTGTTT ACTCGCTATG
GGTAGTTATC TTTTCGCTTC AACGGTTTTC CTATTGCTTG CCGTTCGAAT TTTACACGGA
GCCGGTTTTG GTATTACAAC GACTACATAT GGAACTGTCG TTTCCGATTT AATTCCGCAA
GCTCGCCGCG GTGAGGGCAT GGGATATTTC GGCCTTTCTG GAACGATTGC AATGGCGCTC
GGCCCACTTA TCGGACTTTG GCTTATGCAA ACATATAACT TCACGATTCT TTTTTTATGT
GCACTTTCCT GTACAATTGT TTCATTAATA TTAACGAAAC TACTTCAAAT TAAAAAATCA
CCACAACCAC CAAAGCAAAC ATCTGGTACC TTTCTCGATG GATTTATTGA GCGCAAAGCT
TTACTTCCTT CATTATTAAT ATTATGTATT ACATTAATGT ACGGGGGAAT CGGAAGCTTT
ATTACGTTAT TTGCTACGGA AGTCGGCATT GCTGATATAA GCCTCTTCTT ATTTAATGCA
CTTGCAATCG CTGTTACTCG TCCATTTTCC GGAAAGCTAT ATGATGCGAA AGGTCATTCA
TTCGTAATTA TTCCAGGAGT TATTATTACG TTTGCAGGGA TTATTTTATT ATCGTATACA
ACTACCATTC CAAGCTTAAT TATTGCAGCA GCATGCTACG GAAGTGGTTT CGGAGCCATT
CAACCTGCAC TACAAGCGTG GATGATCGAC CGAGTAGCAC CGCACCGACG AGGAGTCGCA
ACAGCTACAT TCTTCTCAGC ATTTGACCTT GGCATTGGTG CTGGCGCGAT TATTTTTGGA
TTCATTGCAC ATTTTACAAA CTACGCAACT GTATATCGTT ATTCCTCTCT ACTACTTATT
GCTTTCCTGT TCATTTACAT TACAAGTGTA AAAAAACAAA AGCATGGTGA TAAAAATACG
GAAAAAGCTG CTGGATAA
 
Protein sequence
MYTPLSSSYC KPFQIYVVKK YISPSHSLLF ESAFVKEKTI SIIHNNYQSI HLYFTSFFHS 
CVVFPQICQF FTSKIVVRFQ GTFLDCYIGQ YIYSKEKRNF YIGDEYMGEA ILVKREPLWT
KEFVALILAN LCMFLGFQML IPTLPVYVKE IGGTSSNIGF VVGMFTVAAL FVRPLTGNAL
QKFSKKIILM IGTAICLLAM GSYLFASTVF LLLAVRILHG AGFGITTTTY GTVVSDLIPQ
ARRGEGMGYF GLSGTIAMAL GPLIGLWLMQ TYNFTILFLC ALSCTIVSLI LTKLLQIKKS
PQPPKQTSGT FLDGFIERKA LLPSLLILCI TLMYGGIGSF ITLFATEVGI ADISLFLFNA
LAIAVTRPFS GKLYDAKGHS FVIIPGVIIT FAGIILLSYT TTIPSLIIAA ACYGSGFGAI
QPALQAWMID RVAPHRRGVA TATFFSAFDL GIGAGAIIFG FIAHFTNYAT VYRYSSLLLI
AFLFIYITSV KKQKHGDKNT EKAAG