Gene SbBS512_E3849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3849 
Symbol 
ID6270304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3575612 
End bp3576871 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content56% 
IMG OID641727705 
Productmajor facilitator superfamily transporter 
Protein accessionYP_001882140 
Protein GI187730432 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTAAAAA TGAAACACTG TTGTAAAAAT GTGGTGATCC TCATGCCCGA ACCCGTAGCC 
GAACCCGCGC TAAACGGATT GCGCCTGAAT TTGCGCATTG TCTCCATTGT CATGTTTAAC
TTCGCCAGCT ACCTCACCAT CGGGTTGCCG CTCGCTGTAT TACCGGGCTA TGTCCATGAT
GTGATGGGCT TTAGCGCTTT CTGGGCAGGA TTGGTTATCA GCCTGCAATA TTTCGCCACC
TTGCTGAGCC GTCCTCATGC CGGACGTTAC GCCGATTTGC TGGGCCCCAA AAAGATTGTC
GTCTTCGGTT TATGCGGCTG CTTTTTGAGC GGTCTGGGAT ATCTGACGGC AGGATTAACC
GCCAGTCTGC CCGTCATCAG CCTGTTATTA CTTTGCCTGG GACGCGTGAT CCTTGGGATT
GGGCAAAGTT TTGCCGGAAC GGGATCGACC CTGTGGGGCG TTGGCGTGGT TGGCTCGCTG
CATATCGGGC GAGTGATTTC GTGGAACGGC ATTGTCACTT ACGGGGCGAT GGCGATGGGT
GCGCCGTTAG GCGTCGTGTT TTATCACTGG GGCGGCTTGC AGGCGTTAGC GTTAATCATT
ATGGGCGTGG CGCTGGTGGC CATTTTGTTG GCGATCCCGC GTCCGACGGT AAAAGCCAGT
AAAGGCAAAC CGCTGCCGTT TCGCGCGGTG CTTGGGCGCG TCTGGCTGTA CGGTATGGCA
CTGGCACTGG CTTCTGCCGG ATTTGGCGTC ATCGCCACCT TTATCACGCT GTTTTATGAC
GCTAAAAGTT GGGACGGTGC GGCTTTCGCG CTGACGCTGT TTAGCTGTGC GTTTGTCGGT
ACGCGTTTGT TATTCCCTAA CGGCATTAAC CGTATCGGCG GCTTAAACGT AGCGATGATT
TGCTTTAGCG TTGAGATAAT CGGCCTGCTA CTGGTTGGCG TGGCGACTAT GCCGTGGATG
GCGAAAATCG GCGTCTTACT GGCGGGGGCA GGGTTTTCGC TGGTGTTCCC GGCATTAGGT
GTAGTGGCGG TAAAAGCGGT TCCGCAGCAA AATCAGGGGG CGGCGCTGGC AACTTACACC
GTATTTATGG ATTTATCGCT TGGCGTGACC GGACCACTGG CTGGGCTGGT GATGAGCTGG
GCGGGCGTAC CGGTGATTTA TCTGGCGGCG GCGGGACTGG TCGCAATCGC GTTATTACTG
ACGTGGCGAT TAAAAAAACG GCCTCCGGAA CACGTCCCTG AGGCCGCCTC ATCATCTTAA
 
Protein sequence
MVKMKHCCKN VVILMPEPVA EPALNGLRLN LRIVSIVMFN FASYLTIGLP LAVLPGYVHD 
VMGFSAFWAG LVISLQYFAT LLSRPHAGRY ADLLGPKKIV VFGLCGCFLS GLGYLTAGLT
ASLPVISLLL LCLGRVILGI GQSFAGTGST LWGVGVVGSL HIGRVISWNG IVTYGAMAMG
APLGVVFYHW GGLQALALII MGVALVAILL AIPRPTVKAS KGKPLPFRAV LGRVWLYGMA
LALASAGFGV IATFITLFYD AKSWDGAAFA LTLFSCAFVG TRLLFPNGIN RIGGLNVAMI
CFSVEIIGLL LVGVATMPWM AKIGVLLAGA GFSLVFPALG VVAVKAVPQQ NQGAALATYT
VFMDLSLGVT GPLAGLVMSW AGVPVIYLAA AGLVAIALLL TWRLKKRPPE HVPEAASSS