Gene SbBS512_E0349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0349 
Symbol 
ID6271181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp339351 
End bp340715 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content52% 
IMG OID641724587 
Producttransporter, major facilitator family 
Protein accessionYP_001879137 
Protein GI187730885 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGATT ATAAAATGAC GCCAGGTGAG AGGCGCGCGA CCTGGGGTTT AGGGACCGTA 
TTCTCGTTGC GCATGCTGGG CATGTTTATG GTTCTGCCGG TTCTGACCAC GTATGGCATG
GCTCTGCAAG GTGCCAGCGA AGCATTAATC GGTATTGCCA TTGGTATTTA TGGTCTGACT
CAGGCCGTTT TTCAGATTCC GTTTGGCCTG CTTTCCGACC GTATTGGTCG CAAACCATTA
ATTGTCGGTG GGCTGGCGGT GTTTGCCGCC GGTAGCGTTA TCGCTGCGCT CTCTGACTCC
ATCTGGGGAA TTATTCTGGG CCGGGCGCTA CAAGGCTCCG GTGCGATTGC CGCTGCCGTT
ATGGCGCTGC TTTCCGATCT CACGCGCGAA CAAAACCGCA CCAAAGCAAT GGCGTTTATC
GGCGTGAGCT TTGGCATTAC CTTTGCCATT GCGATGGTGC TTGGCCCGAT CATCACTCAC
AAACTTGGGC TGCACGCGCT GTTCTGGATG ATCGCTATTC TGGCAACGAC CGGCATTGCG
TTGACCATTT GGGTTGTGCC CAACAGTAGC ACTCACGTAC TTAATCGTGA GTCCGGAATG
GTGAAAGGCA GTTTCAGTAA AGTGCTGGCG GAACCGCGGC TGCTGAAACT CAACTTTGGC
ATTATGTGTC TGCATATTTT GCTGATGTCG ACGTTTGTTG CCCTGCCCGG ACAACTGGCT
GATGCAGGGT TCCCGGCGGC TGAACACTGG AAGGTCTATC TGGCGACAAT GCTAATCGCC
TTTGGCTCGG TCGTGCCTTT CATTATCTAC GCTGAAGTTA AGCGCAAAAT GAAGCAAGTC
TTTGTCTTCT GCGTCGGGTT GATCGTGGTT GCGGAAATTG TGTTGTGGAA CGCGCAAACG
CAGTTCTGGC AACTGGTGGT CGGCGTGCAG CTTTTCTTTG TAGCGTTTAA TTTGATGGAA
GCCCTTCTGC CTTCACTTAT CAGTAAAGAG TCGCCAGCAG GTTACAAAGG TACAGCGATG
GGTGTTTACT CCACCAGCCA GTTTCTTGGC GTGGCGATTG GCGGTTCACT GGGCGGCTGG
ATTGACGGCA TGTTTGACGG TCAGGGCGTA TTTCTCGCTG GCGCAATGCT GGCCGCAGTG
TGGCTGGCAG TCGCCAGTAC CATGAAAGAA CCGGCGTATG TCAGCAGTTT GCGCATTGAA
ATCCCGGCGA ACATTGCCGC AAACGAGGCG TTAAAAGTGC GTTTGCTAGA AACTGAAGGC
ATCAAAGAAG TGTTGATTGC AGAAGAAGAA CATTCAGCTT ATGTGAAAAT CGACAGCAAA
GTGACGAATC GCTTTGATGT TGAACAGGCA ATTCGCCAGG CATAA
 
Protein sequence
MNDYKMTPGE RRATWGLGTV FSLRMLGMFM VLPVLTTYGM ALQGASEALI GIAIGIYGLT 
QAVFQIPFGL LSDRIGRKPL IVGGLAVFAA GSVIAALSDS IWGIILGRAL QGSGAIAAAV
MALLSDLTRE QNRTKAMAFI GVSFGITFAI AMVLGPIITH KLGLHALFWM IAILATTGIA
LTIWVVPNSS THVLNRESGM VKGSFSKVLA EPRLLKLNFG IMCLHILLMS TFVALPGQLA
DAGFPAAEHW KVYLATMLIA FGSVVPFIIY AEVKRKMKQV FVFCVGLIVV AEIVLWNAQT
QFWQLVVGVQ LFFVAFNLME ALLPSLISKE SPAGYKGTAM GVYSTSQFLG VAIGGSLGGW
IDGMFDGQGV FLAGAMLAAV WLAVASTMKE PAYVSSLRIE IPANIAANEA LKVRLLETEG
IKEVLIAEEE HSAYVKIDSK VTNRFDVEQA IRQA