Gene ECH74115_3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3271 
Symbol 
ID6970966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3004456 
End bp3005814 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content57% 
IMG OID643387084 
Producttransporter, major facilitator family 
Protein accessionYP_002271548 
Protein GI209398712 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00895] benzoate transport 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.557085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAC GACGTGATTT ACAAGCACTT ATCGATGCTG CTCCGGTCGG CAAGATGCAG 
TGGCGCGTTA TCATCTGCTG TTTTCTGGTG GTAATGCTCG ATGGTTTTGA CACCGCGGCT
ATCGGATTTA TCGCCCCTGA TATTCGCACT CACTGGCAAC TCACCGCAGG CGATCTCGCC
CCGCTTTTTG GCGCGGGTCT GCTAGGATTA ACCGCAGGCG CACTGCTTTG TGGCCCGTTG
TCCGATCGCT TTGGTCGCAA GCGGGTGATT GAGCTGTGCG TCTTTTTGTT TGGCGCGTTA
AGCCTCGCCT CGGCTTTCTC ACCGGATCTG CAAACGCTGG TCTTCCTGCG CTTTCTCACC
GGCCTGGGTC TGGGCGGCGC GATGCCGAAC ACCATCACCA TGACCTCGGA GTATCTGCCT
GCCCGCCGCC GCGGCGCGCT GGTGACGCTG ATGTTTTGCG GCTTTACGCT GGGTTCGGCG
TTTGGCGGGA TTGTCAGTGC GCAGCTGGTG CCGGTGATTG GCTGGCACGG CATTCTGGTG
TTGGGGGGCG TGTTACCGCT GATGCTGTTT GTCGCACTGC TGGTAGTTCT GCCGGAATCG
CCACGCTGGC AGGTGCGCCG CCAGCTACCG CAGGCGGTGA TTGCGAAAAC CGTCAGCGCC
ATCACCCGCG AGCGTTATGT CGATACCCAC TTTTATCTCA TTGAATCAGC CTCCGTTACA
AAAGGCAGCA TTCGTCAACT GTTTATGGGA CGTCAGTTAC CCATCACGTT GATGCTGTGG
GTAGTGTTCT TTATGAGCCT GCTGATTATT TACCTGCTCT CAAGCTGGAT GCCGACGCTA
CTGAACCATC GCGGCATCGA CCTGCAACAC GCGTCCAGGG TCACCGCCGC GTTCCAGATT
GGTGGTACGT TAGGCGCGCT GGCGCTGGGT GTGCTGATGG ATAAGTTCAA TCCTTTCCGC
GTGCTGACGC TGAGCTATGC CATAGGCGCT ATATGCATCG TGATGATTGG CTTAAGCCAG
GATGGATTAT GGTTGATGGC GCTGGCGATT TTCGGCACTG GCATTGGCAT TAGCGGATCG
CAGGTGGGGC TGAATGCGCT GACCGCAACA CTTTATCCAA CACAAAGTCG AGCCACGGGC
GTGAGCTGGT CAAATGCTAT CGGCCGCTGC GGCGCGATTG TTGGTTCGCT GTCTGGCGGC
GTGATGATGG CGATGAATTT CTCTTTCGAC ACCCTGTTTT TCATTATCGC TGTGCCTGCG
GCGATCAGTG CGGTGATGTT AACCCTGCTG ATTACCGTGG TTCGCCAGTC GACCTCTGTC
CCTGACTCAC TGCCTCGCGC AGGCGTTGTG AACGAATAA
 
Protein sequence
MTQRRDLQAL IDAAPVGKMQ WRVIICCFLV VMLDGFDTAA IGFIAPDIRT HWQLTAGDLA 
PLFGAGLLGL TAGALLCGPL SDRFGRKRVI ELCVFLFGAL SLASAFSPDL QTLVFLRFLT
GLGLGGAMPN TITMTSEYLP ARRRGALVTL MFCGFTLGSA FGGIVSAQLV PVIGWHGILV
LGGVLPLMLF VALLVVLPES PRWQVRRQLP QAVIAKTVSA ITRERYVDTH FYLIESASVT
KGSIRQLFMG RQLPITLMLW VVFFMSLLII YLLSSWMPTL LNHRGIDLQH ASRVTAAFQI
GGTLGALALG VLMDKFNPFR VLTLSYAIGA ICIVMIGLSQ DGLWLMALAI FGTGIGISGS
QVGLNALTAT LYPTQSRATG VSWSNAIGRC GAIVGSLSGG VMMAMNFSFD TLFFIIAVPA
AISAVMLTLL ITVVRQSTSV PDSLPRAGVV NE