Gene ECH74115_0511 details


Gene Information

Locus tag: ECH74115_0511
Symbol:
ID: 6970669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 514941
End bp: 516305
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 53%
IMG OID: 643384559
Product: transporter, major facilitator family
Protein accession: YP_002269073
Protein GI: 209400276
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
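
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 514941
END_BP = 516305
GENE_LENGTH_BP = 1365
PROTEIN_LENGTH_AA = 454


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed interval [start_bp, end_bp] (assumed 1-based, inclusive)."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 516305 - 514941 + 1 gives 1365 bp, and 1365 / 3 - 1 gives 454 aa, which agrees with the reported gene and protein lengths.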


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.905004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGATT ATAAAATGAC GCCAGGTGAG CGGCGCGCGA CCTGGGGTTT AGGGACCGTA 
TTCTCGTTGC GCATGCTGGG CATGTTCATG GTTCTGCCGG TTCTGACCAC GTACGGCATG
GCTCTGCAAG GTGCCAGCGA AGCATTAATC GGTATTGCCA TTGGTATTTA TGGTCTGACT
CAGGCCGTTT TTCAGATTCC GTTTGGCCTG CTTTCCGACC GCATTGGTCG CAAACCATTA
ATTGTCGGTG GGCTGGCAGT GTTTGCCGCC GGTAGCGTTA TCGCTGCGCT CTCTGACTCC
ATCTGGGGAA TTATTCTGGG TCGGGCGCTA CAAGGCTCCG GTGCGATTGC CGCTGCCGTT
ATGGCGCTGC TTTCCGATCT CACGCGCGAA CAAAACCGCA CCAAAGCAAT GGCGTTTATC
GGCGTGAGCT TTGGCATTAC CTTTGCCATT GCGATGGTGC TTGGCCCGAT CATCACTCAC
AAACTTGGGC TGCACGCGCT GTTCTGGATG ATCGCTATTC TGGCAACGAC CGGCATTGCG
TTGACCATTT GGGTTGTGCC CAACAGTAGC ACTCACGTAC TTAATCGTGA GTCCGGAATG
GTGAAAGGCA GTTTCAGTAA GGTGCTGGCG GAACCGCGGC TGCTGAAACT CAACTTTGGC
ATTATGTGTC TGCATATTTT GCTGATGTCG ACGTTTGTTG CCCTGCCCGG ACAACTGGCT
GATGCGGGGT TCCCGGCGGC TGAACACTGG AAGGTCTATC TGGCGACAAT GCTAATCGCC
TTTGGCTCGG TCGTGCCTTT CATTATCTAC GCTGAAGTTA AGCGCAAAAT GAAGCAAGTC
TTTGTCTTCT GCGTCGGGTT GATCGTGGTT GCGGAAATTG TGTTGAGGAA CGCGCAAACG
CAGTTCTGGC AACTGGTGGT CGGCGTGCAG CTTTTCTTTG TGGCGTTTAA TTTGATGGAA
GCCCTCCTGC CCTCACTTAT CAGTAAAGAG TCGCCAGCAG GTTACAAAGG TACGGCGATG
GGTGTTTACT CCACCAGCCA GTTTCTTGGC GTGGCGATTG GCGGTTCGCT GGGCGGCTGG
ATTGACGGCA TGTTTGACGG TCAGGGGGTA TTTCTCGCTG GCGCAATGCT GGCCGCAGTG
TGGCTGGCAG TCGCCAGTAC CATGAAAGAA CCGCCGTATG TCAGCAGTTT GCGCATTGAA
ATCCCGGCGA ACATTGCCGC AAACGAGGCG TTAAAAGTGC GTTTGCTGGA AACTGAAGGC
ATCAAAGAAG TGTTGATTGC AGAAGAAGAA CATTCAGCTT ATGTGAAAAT CGACAGCAAA
GTGACGAATC GCTTTGATGT TGAACAGGCA ATTCGCCAGG CATAA
 
Protein sequence
MNDYKMTPGE RRATWGLGTV FSLRMLGMFM VLPVLTTYGM ALQGASEALI GIAIGIYGLT 
QAVFQIPFGL LSDRIGRKPL IVGGLAVFAA GSVIAALSDS IWGIILGRAL QGSGAIAAAV
MALLSDLTRE QNRTKAMAFI GVSFGITFAI AMVLGPIITH KLGLHALFWM IAILATTGIA
LTIWVVPNSS THVLNRESGM VKGSFSKVLA EPRLLKLNFG IMCLHILLMS TFVALPGQLA
DAGFPAAEHW KVYLATMLIA FGSVVPFIIY AEVKRKMKQV FVFCVGLIVV AEIVLRNAQT
QFWQLVVGVQ LFFVAFNLME ALLPSLISKE SPAGYKGTAM GVYSTSQFLG VAIGGSLGGW
IDGMFDGQGV FLAGAMLAAV WLAVASTMKE PPYVSSLRIE IPANIAANEA LKVRLLETEG
IKEVLIAEEE HSAYVKIDSK VTNRFDVEQA IRQA
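
The gene and protein blocks above are printed in groups of ten characters, so they need whitespace stripped before use. The sketch below shows one way to clean a block, translate it, and compute GC fraction using only the Python standard library. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, not in codon meanings); the helper names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those of the standard code, assumed here to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First and last lines of the gene sequence, copied from above.
    first = clean("ATGAACGATT ATAAAATGAC GCCAGGTGAG CGGCGCGCGA CCTGGGGTTT AGGGACCGTA")
    last = clean("GTGACGAATC GCTTTGATGT TGAACAGGCA ATTCGCCAGG CATAA")
    print(translate(first))
    print(translate(last))
```

Applied to the first and last lines of the gene sequence, this yields the first 20 and last 14 residues of the listed protein sequence. Passing the full cleaned gene sequence to gc_fraction gives a value to compare with the reported GC content.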