Gene ECH74115_4793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4793 
Symbol 
ID6971648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4432843 
End bp4434093 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content56% 
IMG OID643388489 
Productmajor facilitator superfamily transporter 
Protein accessionYP_002272917 
Protein GI209399143 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACT GTTGTAAAAA TGTGGTGATC CTCATGCCCG AACCCGTAGC CGAACCCGCG 
CTAAACGGAT TGCGCCTGAA TTTGCGCATT CTCTCCATTG TCATGTTTAA CTTCGCCAGC
TACCTCACCA TCGGGTTGCC GCTCGCTGTA TTACCGGGCT ATGTCCATGA TGTGATGGGC
TTTAGCGCTT TCTGGGCAGG ATTGGTTATC AGCCTGCAAT ATTTCGCCAC CTTGCTGAGC
CGTCCTCATG CCGGACGTTA CGCCGATTTG CTGGGCCCCA AAAAGATTGT CGTCTTCGGT
TTATGCGGCT GCTTTTTGAG CGGTCTGGGA TATCTGACGG CAGGATTAAC CGCCAGTCTG
CCCGTCATCA GCCTGTTATT ACTTTGCCTG GGACGCGTCA TCCTTGGGAT TGGGCAAAGT
TTTGCCGGAA CGGGATCGAC CCTGTGGGGT GTTGGCGTGG TTGGCTCGCT GCATATCGGG
CGGGTGATTT CGTGGAACGG CATTGTCACT TACGGGGCGA TGGCGATGGG TGCGCCGTTA
GGCGTCGTGT TTTATCACTG GGGCGGCTTG CAGGCGTTAG CGTTAATTAT TATGGGCGTG
GCGCTGGTGG CCATTTTGTT GGCGATCCCG CGTCCGACGG TAAAAGCCAG TAAAGGCAAA
CCGCTGCCGT TTCGCGCGGT GCTTGGGCGC GTCTGGCTGT ACGGTATGGC GCTGGCACTG
GCTTCCGCCG GATTTGGCGT CATCGCCACC TTTATCACGC TGTTTTATGA CGTTAAAGGT
TGGGACGGTG CGGCTTTCGC GCTGACGCTG TTTAGCTGTG CGTTTGTCGG TACGCGTTTG
TTATTCCCTA ACGGCATTAA CCGTATCGGC GGCTTAAACG TAGCGATGAT TTGCTTTAGC
GTTGAGATAA TCGGCCTGCT ACTGGTTGGC GTGGCGACTA TGCCGTGGAT GGCGAAAATC
GGCGTCTTAC TGGCGGGGGC CGGGTTTTCG CTGGTGTTCC CGGCATTGGG TGTAGTGGCG
GTAAAAGCGG TTCCGCAGCA AAATCAGGGG GCGGCGCTGG CAACCTACAC CGTATTTATG
GATTTATCGC TTGGCGTGAC CGGACCACTG GCTGGGCTGG TGATGAGTTG GGCGGGCGTA
CCGGTGATTT ATCTGGCGGC GGCGGGACTG GTCGCAATCG CGTTATTACT GACGTGGCGA
TTAAAAAAAC GGCCTCCGGA ACACGTCCCT GAGGCCGCCT CATCATCTTA A
 
Protein sequence
MKHCCKNVVI LMPEPVAEPA LNGLRLNLRI LSIVMFNFAS YLTIGLPLAV LPGYVHDVMG 
FSAFWAGLVI SLQYFATLLS RPHAGRYADL LGPKKIVVFG LCGCFLSGLG YLTAGLTASL
PVISLLLLCL GRVILGIGQS FAGTGSTLWG VGVVGSLHIG RVISWNGIVT YGAMAMGAPL
GVVFYHWGGL QALALIIMGV ALVAILLAIP RPTVKASKGK PLPFRAVLGR VWLYGMALAL
ASAGFGVIAT FITLFYDVKG WDGAAFALTL FSCAFVGTRL LFPNGINRIG GLNVAMICFS
VEIIGLLLVG VATMPWMAKI GVLLAGAGFS LVFPALGVVA VKAVPQQNQG AALATYTVFM
DLSLGVTGPL AGLVMSWAGV PVIYLAAAGL VAIALLLTWR LKKRPPEHVP EAASSS