Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4793 |
Symbol | |
ID | 6971648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4432843 |
End bp | 4434093 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643388489 |
Product | major facilitator superfamily transporter |
Protein accession | YP_002272917 |
Protein GI | 209399143 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACACT GTTGTAAAAA TGTGGTGATC CTCATGCCCG AACCCGTAGC CGAACCCGCG CTAAACGGAT TGCGCCTGAA TTTGCGCATT CTCTCCATTG TCATGTTTAA CTTCGCCAGC TACCTCACCA TCGGGTTGCC GCTCGCTGTA TTACCGGGCT ATGTCCATGA TGTGATGGGC TTTAGCGCTT TCTGGGCAGG ATTGGTTATC AGCCTGCAAT ATTTCGCCAC CTTGCTGAGC CGTCCTCATG CCGGACGTTA CGCCGATTTG CTGGGCCCCA AAAAGATTGT CGTCTTCGGT TTATGCGGCT GCTTTTTGAG CGGTCTGGGA TATCTGACGG CAGGATTAAC CGCCAGTCTG CCCGTCATCA GCCTGTTATT ACTTTGCCTG GGACGCGTCA TCCTTGGGAT TGGGCAAAGT TTTGCCGGAA CGGGATCGAC CCTGTGGGGT GTTGGCGTGG TTGGCTCGCT GCATATCGGG CGGGTGATTT CGTGGAACGG CATTGTCACT TACGGGGCGA TGGCGATGGG TGCGCCGTTA GGCGTCGTGT TTTATCACTG GGGCGGCTTG CAGGCGTTAG CGTTAATTAT TATGGGCGTG GCGCTGGTGG CCATTTTGTT GGCGATCCCG CGTCCGACGG TAAAAGCCAG TAAAGGCAAA CCGCTGCCGT TTCGCGCGGT GCTTGGGCGC GTCTGGCTGT ACGGTATGGC GCTGGCACTG GCTTCCGCCG GATTTGGCGT CATCGCCACC TTTATCACGC TGTTTTATGA CGTTAAAGGT TGGGACGGTG CGGCTTTCGC GCTGACGCTG TTTAGCTGTG CGTTTGTCGG TACGCGTTTG TTATTCCCTA ACGGCATTAA CCGTATCGGC GGCTTAAACG TAGCGATGAT TTGCTTTAGC GTTGAGATAA TCGGCCTGCT ACTGGTTGGC GTGGCGACTA TGCCGTGGAT GGCGAAAATC GGCGTCTTAC TGGCGGGGGC CGGGTTTTCG CTGGTGTTCC CGGCATTGGG TGTAGTGGCG GTAAAAGCGG TTCCGCAGCA AAATCAGGGG GCGGCGCTGG CAACCTACAC CGTATTTATG GATTTATCGC TTGGCGTGAC CGGACCACTG GCTGGGCTGG TGATGAGTTG GGCGGGCGTA CCGGTGATTT ATCTGGCGGC GGCGGGACTG GTCGCAATCG CGTTATTACT GACGTGGCGA TTAAAAAAAC GGCCTCCGGA ACACGTCCCT GAGGCCGCCT CATCATCTTA A
|
Protein sequence | MKHCCKNVVI LMPEPVAEPA LNGLRLNLRI LSIVMFNFAS YLTIGLPLAV LPGYVHDVMG FSAFWAGLVI SLQYFATLLS RPHAGRYADL LGPKKIVVFG LCGCFLSGLG YLTAGLTASL PVISLLLLCL GRVILGIGQS FAGTGSTLWG VGVVGSLHIG RVISWNGIVT YGAMAMGAPL GVVFYHWGGL QALALIIMGV ALVAILLAIP RPTVKASKGK PLPFRAVLGR VWLYGMALAL ASAGFGVIAT FITLFYDVKG WDGAAFALTL FSCAFVGTRL LFPNGINRIG GLNVAMICFS VEIIGLLLVG VATMPWMAKI GVLLAGAGFS LVFPALGVVA VKAVPQQNQG AALATYTVFM DLSLGVTGPL AGLVMSWAGV PVIYLAAAGL VAIALLLTWR LKKRPPEHVP EAASSS
|
| |