Gene Information |
Locus tag | ECH74115_5425 |
Symbol | |
ID | 6968680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5072079 |
End bp | 5073395 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643389077 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002273486 |
Protein GI | 209396272 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.460942 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.126626 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGA TCGTAAATGA AACCGTGAGT GTGAAGAAAA AGCCCAGCGG CAGGCGGGTG ATTTTTGCCT CGGCGTTTGG CAATGCGCTC GAATTTTTTG ATTTTGGCGT TTACAACTTT TTTGTGGTCT ATATCAGCAC GCTATTTTTT CCGCCCAGTG CAGATAAAAA CGTCGCGCTA CTGCTGGCGT TTGCCACTTT CGGCGTGAGC TTTTTTATGC GCCCGCTCGG CGGCATTATT GTGGGGGCCT GGGCCGATCG TTTCGGACGC AAACCCGCGA TGGTGTTCAC CATTGCGCTG ATGAGTCTTG GTACACTGAT GATCGGCATC GCACCGACGT ATGAAACGGC GGGCTATTGG GGAACCGCGA CGCTGGTGCT AGCGCGTCTA ATTCAGGGCG TTGCGGCAGG AGGCGAAGTC GGTGCCTCAA TGTCTTTGTT GGTTGAATCA GCTCCGGCAA ATCGTCGCGG TTTTTACAGC AGTTGGTCAC TGGCGACTCA AGGCCTGGCA ACAACTTTTG GAGGTGTTGT CGCTTTAGGT TTAAGTGCAT GGCTGCCCTT TGCGACCGGT TCAGAAACCG TCATGGCTGA ATGGGGCTGG CGTGTGCCGT TTTTTATTGG CGTGTTATTA GCGCCTGTCG GCTGTTGGTT GCGTTTGAGT CTGGAAAATG ATGTTCCTGA GCCAGCACAT AATAAGAAGG CCGCAGCCAG CGAAAGCGCC TTCTCCTTAC TTATGCAACA TAAAGCGACT ATCGCTAACG GTATCTTGCT CGCTATTGGT AGTACAGTCG CAACCTATAT CTCCCTCTTT TATTACGGTA CATGGGCGGC GAAGTATTTA GGAATGAACC AAAACTATTC TCATGCAGCG ATGTTACTGG CGGGCGTTAT TACGTTTGTG GGTGCGCTGC TGGTGGGGAT GTTGTGCGAT TCGGTCGGGC GTAAAAAGCT GATTTTAATC TCCCGTGTGA TGGTGCTGAT CTGTAGCTGG CCGTCATTCT GGCTGTTGGT GAATTACCCA AGCCCAGGCA TGTTGCTGAC GGTGGTTTTC GTGATGGTCA GCTTTACCAC GCTTGGCGGT GTGCCAGTGA TGTTGTTGAT CTCTGAACTG TTGCCGAAAC GGATTCGGGC ATTGGGTTTT GCGCTGGTTT ATAGCATTGG TGTGGCAATC TTTGGCGGTT TTGCTCAATA TTTCGCTACG CAATCCATCG TGTTGCTGGA TAGTTTGACG GCTCCGGCGT GGTATCTGGG TGGAGGCACG TTGCTGTCGA TGTTGGCGTT GTTGTACGTA AAAGAACCGG CCAAAGAATT GCAGTAA
|
Protein sequence | MTAIVNETVS VKKKPSGRRV IFASAFGNAL EFFDFGVYNF FVVYISTLFF PPSADKNVAL LLAFATFGVS FFMRPLGGII VGAWADRFGR KPAMVFTIAL MSLGTLMIGI APTYETAGYW GTATLVLARL IQGVAAGGEV GASMSLLVES APANRRGFYS SWSLATQGLA TTFGGVVALG LSAWLPFATG SETVMAEWGW RVPFFIGVLL APVGCWLRLS LENDVPEPAH NKKAAASESA FSLLMQHKAT IANGILLAIG STVATYISLF YYGTWAAKYL GMNQNYSHAA MLLAGVITFV GALLVGMLCD SVGRKKLILI SRVMVLICSW PSFWLLVNYP SPGMLLTVVF VMVSFTTLGG VPVMLLISEL LPKRIRALGF ALVYSIGVAI FGGFAQYFAT QSIVLLDSLT APAWYLGGGT LLSMLALLYV KEPAKELQ
|
| |
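The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5072079
END_BP = 5073395


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, ASSUMING an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1317 438"
```

Under these assumptions the computed values match the Gene Length (1317 bp) and Protein Length (438 aa) fields. The Gene sequence field, pasted as a string with its 10-base blocks intact, can be passed to `gc_percent` to compare against the reported GC content.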