Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0511 |
Symbol | |
ID | 6970669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 514941 |
End bp | 516305 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643384559 |
Product | transporter, major facilitator family |
Protein accession | YP_002269073 |
Protein GI | 209400276 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.905004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATT ATAAAATGAC GCCAGGTGAG CGGCGCGCGA CCTGGGGTTT AGGGACCGTA TTCTCGTTGC GCATGCTGGG CATGTTCATG GTTCTGCCGG TTCTGACCAC GTACGGCATG GCTCTGCAAG GTGCCAGCGA AGCATTAATC GGTATTGCCA TTGGTATTTA TGGTCTGACT CAGGCCGTTT TTCAGATTCC GTTTGGCCTG CTTTCCGACC GCATTGGTCG CAAACCATTA ATTGTCGGTG GGCTGGCAGT GTTTGCCGCC GGTAGCGTTA TCGCTGCGCT CTCTGACTCC ATCTGGGGAA TTATTCTGGG TCGGGCGCTA CAAGGCTCCG GTGCGATTGC CGCTGCCGTT ATGGCGCTGC TTTCCGATCT CACGCGCGAA CAAAACCGCA CCAAAGCAAT GGCGTTTATC GGCGTGAGCT TTGGCATTAC CTTTGCCATT GCGATGGTGC TTGGCCCGAT CATCACTCAC AAACTTGGGC TGCACGCGCT GTTCTGGATG ATCGCTATTC TGGCAACGAC CGGCATTGCG TTGACCATTT GGGTTGTGCC CAACAGTAGC ACTCACGTAC TTAATCGTGA GTCCGGAATG GTGAAAGGCA GTTTCAGTAA GGTGCTGGCG GAACCGCGGC TGCTGAAACT CAACTTTGGC ATTATGTGTC TGCATATTTT GCTGATGTCG ACGTTTGTTG CCCTGCCCGG ACAACTGGCT GATGCGGGGT TCCCGGCGGC TGAACACTGG AAGGTCTATC TGGCGACAAT GCTAATCGCC TTTGGCTCGG TCGTGCCTTT CATTATCTAC GCTGAAGTTA AGCGCAAAAT GAAGCAAGTC TTTGTCTTCT GCGTCGGGTT GATCGTGGTT GCGGAAATTG TGTTGAGGAA CGCGCAAACG CAGTTCTGGC AACTGGTGGT CGGCGTGCAG CTTTTCTTTG TGGCGTTTAA TTTGATGGAA GCCCTCCTGC CCTCACTTAT CAGTAAAGAG TCGCCAGCAG GTTACAAAGG TACGGCGATG GGTGTTTACT CCACCAGCCA GTTTCTTGGC GTGGCGATTG GCGGTTCGCT GGGCGGCTGG ATTGACGGCA TGTTTGACGG TCAGGGGGTA TTTCTCGCTG GCGCAATGCT GGCCGCAGTG TGGCTGGCAG TCGCCAGTAC CATGAAAGAA CCGCCGTATG TCAGCAGTTT GCGCATTGAA ATCCCGGCGA ACATTGCCGC AAACGAGGCG TTAAAAGTGC GTTTGCTGGA AACTGAAGGC ATCAAAGAAG TGTTGATTGC AGAAGAAGAA CATTCAGCTT ATGTGAAAAT CGACAGCAAA GTGACGAATC GCTTTGATGT TGAACAGGCA ATTCGCCAGG CATAA
|
Protein sequence | MNDYKMTPGE RRATWGLGTV FSLRMLGMFM VLPVLTTYGM ALQGASEALI GIAIGIYGLT QAVFQIPFGL LSDRIGRKPL IVGGLAVFAA GSVIAALSDS IWGIILGRAL QGSGAIAAAV MALLSDLTRE QNRTKAMAFI GVSFGITFAI AMVLGPIITH KLGLHALFWM IAILATTGIA LTIWVVPNSS THVLNRESGM VKGSFSKVLA EPRLLKLNFG IMCLHILLMS TFVALPGQLA DAGFPAAEHW KVYLATMLIA FGSVVPFIIY AEVKRKMKQV FVFCVGLIVV AEIVLRNAQT QFWQLVVGVQ LFFVAFNLME ALLPSLISKE SPAGYKGTAM GVYSTSQFLG VAIGGSLGGW IDGMFDGQGV FLAGAMLAAV WLAVASTMKE PPYVSSLRIE IPANIAANEA LKVRLLETEG IKEVLIAEEE HSAYVKIDSK VTNRFDVEQA IRQA
|
| |