Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2369 |
Symbol | |
ID | 6970544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2242127 |
End bp | 2243296 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386242 |
Product | transporter, major facilitator family |
Protein accession | YP_002270726 |
Protein GI | 209397024 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000585363 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.236969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTA ACTATCCGTT GCTGGCGCTG GCGATTGGCG CGTTTGGTAT CGGGACAACG GAGTTCTCGC CAATGGGCTT GTTGCCCGTC ATTGCGCGCG GTGTGGATGT CTCGATTCCC GCTGCCGGAA TGTTAATCAG TGCCTATGCA GTTGGCGTAA TGGTTGGCGC GCCGCTGATG ACGCTTCTAC TTTCTCATCG TGCCCGCCGC AGTGCGTTGA TTTTCCTGAT GGCAATTTTC ACGCTCGGCA ACGTACTTTC CGCCATCGCG CCGGATTATA TGACCCTGAT GCTTTCACGC ATTTTGACCA GCCTGAATCA CGGAGCATTT TTTGGTTTGG GTTCAGTCGT GGCCGCAAGC GTGGTGCCAA AACATAAACA GGCCAGCGCA GTTGCCACTA TGTTTATGGG GTTAACCCTG GCAAATATCG GTGGCGTGCC GGCGGCGACC TGGTTGGGTG AAACCATCGG CTGGCGGATG TCATTTCTGG CAACGGCGGG GCTGGGAGTG ATTTCAATGG TAAGTCTGTT CTTCTCATTA CCTAAAGGTG GTGCAGGGGC ACGACCTGAA GTGAAAAAAG AGCTGGCGGT ATTAATGCGT CCGCAGGTGC TGTCTGCATT GCTGACGACG GTACTGGGAG CTGGTGCAAT GTTTACTCTC TACACCTATA TCTCTCCGGT ACTGCAAAGT ATTACCCACG CAACACCGGT GTTCGTCACG GCAATGCTGG TGCTGATTGG TGTCGGATTC TCTATCGGTA ACTATCTCGG CGGCAAACTG GCAGATCGTT CAGTTAACGG CACGTTGAAA GGCTTTTTGT TGCTGCTGAT GGTGATTATG CTGGCAATCC CGTTCCTGGC CCGCAATAAG TTCGGCGCAG CTATTAGCAT GGCGGTGTGG GGCGCTGCAA CCTTTGCGGT CGTACCGCCG TTACAGATGC GCGTGATGCG TGTCGCCAGT GAAGCGCCAG GTCTGTCTTC ATCAGTCAAT ATTGGTGCCT TTAATCTTGG AAATGCGCTG GGAGCAGCTG CTGGTGGTGC GGTAATTTCC GCTGGGCTGG GATACAGCTT TGTGCCGGTG ATGGGGGCGA TTGTCGCGGG ACTGGCATTA TTGCTGGTGT TTATGTCAGC CAGAAAACAA CCTGAAACAG TTTGCGTTGC TAACAGCTAA
|
Protein sequence | MKINYPLLAL AIGAFGIGTT EFSPMGLLPV IARGVDVSIP AAGMLISAYA VGVMVGAPLM TLLLSHRARR SALIFLMAIF TLGNVLSAIA PDYMTLMLSR ILTSLNHGAF FGLGSVVAAS VVPKHKQASA VATMFMGLTL ANIGGVPAAT WLGETIGWRM SFLATAGLGV ISMVSLFFSL PKGGAGARPE VKKELAVLMR PQVLSALLTT VLGAGAMFTL YTYISPVLQS ITHATPVFVT AMLVLIGVGF SIGNYLGGKL ADRSVNGTLK GFLLLLMVIM LAIPFLARNK FGAAISMAVW GAATFAVVPP LQMRVMRVAS EAPGLSSSVN IGAFNLGNAL GAAAGGAVIS AGLGYSFVPV MGAIVAGLAL LLVFMSARKQ PETVCVANS
|
| |