Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3925 |
Symbol | |
ID | 6970190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3636703 |
End bp | 3637887 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387699 |
Product | major facilitator family transporter |
Protein accession | YP_002272147 |
Protein GI | 209398442 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.51646 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAAC CTAATCATGA GCTTAGCCCG GCGCTAATCG TGCTGATGTC TATCGCCACC GGTCTGGCGG TCGCCAGCAA CTATTACGCC CAGCCATTGC TCGACACCAT CGCGCGTAAC TTTTCCCTTT CCGCCAGTTC GGCAGGCTTT ATTGTTACCG CCGCGCAGTT GGGCTATGCC GCCGGTCTGC TGTTTCTTGT TCCCCTCGGT GATATGTTTG AACGCCGCCG CCTGATTGTC TCGATGACCT TACTGGCAGC GGGCGGTATG TTAATCACTG CCAGCAGTCA GTCACTGGCG ATGATGATCC TCGGTACGGC ATTAACCGGT TTATTCTCAG TCGTGGCACA AATTCTGGTT CCGCTGGCAG CGACGCTGGC TTCACCGGAT AAACGCGGCA AAGTGGTTGG CACCATTATG AGCGGTCTGC TGTTGGGGAT ATTGCTGGCA CGAACCGTTG CCGGGTTACT GGCAAACCTC GGCGGCTGGC GCACCGTCTT TTGGGTTGCC TCGGTGTTAA TGGCTCTGAT GGCGCTGGCA TTATGGCGTG GTCTGCCACA AATGAAATCA GAAACCCACC TCAACTACCC ACAGTTGTTG GGTTCCGTTT TCAGCATGTT TATCAGCAAT AAGATCCTGC GCACCCGCGC GTTGCTGGGC TGCCTGACCT TTGCCAACTT CAGCATTCTC TGGACCTCAA TGGCCTTTTT GCTTGCCGCT CCACCTTTTA ACTACAGCGA TGGTGTAATT GGTCTGTTTG GGCTTGCGGG AGCTGCCGGA GCGTTGGGCG CTCGTCCGGC GGGCGGTTTT GCCGACAAGG GCAAATCGCA CCACACCACA ACTTTCGGTC TGCTTCTGCT GTTACTTTCA TGGCTGGCAA TCTGGTTTGG TCACACTTCC GTACTGGCGT TGATTATCGG CATTCTGGTG CTGGACCTCA CCGTACAGGG CGTGCATATC ACTAACCAGA CGGTAATTTA TCGAATACAT CCTGATGCGC GTAATCGCCT GACCGCAGGT TACATGACCA GCTACTTTAT TGGCGGTGCC GCCGGTTCGC TAATTTCAGC CTCAGCCTGG CAACATGGCG GTTGGGCTGG CGTTTGTCTG GCTGGCGCGA CGATTGCCCT GGTTAACTTA CTGGTCTGGT GGCGAGGTTT TCATCGTCAG GAAGCCGCAA ATTAA
|
Protein sequence | MTKPNHELSP ALIVLMSIAT GLAVASNYYA QPLLDTIARN FSLSASSAGF IVTAAQLGYA AGLLFLVPLG DMFERRRLIV SMTLLAAGGM LITASSQSLA MMILGTALTG LFSVVAQILV PLAATLASPD KRGKVVGTIM SGLLLGILLA RTVAGLLANL GGWRTVFWVA SVLMALMALA LWRGLPQMKS ETHLNYPQLL GSVFSMFISN KILRTRALLG CLTFANFSIL WTSMAFLLAA PPFNYSDGVI GLFGLAGAAG ALGARPAGGF ADKGKSHHTT TFGLLLLLLS WLAIWFGHTS VLALIIGILV LDLTVQGVHI TNQTVIYRIH PDARNRLTAG YMTSYFIGGA AGSLISASAW QHGGWAGVCL AGATIALVNL LVWWRGFHRQ EAAN
|
| |