Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3372 |
Symbol | |
ID | 6969819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3116691 |
End bp | 3117881 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643387181 |
Product | transporter, major facilitator family |
Protein accession | YP_002271644 |
Protein GI | 209400975 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAGA AGTTATGGAC GAAGGATTTT TGGGCAATAA CCATCATCAG CTTTATTATT TTCTTCGTCT TTTATGTTTT ACTAACATTG TTGCCAATTT ATATCTCTGA CCGCTTGCAT GCCTCTCCTG ATAAAGCAGG TTTGTTGGTG ACTTTATTTT TAATTGCGGC GATTGTTATT CGACCCTTTG CCGGGCAATG GGTGGGTAAA TATTCGAATA AAACTATTCT GGTGCTCTCT TCTCTGGCCT TTTTGGTGGT CACTGCGCTG TATCCTTTTT GCCACTCAAT AGAATCACTG CTTTTTATTA GGGTGCTTCA TGGTATTACC TTCGGGGTTA TCACAACGGT AAAGGGAACG ATTTCCGCGC GGCTGATCCC GGCCTCCCGA CGTGGGGAGG GCATCAGTTT TTTCTCTCTG GCAATGGGGC TGGCAATGGT GGTCGGGCCG TGGATTGGCC TGAATATGGC GCGCTGGGAG GCCTTTAATA TGGCTTTCTG GTTATGTACT GGCGTTGCGG CGGTGGGGAT TATCCTGTCG CTGATTATGA CCGTGCCGCC GGTTATCAGC CATGCCGACG GTTCAAAGCC AAAGATGGGC TTCGCCGCCA TGTTCGATCG CGCGGCATTG CCGTTTGCCA TGGTTACATT CTTTATGACC TTTTCGTATG CCGGGGTTTC TGCCTTTCTG GCGCTTTACG CCCGCGAACT TAATCTGATG TCGGCGGCCA GTAATTTCCT GCTCTGCTAC GCCATCTTCC TGATGATCTG CCGTACCTTC ACCGGCAATG TTTGCGACAA AAAAGGCCCG AAATATGTGG TTTACCCCTG CCTGCTGTTC TTTACGGTTG GGCTGGTGGT TCTCGGCTAC ACCCAGGGCA GCGTAATGAT GGTCGTTTCT GGCGCGTTGA TTGGTATCGG GTATGGTTCC GTGACGCCAG TTTTTCAGAC GCAGATTATC AGTTCAGTGG AACCGCATAA AATCGGTGTC GCAAACTCCC TCTTCTTCAA TGCGATGGAT GCAGGCCTGG CGCTGGGAGC CTGTGTGATG GGGATGATGG TTGCACATAC TGGCTACCGA ATGATTTATC TGCTGGGCGC ACTATTAGTG GTAGTGGCTG GTGGAGTCTA TGCGCTGCAA ATGAAGGGAA AAAGCGGTGT CGCGCTAGTA GTGGCAAAAG AAATTCATTA A
|
Protein sequence | MKEKLWTKDF WAITIISFII FFVFYVLLTL LPIYISDRLH ASPDKAGLLV TLFLIAAIVI RPFAGQWVGK YSNKTILVLS SLAFLVVTAL YPFCHSIESL LFIRVLHGIT FGVITTVKGT ISARLIPASR RGEGISFFSL AMGLAMVVGP WIGLNMARWE AFNMAFWLCT GVAAVGIILS LIMTVPPVIS HADGSKPKMG FAAMFDRAAL PFAMVTFFMT FSYAGVSAFL ALYARELNLM SAASNFLLCY AIFLMICRTF TGNVCDKKGP KYVVYPCLLF FTVGLVVLGY TQGSVMMVVS GALIGIGYGS VTPVFQTQII SSVEPHKIGV ANSLFFNAMD AGLALGACVM GMMVAHTGYR MIYLLGALLV VVAGGVYALQ MKGKSGVALV VAKEIH
|
| |