Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1444 |
Symbol | mdtH |
ID | 6969700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1426848 |
End bp | 1428056 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643385417 |
Product | multidrug resistance protein MdtH |
Protein accession | YP_002269911 |
Protein GI | 209400380 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.723421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.006681 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCCGCG TGTCGCAGGC GAGGAACCTG GGTAAATATT TCCTGCTCAT CGATAATATG CTGGTCGTGC TGGGGTTCTT TGTTGTCTTC CCGCTGATCT CTATCCGCTT CGTTGATCAA ATGGGCTGGG CCGCCGTCAT GGTCGGTATT GCCCTCGGCC TACGCCAGTT TATTCAGCAA GGTCTGGGTA TTTTCGGCGG CGCAATTGCC GACCGCTTTG GTGCCAAACC GATGATTGTT ACCGGTATGC TGATGCGTGC CGCCGGATTC GCCACAATGG GTATCGCCCA CGAACCGTGG CTATTGTGGT TTTCATGCCT GCTCTCGGGA CTCGGTGGCA CGTTGTTTGA TCCGCCGCGT TCGGCGCTGG TGGTGAAACT AATCCGTCCA CAACAGCGTG GTCGTTTTTT CTCGCTGTTG ATGATGCAGG ACAGTGCCGG TGCGGTCATT GGCGCATTGT TGGGGAGCTG GCTGTTGCAA TACGACTTTC GCCTGGTCTG CGCCACAGGG GCCGTTTTGT TTGTGCTGTG TGCAGCGTTC AATGCGTGGT TGTTACCGGC ATGGAAACTC TCCACCGTAC GCACGCCCGT TCGCGAAGGC ATGACCCGCG TGATGCGCGA CAAGCGTTTT GTCACCTATG TCCTGACGCT GGCGGGTTAC TACATGCTGG CTGTACAAGT GATGCTGATG CTGCCAATTA TGGTCAACGA TGTGGCTGGC GCGCCCTCTG CCGTTAAATG GATGTATGCC ATTGAAGCGT GTCTGTCGTT AACGTTGCTC TACCCTATCG CCCGCTGGAG TGAAAAGCAT TTTCGTCTGG AACACCGGTT GATGGCTGGG CTGTTGATAA TGTCATTAAG CATGATGCCG GTGGGCATGG TCAGCGGCCT GCAACAACTT TTCACCCTGA TTTGTCTGTT TTATATCGGG TCGATCATTG CCGAGCCTGC GCGTGAAACC TTAAGTGCTT CGCTGGCGGA CGCAAGAGCT CGCGGCAGCT ATATGGGGTT TAGCCGTCTG GGTCTGGCGA TTGGCGGCGC TATTGGTTAT ATCGGTGGCG GCTGGCTGTT TGACCTGGGC AAATCGGTGC ACCAGCCAGA GCTTCCGTGG ATGATGCTGG GCATTATTGG CATCTTCACT TTCCTTGCGC TGGGTTGGCA GTTTAGCCAG AAACGCGCCG CGCGTCGTTT GCTTGAACGC GACGCCTGA
|
Protein sequence | MSRVSQARNL GKYFLLIDNM LVVLGFFVVF PLISIRFVDQ MGWAAVMVGI ALGLRQFIQQ GLGIFGGAIA DRFGAKPMIV TGMLMRAAGF ATMGIAHEPW LLWFSCLLSG LGGTLFDPPR SALVVKLIRP QQRGRFFSLL MMQDSAGAVI GALLGSWLLQ YDFRLVCATG AVLFVLCAAF NAWLLPAWKL STVRTPVREG MTRVMRDKRF VTYVLTLAGY YMLAVQVMLM LPIMVNDVAG APSAVKWMYA IEACLSLTLL YPIARWSEKH FRLEHRLMAG LLIMSLSMMP VGMVSGLQQL FTLICLFYIG SIIAEPARET LSASLADARA RGSYMGFSRL GLAIGGAIGY IGGGWLFDLG KSVHQPELPW MMLGIIGIFT FLALGWQFSQ KRAARRLLER DA
|
| |