Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2064 |
Symbol | mdtH |
ID | 6145060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2081968 |
End bp | 2083176 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616940 |
Product | multidrug resistance protein MdtH |
Protein accession | YP_001744116 |
Protein GI | 170681348 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.182289 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGCG TGTCGCAGGC GAGGAACCTG GGTAAATATT TCCTGCTCAT CGATAATATG CTGGTCGTGC TGGGGTTCTT TGTTGTCTTC CCGCTGATCT CTATCCGCTT CGTTGATCAA ATGGGCTGGG CCGCCGTCAT GGTCGGTATT GCCCTCGGCC TACGCCAGTT TATTCAGCAA GGTCTGGGTA TTTTCGGCGG AGCAATTGCC GACCGCTTTG GTGCCAAACC GATGATTGTT ACCGGTATGC TGATGCGTGC CGCCGGATTC GCCACAATGG GTATCGCCCA CGAACCGTGG CTACTGTGGT TTTCATGCCT GCTTTCGGGA CTCGGCGGCA CGTTGTTTGA TCCTCCGCGT TCGGCGCTGG TGGTGAAACT AATCCGTCCA CAACAGCGTG GTCGTTTTTT TTCGCTGTTG ATGATGCAGG ACAGTGCCGG TGCGGTCATT GGCGCATTGT TGGGGAGCTG GCTGTTGCAA TACGACTTTC GCCTGGTCTG CGCCACAGGG GCAGTTTTGT TTGTGCTGTG TGCAGCGTTC AATGCGTGGT TGTTACCGGC ATGGAAACTC TCCACCGTAC GCACGCCCGT TCGCGAAGGC ATGACCCGCG TGATGCGCGA CAAGCGTTTT GTCACCTATG TCCTGACGCT GGCGGGTTAC TACATGCTGG CTGTACAAGT GATGCTGATG CTGCCAATTA TGGTCAACGA CGTGGCTGGC GCGCCCTCTG CCGTTAAATG GATGTATGCC ATTGAAGCGT GTCTGTCGTT AACGTTGCTC TATCCTATCG CCCGCTGGAG TGAAAAGCAT TTTCGTCTGG AACATCGGTT GATGGCTGGG CTGTTGATAA TGTCATTAAG CATGATGCCA GTGGGCATGG TCAGCGGTCT GCAACAACTT TTCACCCTGA TTTGTCTGTT TTATATCGGG TCGATCATTG CCGAGCCTGC GCGTGAAACC TTAAGTGCTT CGCTGGCAGA CGCAAGAGCT CGCGGCAGCT ATATGGGGTT TAGCCGTCTG GGTCTGGCGA TTGGCGGCGC TATTGGTTAT ATCGGTGGCG GCTGGCTGTT TGACCTGGGC AAATCGGCGC ACCAGCCAGA GCTTCCGTGG ATGATGCTGG GCATTATTGG CATCTTCACT TTCCTTGCGC TGGGTTGGCA GTTTAGCCAG AAACGCACCG CGCGTCGTTT GCTTGAACGC GACGCCTGA
|
Protein sequence | MSRVSQARNL GKYFLLIDNM LVVLGFFVVF PLISIRFVDQ MGWAAVMVGI ALGLRQFIQQ GLGIFGGAIA DRFGAKPMIV TGMLMRAAGF ATMGIAHEPW LLWFSCLLSG LGGTLFDPPR SALVVKLIRP QQRGRFFSLL MMQDSAGAVI GALLGSWLLQ YDFRLVCATG AVLFVLCAAF NAWLLPAWKL STVRTPVREG MTRVMRDKRF VTYVLTLAGY YMLAVQVMLM LPIMVNDVAG APSAVKWMYA IEACLSLTLL YPIARWSEKH FRLEHRLMAG LLIMSLSMMP VGMVSGLQQL FTLICLFYIG SIIAEPARET LSASLADARA RGSYMGFSRL GLAIGGAIGY IGGGWLFDLG KSAHQPELPW MMLGIIGIFT FLALGWQFSQ KRTARRLLER DA
|
| |