Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2261 |
Symbol | mdtH |
ID | 6269572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2057858 |
End bp | 2059066 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641726278 |
Product | multidrug resistance protein MdtH |
Protein accession | YP_001880762 |
Protein GI | 187733205 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCGCG TATCGCAGGC GAGGAACCTG GGTAAATATT TCCTGCTCAT CGATAATATG CTGGTCGTGC TGGGGTTCTT TGTTGTCTTC CCGCTGATCT CTATCCGCTT CGTTGATCAA ATGGGCTGGG CCGCCGTCAT GGTCGGTATT GCTCTCGGTC TACGCCAGTT TATTCAGCAA GGTCTGGGTA TTTTCGGCGG CGCAATTGCC GACCGCTTTG GTGCCAAACC GATGATTGTT ACCGGTATGC TGATGCGCGC CGCCGGATTC GCAACAATGG GTATCGCCCA CGAACCGTGG CTATTGTGGT TTTCATGCCT GCTCTCGGGA CTCGGCGGCA CGTTGTTTGA TCCGCCGCGT TCGGCGCTGG TGGTGAAACT AATCCGTCCA CAACAACGTG GTCGTTTTTT CTCGCTGTTG ATGATGCAGG ACAGTGCCGG TGCGGTCATT GGCGCATTGT TGGGGAGCTG GCTGTTGCAA TACGACTTTC GCCTGGTCTG CGCCACAGGG GCCGTTTTGT TTGTGCTGTG TGCGGCGTTC AATGCGTGGT TGTTACCGGC ATGGAAACTC TCCACCGTAC GCACGCCCGT TCGCGAAGGC ATGACCCGCG TGATGCGTGA CAAGCGTTTT GTCACCTATG TCCTGACGCT GGCGGGTTAC TACATGCTGG CTGTACAAGT GATGCTGATG CTGCCAATTA TGGTCAACGA CGTGGCTGGC GCGCCCTCTG CCGTTAAATG GATGTATGCC ATTGAAGCGT GTCTGTCGTT AACGTTGCTC TACCCTATCG CCCGCTGGAG TGAAAAGCAT TTTCGTCTGG AACACCGGTT GATGGCTGGG CTGTTGATAA TGTCATTAAG CATGATGCCG GTGGGCATGG TCAGCGGCCT GCAACAACTT TTCACCCTGA TTTGTCTGTT TTATATCGGG TCGATCATTG CCGAGCCTGC GCGTGAAACC TTAAGTGCTT CGCTGGCGGA CGCAAGAGCT CGCGGCAGCT ATATGGGGTT TAGCCGTCTG GGTCTGGCGA TTGGCGGCGC TATTGGTTAT ATCGGTGGTG GCTGGCTGTT TGACCTGGGC AAATCGGCGC ACCAGCCAGA GCTTCCGTGG ATGATGCTGG GAATTATTGG CATCTTCACT TTCCTTGCGC TGGGTTGGCA GTTTAGTCAG AAACGCGCCG CGCGTCGTTT GCTGGAACGC GACGCCTGA
|
Protein sequence | MSRVSQARNL GKYFLLIDNM LVVLGFFVVF PLISIRFVDQ MGWAAVMVGI ALGLRQFIQQ GLGIFGGAIA DRFGAKPMIV TGMLMRAAGF ATMGIAHEPW LLWFSCLLSG LGGTLFDPPR SALVVKLIRP QQRGRFFSLL MMQDSAGAVI GALLGSWLLQ YDFRLVCATG AVLFVLCAAF NAWLLPAWKL STVRTPVREG MTRVMRDKRF VTYVLTLAGY YMLAVQVMLM LPIMVNDVAG APSAVKWMYA IEACLSLTLL YPIARWSEKH FRLEHRLMAG LLIMSLSMMP VGMVSGLQQL FTLICLFYIG SIIAEPARET LSASLADARA RGSYMGFSRL GLAIGGAIGY IGGGWLFDLG KSAHQPELPW MMLGIIGIFT FLALGWQFSQ KRAARRLLER DA
|
| |