Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2077 |
Symbol | mdtG |
ID | 6145402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2091807 |
End bp | 2093033 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616953 |
Product | drug efflux system protein MdtG |
Protein accession | YP_001744129 |
Protein GI | 170679802 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.894184 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACCCT GTGAAAATGA CACCCCTATA AACTGGAAAC GAAACCTGAT CGTCGCCTGG CTAGGCTGTT TTCTTACCGG TGCCGCCTTC AGTCTGGTAA TGCCCTTCTT ACCCCTCTAC GTTGAGCAGC TTGGCGTTAC CGGTCACTCC GCCCTGAATA TGTGGTCCGG TATTGTCTTC AGCATTACAT TTTTATTTTC GGCCATCGCC TCACCGTTTT GGGGTGGACT CGCCGACCGT AAAGGCCGAA AACTCATGCT ATTACGCTCT GCTCTCGGCA TGGGCATCGT GATGGTGTTG ATGGGACTGG CACAAAATAT CTGGCAGTTT TTGATCCTGC GGGCGCTTCT TGGTTTACTT GGCGGATTTG TCCCCAACGC TAATGCTCTT ATCGCCACAC AAGTACCGCG TAATAAAAGC GGCTGGGCGC TGGGTACGCT CTCCACAGGC GGCGTTAGCG GCGCGTTGCT CGGCCCAATG GCTGGCGGTT TGCTCGCTGA TAGCTACGGC TTACGTCCGG TATTCTTTAT TACCGCCAGT GTGCTCATAC TCTGCTTTTT CGTCACCCTG TTTTGCATCA GAGAAAAATT CCAGCCGGTC AGCAAAAAAG AGATGCTGCA TATGCGGGAA GTGGTGACAT CACTTAAAAA CCCGAAACTG ATACTCAGCC TGTTTGTCAC CACGTTAATC ATCCAGGTGG CGACGGGCTC AATTGCCCCC ATTCTGACGC TGTATGTCCG CGAACTGGCG GGTAACGTCA GTAACGTCGC TTTTATCAGT GGCATGATCG CCTCGGTGCC AGGCGTGGCG GCTCTGCTGA GTGCACCACG ACTCGGCAAA CTTGGCGATC GAATCGGCCC CGAAAAGATC CTGATTACGG CGCTGATCTT TTCTGTACTG CTGTTGATCC CAATGTCATA CGTTCAGACG CCATTACAAC TTGGGATTTT ACGTTTTTTG CTCGGTGCCG CCGATGGTGC ACTACTCCCC GCCGTACAGA CACTGTTGGT TTACAACTCG AGCAACCAAA TCGCCGGGCG TATCTTCAGC TATAACCAGT CGTTTCGTGA TATTGGAAAC GTTACCGGAC CATTGATGGG AGCCGCGATT TCAGCGAACT ACGGTTTCAG AGCGGTATTT CTCGTCACCG CTGGCGTAGT GTTATTCAAC GCAGTCTATT CATGGAACAG TCTACGTCGT CGTCGAATAC CCCAGGTATC GAACTGA
|
Protein sequence | MSPCENDTPI NWKRNLIVAW LGCFLTGAAF SLVMPFLPLY VEQLGVTGHS ALNMWSGIVF SITFLFSAIA SPFWGGLADR KGRKLMLLRS ALGMGIVMVL MGLAQNIWQF LILRALLGLL GGFVPNANAL IATQVPRNKS GWALGTLSTG GVSGALLGPM AGGLLADSYG LRPVFFITAS VLILCFFVTL FCIREKFQPV SKKEMLHMRE VVTSLKNPKL ILSLFVTTLI IQVATGSIAP ILTLYVRELA GNVSNVAFIS GMIASVPGVA ALLSAPRLGK LGDRIGPEKI LITALIFSVL LLIPMSYVQT PLQLGILRFL LGAADGALLP AVQTLLVYNS SNQIAGRIFS YNQSFRDIGN VTGPLMGAAI SANYGFRAVF LVTAGVVLFN AVYSWNSLRR RRIPQVSN
|
| |