Gene EcolC_0204 details

Gene Information

Locus tag: EcolC_0204
Symbol:
ID: 6064464
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 234266
End bp: 235423
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 55%
IMG OID: 641599605
Product: multidrug efflux system protein MdtE
Protein accession: YP_001723212
Protein GI: 170018258
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
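
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `check_cds` is hypothetical and not part of any database API.

```python
def check_cds(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute a residue to the protein.
    """
    span = end_bp - start_bp + 1          # inclusive span in bp
    codons, remainder = divmod(span, 3)   # whole codons and leftover bases
    return {
        "span_matches": span == gene_len_bp,
        "in_frame": remainder == 0,
        "protein_matches": codons - 1 == protein_len_aa,  # minus the stop codon
    }


# Values taken from the record above.
result = check_cds(234266, 235423, 1158, 385)
print(result)
```

For this record all three checks pass: 235423 - 234266 + 1 = 1158 bp, which is 386 codons, or 385 residues plus the stop codon.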


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0378135
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGAA GAAGAAAGCT GTTAATACCG TTGTTATTCT GCGGCGCGAT GCTCACCGCC 
TGCGATGACA AATCGGCGGA AAACGCCGCC GCCATGACGC CTGAGGTCGG TGTCGTCACA
CTCTCCCCCG GTTCGGTCAA TGTGTTGAGC GAATTGCCCG GTAGAACCGT TCCTTATGAA
GTTGCCGAGA TACGTCCCCA GGTGGGCGGT ATTATCATTA AACGCAACTT TATCGAAGGC
GATAAAGTGA ACCAGGGCGA TTCGCTGTAT CAGATTGATC CTGCACCTTT ACAGGCCGAG
CTAAACTCCG CCAAAGGCTC GCTGGCGAAA GCGCTCTCTA CCGCCAGCAA TGCCCGCATC
ACCTTTAACC GCCAGGCATC GTTGCTGAAG ACCAACTACG TTAGCCGTCA GGATTACGAC
ACCGCGCGCA CCCAGTTGAA TGAAGCAGAA GCCAATGTCA CCGTCGCCAA AGCGGCTGTT
GAACAGGCGA CGATCAATCT GCAATACGCG AATGTCACCT CGCCGATTAC GGGCGTCAGC
GGGAAATCGT CGGTGACCGT CGGCGCACTC GTTACCGCTA ATCAGGCAGA TTCGCTGGTT
ACCGTACAAC GTCTGGACCC GATTTATGTC GATCTCACGC AGTCGGTGCA AGATTTCTTA
CGCATGAAAG AAGAGGTCGC CAGTGGGCAA ATCAAACAGG TTCAGGGCAG TACGCCAGTA
CAGCTCAATC TGGAAAATGG TAAACGCTAC AGCCAGACCG GCACGCTGAA ATTCTCCGAC
CCGACAGTGG ATGAAACCAC GGGCTCCGTG ACGTTACGGG CGATTTTCCC CAACCCAAAT
GGTGACTTGC TGCCTGGCAT GTACGTCACG GCATTAGTGG ATGAAGGTAG CCGCCAGAAT
GTATTACTGG TGCCGCAGGA AGGCGTCACC CACAACGCCC AGGGTAAAGC AACGGCGCTC
ATTCTGGATA AAGACGATGT CGTGCAGCTA CGCGAAATTG AAGCCAGCAA AGCCATCGGC
GACCAGTGGG TCGTCACCTC TGGCTTGCAG GCTGGCGATC GGGTGATCGT TTCCGGTTTG
CAACGCATTC GTCCGGGTAT CAAAGCACGA GCAATTTCCT CCAGCCAGGA AAACGCCAGC
ACCGAATCGA AACAATAA
 
Protein sequence
MNRRRKLLIP LLFCGAMLTA CDDKSAENAA AMTPEVGVVT LSPGSVNVLS ELPGRTVPYE 
VAEIRPQVGG IIIKRNFIEG DKVNQGDSLY QIDPAPLQAE LNSAKGSLAK ALSTASNARI
TFNRQASLLK TNYVSRQDYD TARTQLNEAE ANVTVAKAAV EQATINLQYA NVTSPITGVS
GKSSVTVGAL VTANQADSLV TVQRLDPIYV DLTQSVQDFL RMKEEVASGQ IKQVQGSTPV
QLNLENGKRY SQTGTLKFSD PTVDETTGSV TLRAIFPNPN GDLLPGMYVT ALVDEGSRQN
VLLVPQEGVT HNAQGKATAL ILDKDDVVQL REIEASKAIG DQWVVTSGLQ AGDRVIVSGL
QRIRPGIKAR AISSSQENAS TESKQ
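
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the listed sequence is the coding strand in frame from its first base. It uses the standard codon assignments, which translation table 11 shares; the table's alternative start codons are not handled, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), listed in
# TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence above.
first_line = "ATGAACAGAA GAAGAAAGCT GTTAATACCG TTGTTATTCT GCGGCGCGAT GCTCACCGCC"
last_line = "ACCGAATCGA AACAATAA"
print(translate(first_line))  # MNRRRKLLIPLLFCGAMLTA
print(translate(last_line))   # TESKQ
```

The two outputs match the first 20 and last 5 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives the fraction that the GC content field reports as a percentage.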