Gene ECH74115_3014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3014 
SymbolmdtA 
ID6972317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2794284 
End bp2795531 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content55% 
IMG OID643386849 
Productmultidrug efflux system subunit MdtA 
Protein accessionYP_002271317 
Protein GI209400133 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.105913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGCA GTTATAAATC CCGTTGGGTA ATCGTAATCG TGGTAGTTAT CGCCGCCATC 
GCCGCATTCT GGTTCTGGCA AGGCCGCAAT GACTCCCAGA GTGCAGCCCC AGGGGCGACG
AAACAAGCGC AGCAATCGCC AGCGGGTGGT CGCCGTGGTA TGCGTTCCGG CCCATTAGCC
CCGGTTCAGG CGGCGACCGC CGTAGAACAG GCAGTTCCGC GTTACCTCAC CGGGCTTGGC
ACCATTACCG CTGCTAATAC TGTTACGGTG CGTAGCCGCG TGGACGGTCA ACTGATGGCG
TTACATTTCC AGGAAGGCCA GCAGGTCAAA GCTGGCGATT TACTGGCAGA AATTGACCCC
AGCCAGTTCA AAGTTGCATT AGCACAAGCC CAGGGCCAAC TGGCAAAAGA TAAAGCCACG
CTTACCAACG CCCGCCGTGA TCTGGCGCGT TATCAGCAAC TGGCAAAAAC CAATCTCGTC
TCCCGCCAGG AGCTGGATGC CCAACAGGCG CTGGTCAGTG AAACCGAAGG CACCATTAAG
GCTGATGAAG CAAGCGTCGC CAGCGCGCAG CTGCAACTCG ACTGGAGCCG TATTACCGCA
CCAGTCGATG GTCGCGTTGG TCTCAAGCAG GTTGATGTTG GTAACCAAAT CTCCAGTGGT
GATACCACCG GGATCGTGGT GATCACCCAG ACGCATCCTA TCGATTTGCT CTTTACCCTG
CCGGAAAGCG ATATCGCTAC CGTCGTGCAG GCGCAAAAAG CCGGAAAACC GCTGGTGGTA
GAAGCCTGGG ATCGCACCAA CTCGAAGAAA TTAAGTGAAG GCACGCTGTT AAGTCTCGAT
AACCAAATCG ATGCCACTAC CGGTACGATT AAAGTGAAAG CACGCTTTAA TAATCAGGAT
GATGCGCTGT TTCCCAATCA GTTTGTTAAC GCGCGCATGT TAGTCGACAC CGAACAAAAC
GCCGTAGTGA TCCCAACAGC CGCCCTGCAA ATGGGCAATG AAGGCCATTT TGTCTGGGTG
CTGAATAGCG AAAACAAGGT CAGCAAACAT CTGGTGACGC CGGGCATTCA GGACAGTCAG
AAAGTGGTGA TCCGCGCAGG TATTTCTGCG GGCGATCGCG TGGTGACAGA CGGCATTGAT
CGCCTGACCG AAGGGGCGAA AGTGGAAGTG GTGGAAGCCC AGAGCGCCAC CACTCCGGAA
GAGAAAGCCA CCAGCCGCGA ATACGCGAAA AAAGGAGCTC GCTCCTGA
 
Protein sequence
MKGSYKSRWV IVIVVVIAAI AAFWFWQGRN DSQSAAPGAT KQAQQSPAGG RRGMRSGPLA 
PVQAATAVEQ AVPRYLTGLG TITAANTVTV RSRVDGQLMA LHFQEGQQVK AGDLLAEIDP
SQFKVALAQA QGQLAKDKAT LTNARRDLAR YQQLAKTNLV SRQELDAQQA LVSETEGTIK
ADEASVASAQ LQLDWSRITA PVDGRVGLKQ VDVGNQISSG DTTGIVVITQ THPIDLLFTL
PESDIATVVQ AQKAGKPLVV EAWDRTNSKK LSEGTLLSLD NQIDATTGTI KVKARFNNQD
DALFPNQFVN ARMLVDTEQN AVVIPTAALQ MGNEGHFVWV LNSENKVSKH LVTPGIQDSQ
KVVIRAGISA GDRVVTDGID RLTEGAKVEV VEAQSATTPE EKATSREYAK KGARS