Gene EcSMS35_0986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0986 
SymbolmdtA 
ID6144829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp999683 
End bp1000930 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content55% 
IMG OID641615873 
Productmultidrug efflux system subunit MdtA 
Protein accessionYP_001743065 
Protein GI170683959 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.607592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGCA GTTATAAATC CCGTTGGGTA ATCGTAATCG TGGTGGTTAT CGCCGCCATC 
GCCGCATTCT GGTTCTGGCA AGGCCGCAAT GACTCCCGGA GTGCAGCCCC AGGGGCGACG
AAACAAGCGC AGCAATCGCC AGCGGGTGGT CGCCGTGGTA TGCGTTCCGG CCCATTAGCC
CCGGTTCAGG CGGCGACCGC CGTAGAACAG GCTGTTCCGC GTTACCTCAC CGGGCTTGGC
ACCATTACCG CCGCTAATAC TGTTACGGTG CGCAGCCGCG TGGACGGTCA ACTGATGGCG
TTGCATTTCC AGGAAGGCCA GCAGGTCAAA GCTGGCGATT TATTGGCAGA AATTGACCCC
AGCCAGTTCA AAGTTGCATT AGCACAAGCC CTGGGCCAAC TGGCAAAAGA TAAAGCCACG
CTTGCCAACG CCCGCCGTGA CCTGGCGCGT TATCAACAAC TGGTAAAAAC CAATCTCGTA
TCTCGTCAGG AACTGGATGC CCAACAAGCG CTGGTCAGTG AAACCGAAGG TACCATTAAG
GCTGATGAAG CAAGCGTCGC CAGCGCGCAA TTGCAACTCG ACTGGAGCCG TATCACCGCA
CCAGTTGATG GTCGCGTTGG TCTCAAGCAG GTTGATGTTG GTAACCAAAT CTCCAGTGGC
GACACCACCG GGATTGTGGT GATCACCCAA ACACATCCTA TCGATCTGGT CTTTACTCTG
CCGGAAAGCG ATATCGCCAC CGTTGTACAG GCACAAAAAG CCGGAAAACC GCTGGTGGTA
GAAGCCTGGG ATCGCACCAA CTCGAAGAAG TTAAGTGAAG GCACGCTGTT AAGCCTTGAT
AACCAAATCG ATGCCACTAC CGGTACGATT AAAGTGAAAG CTCGCTTTAA TAATCAGGAT
GATGCGCTTT TCCCTAATCA GTTTGTTAAC GCGCGCATGT TAGTCGACAC CGAACAAAAC
GCCGTGGTGA TCCCCACCGC CGCTCTGCAA ATGGGCAACG AAGGCCATTT TGTCTGGGTG
CTGAATAGCG AAAACAAGGT CAGCAAACAT CTGGTGACAC CGGGCATTCA GGACAGTCAG
AGAGTGGTGA TCCGCGCAGG TATTTCTGCG GGCGATCGCG TGGTGACGGA TGGCATTGAT
CGCCTGACCG AAGGGGCGAA AGTGGAAGTG GTGGAAGCCC AGAGCGCCAC CACTTCGGAA
GAGAAAGCCA CCAGCCGCGA ATACGCGAAA AAAGGAGCAC GCTCCTGA
 
Protein sequence
MKGSYKSRWV IVIVVVIAAI AAFWFWQGRN DSRSAAPGAT KQAQQSPAGG RRGMRSGPLA 
PVQAATAVEQ AVPRYLTGLG TITAANTVTV RSRVDGQLMA LHFQEGQQVK AGDLLAEIDP
SQFKVALAQA LGQLAKDKAT LANARRDLAR YQQLVKTNLV SRQELDAQQA LVSETEGTIK
ADEASVASAQ LQLDWSRITA PVDGRVGLKQ VDVGNQISSG DTTGIVVITQ THPIDLVFTL
PESDIATVVQ AQKAGKPLVV EAWDRTNSKK LSEGTLLSLD NQIDATTGTI KVKARFNNQD
DALFPNQFVN ARMLVDTEQN AVVIPTAALQ MGNEGHFVWV LNSENKVSKH LVTPGIQDSQ
RVVIRAGISA GDRVVTDGID RLTEGAKVEV VEAQSATTSE EKATSREYAK KGARS