Gene EcSMS35_2807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2807 
SymbolemrA 
ID6145823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2886921 
End bp2888093 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content53% 
IMG OID641617676 
Productmultidrug resistance protein A 
Protein accessionYP_001744836 
Protein GI170682707 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR00998] efflux pump membrane protein (multidrug resistance protein A) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000414824 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00826621 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGCAA ATGCGGAGAC TCAAACCCCG CAGCAACCGG TAAAGAAGAG CGGCAAACGT 
AAGCGTCTGC TCCTCCTTCT CACCTTGCTC TTTATAATTA TTGCCGTAGC GATAGGGATT
TATTGGTTTT TAGTACTGCG TCATTTCGAA GAAACCGATG ATGCATACGT GGCAGGGAAT
CAGATTCAAA TTATGTCTCA GGTGTCTGGC AGCGTGACGA AAGTCTGGGC CGATAACACC
GATTTTGTAA AAGAAGGCGA CGTGCTGGTC ACTCTCGACC CGACAGATGC TCGCCAGGCG
TTTGAAAAAG CCAAAACTGC ACTGGCTTCC AGCGTTCGCC AAACCCACCA GCTGATGATT
AACAGCAAAC AGTTGCAGGC GAATATTGAG GTGCAAAAAA TCGCACTCGC GAAAGCACAA
AGCGACTACA ACCGCCGTGT GCCGCTGGGC AATGCCAACC TGATTGGTCG CGAAGAGCTG
CAACACGCCC GCGACGCCGT TACCAGTGCC CAGGCACAAC TGGACGTCGC TATTCAACAA
TACAATGCAA ATCAGGCGAT GATTCTGGGG ACGAAACTGG AAGATCAGCC TGCCGTGCAA
CAGGCTGCCA CCGAAGTACG TAACGCCTGG CTGGCGCTGG AGCGTACTCG TATTGTCAGT
CCGATGACCG GTTATGTCTC CCGCCGCGCG GTACAGCCTG GGGCGCAAAT TAGCCCAACG
ACGCCGCTGA TGGCGGTCGT TCCAGCCACC AATATGTGGG TGGATGCCAA CTTTAAAGAG
ACGCAGATTG CCAATATGCG TATCGGTCAA CCGGTCACCA TCACCACCGA TATTTACGGC
GATGATGTGA AATACACCGG TAAAGTGGTT GGTCTGGATA TGGGCACAGG TAGCGCGTTC
TCACTGCTTC CGGCGCAAAA CGCCACCGGT AACTGGATCA AAGTCGTTCA GCGTCTGCCT
GTTCGTATTG AACTGGACCA GAAACAGCTC GAGCAATACC CGCTGCGTAT CGGTTTGTCC
ACGCTGGTGA GCGTCAATAC CACTAACCGT GACGGTCAGG TACTGGCAAA TAAAGTACGT
TCCACACCGG TAGCGGTAAG CACCGCGCGT GAAATCAGCT TGGCACCTGT CAATAAACTG
ATCGACGATA TCGTAAAAGC TAACGCTGGC TAA
 
Protein sequence
MSANAETQTP QQPVKKSGKR KRLLLLLTLL FIIIAVAIGI YWFLVLRHFE ETDDAYVAGN 
QIQIMSQVSG SVTKVWADNT DFVKEGDVLV TLDPTDARQA FEKAKTALAS SVRQTHQLMI
NSKQLQANIE VQKIALAKAQ SDYNRRVPLG NANLIGREEL QHARDAVTSA QAQLDVAIQQ
YNANQAMILG TKLEDQPAVQ QAATEVRNAW LALERTRIVS PMTGYVSRRA VQPGAQISPT
TPLMAVVPAT NMWVDANFKE TQIANMRIGQ PVTITTDIYG DDVKYTGKVV GLDMGTGSAF
SLLPAQNATG NWIKVVQRLP VRIELDQKQL EQYPLRIGLS TLVSVNTTNR DGQVLANKVR
STPVAVSTAR EISLAPVNKL IDDIVKANAG