Gene EcSMS35_1836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1836 
SymboleefA 
ID6143081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1859612 
End bp1860733 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content52% 
IMG OID641616712 
Productmultidrug efflux transport protein EefA 
Protein accessionYP_001743890 
Protein GI170683115 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000000609578 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGTATA TAGCAACATC TGTAGTGGCA ATGCTGCTCT TATCGGGTTG CGATAATACG 
CAAAGTAACA ATTCATCCCC GTCAGAAACA GAAGTCGGCG TTGTTACACT CAAATCTCAA
CCGGTTTCGG TAGTCAGTGA ATTAACCGGA CGCACCAGTG CTGCGCTCAG TGCCGAAGTA
CGTCCGCAGG TTGGGGGAAT TATCCAGAAA CGCTTATTTA AGGAAGGTGA TCTGGTCAAG
GCTGGACAGC CGCTCTACCA GATTGATGCG GCCAGTTATC AGGCTGCATG GAATGAAGCC
CGGGCAGCAT TACAACAAGC ACAGGCACTG GTAAAAGCCG ATTGCCAGAA AGCGCAGCGT
TATGCTCGAC TGGTGAAAGA GAACGGTGTT TCACAACAGG ATGCTGATGA TGCTCAGTCT
ACCTGTGCAC AAGATAAAGC CAGTGTAGAG GCGAAAAAAG CCGCACTGGA AACTGCGCGC
ATTAATCTTG ACTGGACCAC GGTAACCGCA CCGATTTCAG GGCGCATTGG CATTTCGTCG
GTAACCCCTG GCGCACTGGT GACCGCGTCG CAAGATACAG CGTTAACGAC GATTCGTGGT
CTGGATACAA TGTATGTCGA CCTCACTCGC TCCAGTGTCG ATTTATTACG TCTGCGTAAA
CAGTCACTGG CGACCAACAG TGACACCATG AGCGTCTCAC TTATTCTGGA AGATGGCACA
ACCTACAGCG AAAAAGGGCG TCTGGAACTC ACCGAAGTCG CGGTGGATGA GTCTACCGGT
TCGGTGACAT TACGGGCAAT TTTCCCCAAT CCACAACAGC AGTTATTACC GGGAATGTTT
GTTCGCGCTC GTGTCGATGA AGGCGTGATG GAAGACGCTA TTCTCGCGCC TCAACAGGGC
GTCACGCGCG ATGCTAAAGG CAATGCAACT GCGCTGGTGG TGAATAAAGA CAATAAAGTA
GAGCAGCGAG CGCTCGAAAC GGGAGAAACG TATGGTGATA AATGGCTGGT GCTGAACGGC
CTGCACAACG GCGACCGACT GATTGTTGAA GGTTCTGCCA AAGTCACTTC AGGGCAGACC
GTCAAGGCTG TTGAAGTTCA GGCTAATGGA GGCAACGCCT GA
 
Protein sequence
MKYIATSVVA MLLLSGCDNT QSNNSSPSET EVGVVTLKSQ PVSVVSELTG RTSAALSAEV 
RPQVGGIIQK RLFKEGDLVK AGQPLYQIDA ASYQAAWNEA RAALQQAQAL VKADCQKAQR
YARLVKENGV SQQDADDAQS TCAQDKASVE AKKAALETAR INLDWTTVTA PISGRIGISS
VTPGALVTAS QDTALTTIRG LDTMYVDLTR SSVDLLRLRK QSLATNSDTM SVSLILEDGT
TYSEKGRLEL TEVAVDESTG SVTLRAIFPN PQQQLLPGMF VRARVDEGVM EDAILAPQQG
VTRDAKGNAT ALVVNKDNKV EQRALETGET YGDKWLVLNG LHNGDRLIVE GSAKVTSGQT
VKAVEVQANG GNA