Gene EcSMS35_4545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4545 
SymbolmdtO 
ID6145846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4646635 
End bp4648686 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content55% 
IMG OID641619361 
Productmultidrug efflux system protein MdtO 
Protein accessionYP_001746473 
Protein GI170681173 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGC TCAACTCCCT GCCATTACCG GTGGTCAGGC TGCTGGCGTT CTTTCATGAA 
GAGTTAAGCG AGCGGCGGCC AGGTCGTGTG CCGCAGACCG TGCAACTCTG GGTAGGCTGC
CTGCTGGTGA TTCTGATCTC GATGACCTTC GAGATCCCCT TTGTGGCGTT ATCGCTGGCG
GTGCTGTTTT ACGGTATTCA GTCCAACGCG TTTTACACCA AATTTGTCGC GATCTTGTTT
GTGGTTGCCA CGGTGCTGGA GATCGGCAGC CTGTTTTTGA TCTACAAATG GTCATACGGC
GAACCGTTGA TCCGATTGAT CATCGCAGGG CCGATCCTGA TGGGATGTAT GTTTTTGATG
CGCACCCATC GGCTGGGACT GGTCTTTTTC GCTGTCGCCA TTGTCGCCAT TTACGGGCAA
ACCTTCCCCG CCATGCTCGA CTATCCGGAA GTGGTCGTGC GCTTAACGCT GTGGTGTATC
GTTGTTGGCC TCTATCCAAC CTTGCTGATG ACGTTAATCG GCGTGCTGTG GTTTCCCAGC
CGCGCTATCA CGCAGATGCA TCAGGCGCTT AATGATCGGC TTGATGATGC CATTAGCCAC
CTGACGAACA GCCTCACACC GCTACCCGAA ACGCGGATTG AAAGAGAGGC GCTGGCGCTG
CAAAAACTCA ATGTCTTTTG CCTCGCGGAC GATGCCGACT GGCGTACTCA GAGCGCATGG
TGGCAAAGCT GTGTGGCAAC GGTAACCTAC ATTTACTCGA CGCTGAATCG CTACGATCCC
ACCTCTTTTG CTGATTCTCA GGCAATTATT GAATTCCGGC AAAAATTAGC CTCAGAAATC
AACAAGCTGC AGCATGCCAT TGCCGAAGGT CAGTGCTGGC AAAGCGACTG GCGGATCACT
GAAAGTCAAG CGATGGCGGC ACGGGAATGT AACCTGGAGA ATATCTGCCA GACATTGTTG
CAACTGGGTC AGATGGACCC GAATACGCCG CCAACGCCCG CCACCAAACC GCCATCAATG
GTCGCCGACG CTTTTACCAA TCCAGACTAT ATGCGCTACG CGGTAAAAAC GCTGCTCGCC
TGTTTGATCT GTTACACCTT CTACAGCGGC GTGGACTGGG AAGGCATTCA CACCTGTATG
CTGACCTGCG TGATCGTCGC TAATCCAAAT GTCGGTTCGT CGTACCAGAA GATGGTGCTG
CGTTTTGGCG GGGCCTTTTG CGGTGCGATT CTGGCGCTGT TATTCACGCT ACTGGTCATG
CCCTGGCTGG ACAATATTGT CGAATTGCTG TTTGTGCTGG CACCGATTTT CCTGTTGGGC
GCATGGATTG CCACCAGCTC TGAACGCTCT TCTTATATCG GCACACAGAT GGTGGTCACC
TTCGCGCTCG CCACGCTCGA AAATGTTTTT GGCCCGGTGT ACGACCTGGT GGAAATTCGC
GATCGCGCCC TGGGTATCAT CATTGGTACC GTGGTGTCCG CGGTGATTTA CACCTTTGTC
TGGCCTGAAA GTGAAGCGCG CACCCTGCCG CAAAAACTGG CTGGCGCGCT GGGTATGTTA
AGTAAAGTAA TGCGGATCCC ACGCCAGCAG GAAGTCACGG CTCTGCGCAC TTATCTGCAA
ATTCGCATAG GTCTGCATGC GGCGTTTAAT GCCTGTGAAG AGATGTGCCA ACGCGTGGCG
CTGGAGCGTC AACTGGACAG CGAAGAACGC GCCTTACTGA TTGAACGTTC GCAAACGGTT
ATTCGTCAGG GTCGCGATAT TCTTCACGCC TGGGATGCAA CCTGGAACTC GGCGCAGGCG
CTGGATAACG CACTACAGCC GGACAGAGCT GGTCAGTTTG CCGACGCCCT GGAGAAATAC
GCTGCCGGTC TGGCAACCGC ACTCAGTCGT TCTCCTCAAA TAACGCTTGA AGAAACACCC
GCCTCGCAGG CCATCCTGCC GACCTTATTA AAACAGGAGC AACACGTCTG CCAGCTTTTC
GCCCGCTTGC CAGACTGGAC AGCCCCGGCA TTAACGCCCG CCACGGAACA GGCACAAGGA
GCCACGCAAT GA
 
Protein sequence
MSALNSLPLP VVRLLAFFHE ELSERRPGRV PQTVQLWVGC LLVILISMTF EIPFVALSLA 
VLFYGIQSNA FYTKFVAILF VVATVLEIGS LFLIYKWSYG EPLIRLIIAG PILMGCMFLM
RTHRLGLVFF AVAIVAIYGQ TFPAMLDYPE VVVRLTLWCI VVGLYPTLLM TLIGVLWFPS
RAITQMHQAL NDRLDDAISH LTNSLTPLPE TRIEREALAL QKLNVFCLAD DADWRTQSAW
WQSCVATVTY IYSTLNRYDP TSFADSQAII EFRQKLASEI NKLQHAIAEG QCWQSDWRIT
ESQAMAAREC NLENICQTLL QLGQMDPNTP PTPATKPPSM VADAFTNPDY MRYAVKTLLA
CLICYTFYSG VDWEGIHTCM LTCVIVANPN VGSSYQKMVL RFGGAFCGAI LALLFTLLVM
PWLDNIVELL FVLAPIFLLG AWIATSSERS SYIGTQMVVT FALATLENVF GPVYDLVEIR
DRALGIIIGT VVSAVIYTFV WPESEARTLP QKLAGALGML SKVMRIPRQQ EVTALRTYLQ
IRIGLHAAFN ACEEMCQRVA LERQLDSEER ALLIERSQTV IRQGRDILHA WDATWNSAQA
LDNALQPDRA GQFADALEKY AAGLATALSR SPQITLEETP ASQAILPTLL KQEQHVCQLF
ARLPDWTAPA LTPATEQAQG ATQ