Gene EcSMS35_4544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4544 
Symbol 
ID6144744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4645172 
End bp4646638 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content56% 
IMG OID641619360 
Productputative outer membrane efflux protein MdtP 
Protein accessionYP_001746472 
Protein GI170681660 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAATC GTCAACTTTC ACGCCTGCTG TTGTGCAGCA TTCTCGGCAG CACGACGCTG 
ATTTCCGGCT GTGCCCTGGT ACGTAAGGAT TCTGCGCCTC ATCAACAGCT CAAACCGGAA
CAAATCAAAC TGGCCGATGA TATTCATCTT GCCAGCTCCG GCTGGCCGCA GGCGCAGTGG
TGGAAACAAC TCAATGACCC GCAGTTGGAT GCTTTGATCC AACGGACGCT AAGTGGTTCA
CACACCCTCG CCGAAGCGAA ACTGCGGGAA GAAAAAGCGC AATCGCAGGC CGATTTGTTA
GATGCCGGTT CACAGTTGCA GGTGGCAGCG TTAGGGATGC TTAACCGCCA ACGCGTCTCG
GCGAACGGCT TTTTAAGCCC TTATGCGATG GATGCGCCCG CACTGGGGAT GGACGGGCCG
TACTATACGG AAGCCACAGT AGGTTTGTTT GCCGGACTGG ATCTCGATTT GTGGGGTGTG
CATCGCTCAG CGGTTGCCGC CGCCATTGGC GCGCATAATG CAGCGCTGGC AGAAACCGCA
GCAGTAGAGC TATCGCTGAC CACGGGCGTA GCGCAGCTTT ATTACAGTAT GCAGGCCAGC
TATCAGATGC TCGATCTGTT AGAACAAACA CGCGATGTGA TTGATTACGC GGTGAAAGCG
CACCAAAGTA AAGTGGCGCA CGGTCTGGAA GCGCAAGTGC CTTTCCACGG CGCGCGGGCG
CAAATTCTGG CCGTCGATAA ACAAATTGCC GCCGTCAAAG GGCAAATCAC TGAAACGCGG
GAATCTCTGC GCGCATTGAT TGGCGCGGGC GCCAGCGATA TGCCGGAGAT CAAACCGGTG
GCATTACCGC GAGTCCAGAC CGGCATTCCG GCAACACTCT CTTATGAGTT GCTCGCCAGA
CGCCCGGATC TGCAAGCCAT GCGCTGGTAT GTTCAGGCGT CATTAGATCA GGTGGATTCC
GCGCGGGCGC TGTTCTATCC GAGCTTTGAT ATCAAAGCGT TTTTCGGCCT GGATTCTATC
CACCTGGATA CCTTATTCAA AAAAACCAGT CGCCAGTTCA ACTTTATCCC AGGTCTGAAA
TTGCCGCTGT TTGACGGTGG ACGGTTGAAT GCCAATCTCG AAGGCACGCG CGCCGCCAGC
AACATGATGA TTGAACGTTA CAACCAGTCA GTACTGAACG CGGTGCGCGA CGTTGCCGTC
AACGGCACGC GTCTGCAAAC GCTTAACGAC GAGCGAGAGA TGCAGGCTGA ACGCGTGGAA
GCAACGCGCT TCACCCAGCG CGCTGCCGAG GCCGCCTATC AACGCGGCTT AACCAGCCGC
TTACAGGCCA CCGAAGCCCG GTTGCCAGTG CTTGCCGAGG AGATGTCATT ACTGATGCTG
GACAGCCGCC GGGTGATCCA AAGCATTCAG TTGATGAAAT CGCTGGGCGG CGGGTATCAG
GCGGCTCCCG TCGTCGAGAA AAAATAA
 
Protein sequence
MINRQLSRLL LCSILGSTTL ISGCALVRKD SAPHQQLKPE QIKLADDIHL ASSGWPQAQW 
WKQLNDPQLD ALIQRTLSGS HTLAEAKLRE EKAQSQADLL DAGSQLQVAA LGMLNRQRVS
ANGFLSPYAM DAPALGMDGP YYTEATVGLF AGLDLDLWGV HRSAVAAAIG AHNAALAETA
AVELSLTTGV AQLYYSMQAS YQMLDLLEQT RDVIDYAVKA HQSKVAHGLE AQVPFHGARA
QILAVDKQIA AVKGQITETR ESLRALIGAG ASDMPEIKPV ALPRVQTGIP ATLSYELLAR
RPDLQAMRWY VQASLDQVDS ARALFYPSFD IKAFFGLDSI HLDTLFKKTS RQFNFIPGLK
LPLFDGGRLN ANLEGTRAAS NMMIERYNQS VLNAVRDVAV NGTRLQTLND EREMQAERVE
ATRFTQRAAE AAYQRGLTSR LQATEARLPV LAEEMSLLML DSRRVIQSIQ LMKSLGGGYQ
AAPVVEKK