Gene EcSMS35_3930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3930 
Symbol 
ID6146715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4005888 
End bp4007024 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID641618756 
Productauxiliary membrane fusion protein family transporter 
Protein accessionYP_001745895 
Protein GI170682468 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.954749 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTAT TGATTGTTTT AACTTACGTG GCGCTGGCGT GGGCGGTCTT TAAAATCTTC 
CGCATTCCGG TAAATCAGTG GACGCTGGCG ACGGCGGCGC TGGGAGGCGT GTTTCTGGTG
AGTGGTTTGA TTTTGTTGAT GAACTACAAC CACCCTTACA CTTTTACCGC GCAAAAGGCA
GTGATAGCGA TCCCCATCAC GCCACAGGTG ACGGGAATTG TTACTGAAGT CACCGACAAA
AATAATCAGC TTATTCAGAA GGGCGAGGTG CTTTTTAAGC TCGACCCGGT TCGTTACCAG
GCGCGAGTTG ACAGACTTCA GGCTGACCTG ATGACGGCGA CGCATAATAT AAAGACGCTG
CGCGCGCAGC TCACTGAAGC GCAGGCCAAC ACCACCCAGG TTTCAGCGGA GCGCGACCGT
CTGTTTAAAA ATTATCAACG TTACCTGAAA GGCAGCCAGG CGGCGGTGAA TCCGTTCTCG
GAACGTGACA TCGACGATGC GCGGCAAAAT TTCCTCGCGC AGGACGCGCT GGTGAAAGGC
TCGGTGGCGG AGCAGGCGCA GATCCAGAGC CAGCTCGACA GTATGGTTAA CGGCGAGCAG
TCGCAGATTG TGAGCTTAAG AGCGCAACTT ACTGAAGCAA AATATAACCT TGAGCAGACT
GTCATTCGCG CACCGAGCAA TGGCTACGTT ACTCAGGTAC TGATCCGCCC AGGTACATAC
GCAGCTGCCT TGCCGCTACG TCCGGTGATG GTTTTCATCC CCGAGCAAAA ACGGCAAATT
GTCGCCCAAT TTCGGCAGAA CTCGCTGTTA CGTCTGAAAC CCGGCGATGA TGCGGAAGTG
GTGTTTAACG CGCTACCTGG GCAGGTGTTC CACGGCAAAC TGACTAGTAT TTTACCTGTC
GTGCCAGGCG GTTCGTATCA GGCGCAAGGG GTACTGCAAT CATTAACCGT CGTGCCCGGC
ACGGACGGTG TGCTGGGAAC CATTGAACTG GACCCTAACG ATGATATCGA TGCTTTACCC
GACGGCATCT ACGCCCAGGT GGCAGTCTAC TCCGACCATT TCAGCCATGT TTCGGTGATG
CGGAAAGTGC TGCTAAGAAT GACCAGTTGG ATGCATTATC TTTATTTGGA TCATTAA
 
Protein sequence
MDLLIVLTYV ALAWAVFKIF RIPVNQWTLA TAALGGVFLV SGLILLMNYN HPYTFTAQKA 
VIAIPITPQV TGIVTEVTDK NNQLIQKGEV LFKLDPVRYQ ARVDRLQADL MTATHNIKTL
RAQLTEAQAN TTQVSAERDR LFKNYQRYLK GSQAAVNPFS ERDIDDARQN FLAQDALVKG
SVAEQAQIQS QLDSMVNGEQ SQIVSLRAQL TEAKYNLEQT VIRAPSNGYV TQVLIRPGTY
AAALPLRPVM VFIPEQKRQI VAQFRQNSLL RLKPGDDAEV VFNALPGQVF HGKLTSILPV
VPGGSYQAQG VLQSLTVVPG TDGVLGTIEL DPNDDIDALP DGIYAQVAVY SDHFSHVSVM
RKVLLRMTSW MHYLYLDH