Gene EcSMS35_4506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4506 
SymboldinF 
ID6143547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4605345 
End bp4606670 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content56% 
IMG OID641619322 
ProductDNA-damage-inducible SOS response protein 
Protein accessionYP_001746434 
Protein GI170682064 
COG category[V] Defense mechanisms 
COG ID[COG0534] Na+-driven multidrug efflux pump 
TIGRFAM ID[TIGR00797] putative efflux protein, MATE family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTCC TCACTTCATC TGATAAAGCG CTCTGGCATC TCGCCTTACC CATGATTTTC 
TCCAATATCA CCGTTCCGTT GCTGGGACTG GTCGATACGG CGGTAATTGG TCATCTTGAT
AGTCCGGTTT ATTTGGGCGG CGTGGCGGTT GGTGCAACGG CGACCAGCTT TCTCTTTATG
CTGTTGCTGT TTTTACGCAT GAGCACCACC GGGCTGACTG CGCAGGCTTA TGGTGCCAAA
AATCCTCAGG CATTAGCCCG TGCGCTGGTG CAACCGTTGC TGTTGGCGTT GGGGGCTGGG
GCGTTAATTG CGCTGCTGCG TACGCCGATT ATCGATCTGG CGCTGCATAT TGTTGGCGGC
AGCGAAGCGG TGCTAGAACA GGCGCGACGC TTTCTTGAAA TCCGCTGGTT AAGTGCACCG
GCGTCGCTGG CGAATCTGGT ATTACTTGGT TGGTTACTCG GTGTGCAATA TGCCCGTGCG
CCAGTAATTT TGTTAGTGGT CGGCAATATC CTCAACATTG TGCTGGATGT CTGGCTGGTG
ATGGGGCTGC ATATGAACGT GCAGGGCGCG GCGCTGGCGA CGGTTATTGC GGAATATGCA
ACATTGCTGA TTGGTCTGCT AATGGTGCGT AAAATCCTCA AACTACGCGG AATCTCCGGC
GAAATGCTGA AAACTGCCTG GCGAGGAAAC TTCCGTCACT TGCTGGCGCT TAACCGCGAT
ATCATGCTGC GCTCGCTGTT GTTGCAACTC TGTTTCGGCG CGATCACCGT ACTTGGCGCG
CGACTGGGGA GTGACATTAT CGCTGTTAAC GCGGTTCTGA TGACGTTACT CACCTTTACC
GCCTATGCGC TGGATGGTTT TGCCTACGCG GTTGAAGCGC ATTCCGGTCA GGCGTACGGT
GCGCGCGATG GTAGCCAGTT ACTGGATGTC TGGCGGGCAG CGTGCCGCCA GTCGGGGATT
GTAGCGTTAC TGTTTTCGGT GGTTTATTTG CTGGCAGGGG AACACATCAT TGCGTTGCTG
ACGTCGTTAA CCCAGATTCA GCAGCTGGCT GACCGCTATC TTATCTGGCA GGTGATTTTG
CCGTTGGTCG GCGTCTGGTG TTATCTGCTC GACGGCATGT TTATAGGTGC AACGCGCGCC
GCCGAAATGC GTAACAGTAT GGCGGTGGCC GCCGCAGGTT TTGCGCTGAC GCTCCTTACG
CTGCCGTGGC TGGGGAATCA TGGTTTGTGG CTGGCATTAA CCGTCTTTCT GGCGTTACGC
GGGCTTTCTC TGGCGGCTAT CTGGCGGCGT CACTGGCGCA ACGGTACCTG GTTTGCCGCA
ACGTGA
 
Protein sequence
MAFLTSSDKA LWHLALPMIF SNITVPLLGL VDTAVIGHLD SPVYLGGVAV GATATSFLFM 
LLLFLRMSTT GLTAQAYGAK NPQALARALV QPLLLALGAG ALIALLRTPI IDLALHIVGG
SEAVLEQARR FLEIRWLSAP ASLANLVLLG WLLGVQYARA PVILLVVGNI LNIVLDVWLV
MGLHMNVQGA ALATVIAEYA TLLIGLLMVR KILKLRGISG EMLKTAWRGN FRHLLALNRD
IMLRSLLLQL CFGAITVLGA RLGSDIIAVN AVLMTLLTFT AYALDGFAYA VEAHSGQAYG
ARDGSQLLDV WRAACRQSGI VALLFSVVYL LAGEHIIALL TSLTQIQQLA DRYLIWQVIL
PLVGVWCYLL DGMFIGATRA AEMRNSMAVA AAGFALTLLT LPWLGNHGLW LALTVFLALR
GLSLAAIWRR HWRNGTWFAA T