Gene EcSMS35_3911 details

Gene Information

Locus tag: EcSMS35_3911
Symbol:
ID: 6146208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3983271
End bp: 3984407
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 51%
IMG OID: 641618738
Product: membrane fusion protein family protein
Protein accession: YP_001745877
Protein GI: 170680467
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID:
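
The gene and protein lengths above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the function names are illustrative, not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3983271, 3984407)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the coordinates listed for this gene, the values agree with the Gene Length and Protein Length fields (1137 bp and 378 aa).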


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTAC TGATTATTTT AACCTATGTG GCTTTCGCAT GGGCAATGTT TAAGATCTTT 
AAAATTCCTG TAAATAAATG GACCATTCCC ACAGCGGCCC TGGGAGGCAT ATTTATTGTC
AGCGGTCTAA TTCTGTTAAT GAACTATAAC CATCCGTATA CCTTTAAAGC GCAAAAAGCG
GTTATTTCTA TTCCTGTTGT CCCACAGGTG ACAGGCGTGG TGATCGAAGT GACGGACAAG
AAAAATACGC TGATTAAAAA AGGTGAGGTG CTATTTCGAC TGGACCCGAC GCGTTATCAG
GCGCGGGTGG ATCGGCTGAT GGCGGATATC GTTACCGCAG AACATAAACA GCGGGCGTTG
GGCGCAGAGT TAGATGAGAT GGCGGCGAAT ACTCAGCAGG CAAAGGCCAC GCGGGATAAA
TTCGCTAAAG AGTATCAGCG TTACGCACGC GGCAGTCAGG CGAAAGTAAA CCCGTTTTCA
GAACGCGATA TCGATGTGGC GCGGCAAAAT TATCTGGCGC AGGAAGCCTC CGTGAAGTCA
TCGGCGGCGG AACAAAAACA GATCCTGAGC CAGCTGGATA GCCTGGTGTT GGGTGAACAT
TCTCAAATCG CCAGCCTGAA AGCGCAGCTC GCGGAAGCAA AATATAACCT TGAGCAGACG
ATAGTGCGTG CGCCGAGCGA TGGTTATGTG ACCCAGGTAC TGATTCGTCC GGGTACCTAT
GCCGCGTCGC TGCCGCTACG TCCGGTGATG GTGTTTATAC CCGATCAGAA ACGACAAATC
GTGGCGCAGT TCCGTCAGAA CTCCTTGCTG CGACTGGCTC CTGGCGACGA TGCGGAAGTG
GTGTTTAATG CTCTGCCAGG TAAGGTATTC AGCGGTAAGC TGGCAGCCAT TAGTCCAGCC
GTTCCCGGCG GAGCTTATCA GTCGACCGGC ACCTTACAGA CGTTAAACAC AGCGCCGGGT
TCAGATGGCG TTATCGCGAC CATTGAACTG GATGAGCATA CTGATTTGAG CGCATTACCA
GACGGTATTT ACGCCCAGGT GGCGGTCTAC TCTGATCATT TCAGCCATGT CTCGGTGATG
CGCAAAGTAC TGTTACGCAT GACCAGCTGG GTGCATTACC TTTATCTCGA TCATTAA
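
The GC content field can be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted as a string with the 10-base grouping and line breaks used above (`gc_content` is an illustrative helper name):

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in seq, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence; paste the full sequence to check the record.
first_line = "ATGGATTTAC TGATTATTTT AACCTATGTG GCTTTCGCAT GGGCAATGTT TAAGATCTTT"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```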
 
Protein sequence
MDLLIILTYV AFAWAMFKIF KIPVNKWTIP TAALGGIFIV SGLILLMNYN HPYTFKAQKA 
VISIPVVPQV TGVVIEVTDK KNTLIKKGEV LFRLDPTRYQ ARVDRLMADI VTAEHKQRAL
GAELDEMAAN TQQAKATRDK FAKEYQRYAR GSQAKVNPFS ERDIDVARQN YLAQEASVKS
SAAEQKQILS QLDSLVLGEH SQIASLKAQL AEAKYNLEQT IVRAPSDGYV TQVLIRPGTY
AASLPLRPVM VFIPDQKRQI VAQFRQNSLL RLAPGDDAEV VFNALPGKVF SGKLAAISPA
VPGGAYQSTG TLQTLNTAPG SDGVIATIEL DEHTDLSALP DGIYAQVAVY SDHFSHVSVM
RKVLLRMTSW VHYLYLDH
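
The protein sequence can be checked against the gene sequence by translating it codon by codon. A minimal sketch, assuming the codon assignments of translation table 11 are the same as the standard code for internal codons (the alternative start codons that table 11 allows are not handled here; this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative:

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
first_line = "ATGGATTTAC TGATTATTTT AACCTATGTG GCTTTCGCAT GGGCAATGTT TAAGATCTTT"
last_line = "CGCAAAGTAC TGTTACGCAT GACCAGCTGG GTGCATTACC TTTATCTCGA TCATTAA"
print(translate(first_line))  # MDLLIILTYVAFAWAMFKIF
print(translate(last_line))   # RKVLLRMTSWVHYLYLDH
```

The two printed fragments correspond to the first 20 and last 18 residues of the protein sequence listed above.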