Gene EcSMS35_3775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3775 
Symbol 
ID6143018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3838419 
End bp3841154 
Gene Length2736 bp 
Protein Length911 aa 
Translation table11 
GC content57% 
IMG OID641618601 
ProductABC transporter, ATP binding protein 
Protein accessionYP_001745741 
Protein GI170680682 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCATC TGGAACTGGT TCCCGTCCCG CCTGTCGCGC AACTGGCGGG CGTGAGCCAG 
CATTATGGAA AAACCGTTGC GCTGAACAAT ATCACTCTCG ATATTCCGGC CCGCTGTATG
GTCGGGCTGA TTGGTCCGGA TGGCGTCGGG AAGTCGAGCT TGTTGTCGTT GATTTCCGGT
GCCCGCGTCA TTGAGCAGGG CAATGTGATG GTGCTGGGCG GCGATATGCG CGACCCGAAG
CATCGCCGTG ACGTTTGCCC GCGCATCGCC TGGATGCCGC AAGGGCTGGG CAAAAACCTC
TACCACACCT TGTCGGTGTA TGAAAACGTC GATTTTTTCG CCCGCCTGTT CGGTCACGAC
AAAGCGGAGC GGGAAGTCCG AATCAATGAT CTGCTGACCA GCACCGGGTT AGCACCGTTT
CGCGATCGTC CGGCAGGTAA ACTCTCCGGC GGGATGAAGC AAAAACTTGG GCTGTGCTGT
GCGTTAATTC ACGACCCGGA ACTGTTGATT CTTGATGAGC CAACAACGGG GGTTGACCCG
CTCTCCCGTG CCCAGTTCTG GGATCTGATC GACAGTATTC GCCAGCGGCA GAGCAATATG
AGCGTGCTGG TCGCCACCGC CTACATGGAA GAGGCCGAAC GCTTCGACTG GCTGGTAGCG
ATGAATGCCG GAGAAGTGCT GGCAACCGGC AGCGCCGAAG AGCTGCGGCA GCAAACGCAA
AGCGCCACGC TGGAAGAAGC ATTTATAAAT CTGTTACCGC AAGCGCAACG CCAGGCGCAT
CAGGCGGTAG TGATCCCACC GTATCAACCG GAAAACGCAG AGATTGCCAT CGAAGCGCGC
GATCTGACCA TGCGTTTTGG TTCCTTCGTT GCCGTTGATC ACGTCAATTT CCGCATTCCA
CGCGGGGAGA TTTTTGGTTT TCTTGGTTCG AACGGCTGCG GTAAATCCAC CACCATGAAA
ATGCTTACCG GATTGCTGCC CGCCAGCGAA GGTGAGGCGT GGCTGTTCGG GCAGCCGGTT
GATCCAAAAG ATATCGATAC CCGCCGTCGT GTAGGCTATA TGTCGCAGGC GTTTTCGCTC
TATAACGAAC TCACCGTGCG GCAAAACCTT GAGTTACATG CCCGTTTGTT TCACATCCCG
GAAGCGGAAA TTCCCGCAAG AGTGGCTGAA ATGAGCGAGC GTTTTAAGCT CAACGACGTT
GAAGATGTTC TGCCGGAGTC ATTGCCGCTC GGCATTCGCC AGCGGCTTTC GCTGGCGGTG
GCGGTGATTC ATCGCCCGGA GATGTTAATC CTCGATGAGC CTACTTCTGG TGTCGATCCG
GTGGCGAGGG ATATGTTCTG GCAGTTGATG GTCGATCTCT CGCGCCAGGA CAAAGTGACC
ATTTTTATCT CCACCCACTT TATGAACGAA GCGGAACGTT GCGACCGTAT CTCACTGATG
CACGCCGGAA AAGTGCTCGC CAGCGGTACA CCGCAGGAAC TGGTTGAGAA ACGCGGAGCC
GCCAGTCTGG AAGAGGCATT TATCGCCTAT TTGCAGGAAG CGGCAGGGCA GAGCAACGAA
GCCGAAGCGC CGCCCGTGAT ACACGACACC ACCCACGCGC CGCGTCAGGG ATTTAGCCTG
CGCCGTCTGT TTAGCTACAG CCGTCGCGAA GCACTGGAAC TGCGCCGCGA TCCGGTACGT
TCGACGCTGG CGCTGATGGG AACGGTGATC CTGATGCTGA TAATGGGTTA CGGCATCAGT
ATGGATGTGG AAAACCTGCG CTTTGCGGTG CTCGACCGCG ACCAGACCGT CAGTAGCCAG
GCGTGGACGC TCAATCTCTC CGGTTCCCGT TACTTTATCG AGCAGCCGCC GCTCACCAGT
TATGACGAGC TTGATCGCCG GATGCGTGCG GGCGATATCA CCGTGGCGAT TGAGATCCCT
CCCAATTTCG GGCGCGATAT CGCGCGTGGT ACGCCCGTGG AACTCGGTGT CTGGATCGAC
GGTGCGATGC CGAGTCGCGC CGAAACGGTA AAAGGTTACG TGCAGGCCAT GCACCAGAGC
TGGCTACAGG ATGTGGCGAG TCGGCAATCC ACGCCCGCGA GCCAAAGCGG GCTGATGAAT
ATTGAGACAC GCTATCGCTA TAACCCGGAC GTAAAAAGCC TGCCAGCGAT TGTTCCGGCG
GTGATCCCGC TTCTGCTGAT GATGATTCCG TCGATGTTAA GCGCCCTTAG CGTAGTGCGG
GAAAAAGAGT TGGGGTCGAT TATCAACCTT TACGTTACCC CCACCACCCG CAGCGAATTT
TTACTTGGCA AACAGCTACC GTACATCGGG CTGGGAATGC TGAACTTTTT CCTGCTCTGC
GCCCTGTCGG TGTTTGTGTT TGGCGTGCCG CATAAAGGCA GTTTCCTGAC GCTCACCCTG
GCGGCGCTGC TGTATATCAT CATTGCCACC GGAATGGGGC TGCTGATCTC CACCTTTATG
AAAAGCCAGA TTGCCGCCAT TTTTGGCACT GCGATCATCA CGTTGATTCC GGCGACGCAG
TTTTCCGGGA TGATCGATCC GGTAGCTTCG CTGGAAGGTC CAGGACGTTG GATCGGCGAG
GTTTACCCGA CCAGTCATTT TCTGACTATT GCCCGCGGGA CGTTCTCGAA GGCGCTGGAT
TTGACTGATT TGTGGCAACT TTTTATCCCG TTGCTGATAG CCATCCCGCT GGTGATGGGC
TTAAGCATCC TGCTGCTGAA AAAACAGGAG GGATGA
 
Protein sequence
MTHLELVPVP PVAQLAGVSQ HYGKTVALNN ITLDIPARCM VGLIGPDGVG KSSLLSLISG 
ARVIEQGNVM VLGGDMRDPK HRRDVCPRIA WMPQGLGKNL YHTLSVYENV DFFARLFGHD
KAEREVRIND LLTSTGLAPF RDRPAGKLSG GMKQKLGLCC ALIHDPELLI LDEPTTGVDP
LSRAQFWDLI DSIRQRQSNM SVLVATAYME EAERFDWLVA MNAGEVLATG SAEELRQQTQ
SATLEEAFIN LLPQAQRQAH QAVVIPPYQP ENAEIAIEAR DLTMRFGSFV AVDHVNFRIP
RGEIFGFLGS NGCGKSTTMK MLTGLLPASE GEAWLFGQPV DPKDIDTRRR VGYMSQAFSL
YNELTVRQNL ELHARLFHIP EAEIPARVAE MSERFKLNDV EDVLPESLPL GIRQRLSLAV
AVIHRPEMLI LDEPTSGVDP VARDMFWQLM VDLSRQDKVT IFISTHFMNE AERCDRISLM
HAGKVLASGT PQELVEKRGA ASLEEAFIAY LQEAAGQSNE AEAPPVIHDT THAPRQGFSL
RRLFSYSRRE ALELRRDPVR STLALMGTVI LMLIMGYGIS MDVENLRFAV LDRDQTVSSQ
AWTLNLSGSR YFIEQPPLTS YDELDRRMRA GDITVAIEIP PNFGRDIARG TPVELGVWID
GAMPSRAETV KGYVQAMHQS WLQDVASRQS TPASQSGLMN IETRYRYNPD VKSLPAIVPA
VIPLLLMMIP SMLSALSVVR EKELGSIINL YVTPTTRSEF LLGKQLPYIG LGMLNFFLLC
ALSVFVFGVP HKGSFLTLTL AALLYIIIAT GMGLLISTFM KSQIAAIFGT AIITLIPATQ
FSGMIDPVAS LEGPGRWIGE VYPTSHFLTI ARGTFSKALD LTDLWQLFIP LLIAIPLVMG
LSILLLKKQE G