Gene EcSMS35_A0010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_A0010 
SymboltraI 
ID6106491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010488 
Strand
Start bp5854 
End bp11124 
Gene Length5271 bp 
Protein Length1756 aa 
Translation table11 
GC content60% 
IMG OID641614757 
Productconjugal transfer nickase/helicase TraI 
Protein accessionYP_001739898 
Protein GI170650784 
COG category[L] Replication, recombination and repair 
COG ID[COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member 
TIGRFAM ID[TIGR02686] conjugative relaxase domain, TrwC/TraI family
[TIGR02760] conjugative transfer relaxase protein TraI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0549712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGTA TCGCGCAGGT CAGATCGGCC GGAAGTGCCG GTAACTATTA TACCGACAAG 
GATAATTACT ATGTACTGGG CAGCATGGGA GAACGCTGGG CCGGCAGGGG GGCTGAACAG
CTGGGACTGC AGGGCAGTGT CGATAAGGAT GTTTTTACCC GTCTTCTGGA GGGAAGGCTG
CCGGACGGAG CGGATCTAAG CCGCATGCAG GATGGCAGTA ATAAGCATCG TCCCGGCTAC
GACCTGACCT TCTCCGCCCC CAAAAGTGTC TCCATGATGG CCATGTTAGG TGGCGATAAG
CGCCTGATTG ATGCACATAA CCAGGCCGTG GACTTTGCTG TTCGTCAGGT GGAGGCGCTG
GCCTCCACAC GGGTGATGAC GGACGGACAG TCAGAAACGG TACTGACCGG TAATCTGGTG
ATGGCACTGT TTAACCACGA CACCAGTCGC GATCAGGAAC CACAGTTACA CACGCATGCG
GTGGTAGCTA ATGTCACGCA GCATAACGGT GAGTGGAAGA CACTGAGCAG TGACAAAGTG
GGGAAAACGG GGTTCATTGA GAATGTGTAC GCTAATCAGA TTGCCTTTGG CAGGCTCTAC
CGGGAAAAAC TGAAAGAGCA GGTTGAGGCG CTGGGCTATG AAACGGAAGT GGTGGGTAAG
CACGGTATGT GGGAAATGCC GGGCGTACCG GTGGAGGCCT TTTCCGGACG CTCACAGACT
ATCCGGGAGG CCGTCGGGGA GGACGCCTCG CTGAAATCCC GGGATGTGGC GGCGCTGGAT
ACGCGTAAAT CCAAACAGCA CGTCGATCCT GAAATCAAAA TGACCGAGTG GATGCAGACG
CTGAAGGAAA CCGGGTTCGA TATCCGGGCG TATCGTGACG CAGCGGAACA ACGGGCATAT
ACCCGCACGC AGACACCCGG ACCAGCTTCA CAGGACGGGC CGGATGTGCA GCAGGCGGTG
ACACAGGCGA TTGCCGGATT AAGTGAACGC AAAGTGCAGT TTACGTACAC GGACGTGCTG
GCCAGGACGG TCGGCATACT GCCGCCGGAA GCCGGTGTGA TTGAGCGGGC GCGCGCCGGT
ATCGATGAGG CCATCAGTCG TGAGCAGCTT ATCCCCCTCG ACCGTGAGAA GGGGCTGTTC
ACATCAGGAA TTCATGTGCT CGATGAGCTG TCCGTCCGGG CACTCAGTCG TGACATCATG
AAACAGAACC GGGTGACCGT ACATCCGGAG AAAAGTGTCC CCCGGACGGC TGGTTACAGC
GATGCGGTCA GTGTTCTGGC ACAGGACCGT CCGTCGCTGG CCATTGTGTC CGGGCAGGGC
GGTGCAGCCG GGCAGCGTGA GCGGGTGGCT GAACTGGTCA TGATGGCCCG GGAGCAGGGG
CGGGAGGTGC AGATTATCGC TGCTGACCGT CGCTCGCAGA TGAACCTGAA GCAGGATGAA
CGGCTGTCCG GTGAGCTGAT AACCGGACGT CGTCAGTTGC TGGAAGGCAT GGCCTTCACG
CCGGGCAGTA CTGTTATCGT TGACCAGGGC GAAAAACTCT CCCTGAAAGA GACGTTAACC
CTGCTGGACG GTGCCGCACG TCATAACGTA CAGGTCCTGA TAACCGACAG CGGGCAGCGA
ACCGGTACAG GCAGTGCGCT GATGGCCATG AAGGATGCCG GGGTGAACAC ATATCGCTGG
CAGGGGGGAG AACAGCGACC GGCCACCATC ATCAGTGAAC CGGACCGTAA TGTCCGCTAT
GCCCGGCTGG CAGGAGATTT TGCGGCCAGC GTGAAAGCCG GAGAAGAGAG CGTGGCACAG
GTCAGCGGGG TACGGGAACA GGCCATACTG ACACAGGCCA TTCGCAGTGA GCTGAAAACA
CAGGGCGTGC TCGGACATCA GGAGGTGACC ATGACGGCAC TGTCACCGGT CTGGCTGGAC
AGCCGGAGCC GTTATCTGCG GGATATGTAC CGCCCGGGAA TGGTGATGGA GCAGTGGAAC
CCGGAGACAC GCAGTCATGA CCGCTATGTG ATAGACCGGG TGACGGCGCA GAGTCACAGC
CTGACCCTGC GGGATGCGCA GGGTGAAACG CAGGTGGTGC GTATTTCTTC TCTGGACAGC
AGCTGGTCGC TGTTCCGGCC GGAAAAAATG CCGGTGGCAG ACGGCGAGCG TCTGAGGGTG
ACAGGGAAAA TTCCCGGACT CCGCGTCTCC GGCGGTGACC GCCTGCAGGT GGCATCCGTC
AGTGAAGATG CGATGACGGT TGTTGTGCCG GGGCGGGCTG AACCGGCCTC CCTGCCTGTG
GCTGATTCAC CGTTCACGGC ACTGAAGCTG GAGAACGGCT GGGTGGAAAC GCCCGGGCAT
TCCGTCAGCG ACAGTGCGAA GGTTTTTGCC GCCGTCACAC AGATGGCAAT GGACAACGCC
ACCCTGAACG GTCTGGCCCG CAGCGGTCGT GATGTCCGGC TGTATTCCTC ACTGGATGAA
ACCCGTACTG CGGAAAAACT TGCCCGCCAT CCCTCCTTTA CGGTGGTTTC TGAGCAGATA
AAGGCGCATG CCGGTGAGAC ATCGCTGGAA ACCTCTATCA GTCTGCAGAA AGCCGGGCTG
CACACGCCGG CACAGCAGGC CATTCATCTG GCCCTTCCTG TGCTGGAAAG TAAAAAACTG
GCCTTCAGCA TGGTGGACCT GCTGACAGAG GCAAAGTCGT TTGCTGCAGA AGGAACCGGT
TTTACTGAAC TGGGAGGGGA AATCAATGCG CAGATAAAAC GGGGTGATTT ACTGTATGTG
GATGTGGCAA AAGGCTATGG CACAGGCCTG CTGGTTTCCC GTGCGTCGTA TGAGGCAGAA
AAGAGCATTC TTCGCCATAT TCTCGAAGGT AAGGAGGCGG TCACGCCGCT GATGGAGAGA
GTACCTGGCG AACTCATGGA GAAACTGACA TCAGGACAGC GTGCCGCTAC CCGCATGATA
CTGGAAACGT CCGACCGTTT TACGGTGGTG CAGGGCTATG CCGGCGTGGG TAAGACCACA
CAGTTCCGGG CGGTGATGTC AGCCGTGAAC ATGCTGCCGG AGAGTATGCG TCCCCGTGTC
GTGGGGCTGG GGCCCACGCA CCGTGCGGTC GGGGAGATGC GCAGCGCCGG CGTGGATGCA
CAGACACTGG CGTCCTTTCT GCATGACACG CAGCTGCAGC AGCGCAGCGG AGAAACGCCG
GATTTCAGCA ACACGCTGTT CCTGCTCGAT GAGAGCTCAA TGGTGGGCAA TACCGACATG
GCACGGGCAT ACGCCCTGAT TGCGGCCGGT GGCGGTCGTG CTGTGGCCAG CGGTGACACG
GACCAGCTGC AGGCCATCGC GCCCGGTCAG CCTTTCCGTC TCCAGCAGAC GCGCAGTGCT
GCCGATGTGG CCATCATGAA GGAGATTGTG CGTCAGACGC CGGAACTGCG GGAGGCGGTA
TACAACCTGA TTAACCGGGA TGTGGAAAGG GCACTGTCCG GGCTTGAGCG TGTGAAACCG
TCTCAGGTGC CACGTCTGGA GGGCGCATGG GCACCGGAGC ACTCCGTGAT GGAGTTCAGT
CACAGCCAGG AAGCGAAACT GGCAGAAGCG CAGCAGAAGG CGATGCTGAA AGGCGAGGCT
TTCCCGGATG TCCCCATGAC ACTGTATGAA GCCATTGTCC GCGACTATAC CGGCAGAACA
CCGGAAGCAC GGGAGCAGAC GCTGATTGTC ACGCACCTGA ATGAGGACCG GCGCGTACTG
AACAGCATGA TTCATGATGC ACGGGAAAAG GCCGGTGAGC TGGGGAAAGA GCAGGTCATG
GTGCCTGTCC TGAACACTGC GAATATACGC GACGGGGAGC TGCGTCGTCT CTCCACCTGG
GAGACTCATC GGGACGCACT TGTCCTGGTG GATAATGTGT ATCACCGGAT TGCCGGTATC
AGTAAGGATG ACGGGCTGAT AACCCTGGAG GATGCGGAAG GTAACACGCG GCTGATTTCG
CCCCGGGAGG CGGTGGCTGA AGGCGTCACA CTGTACACCC CGGACACCAT CCGGGTGGGA
ACCGGTGACC GGATGCGCTT CACGAAGAGT GACCGGGAGC GCGGTTATGT GGCCAACAGC
GTCTGGACGG TGACAGCGGT TTCTGGTGAC AGTGTCACAC TTTCTGACGG TCAGCAGACC
CGGGTGATTC GCCCCGGTCA GGAGCGGGCA GAGCAACATA TTGACCTGGC CTATGCCATC
ACCGCCCACG GTGCGCAGGG GGCAAGTGAA ACCTTTGCCA TTGCGCTTGA AGGCACGGAA
GGTAACCGGA AACAGATGGC CGGCTTTGAG TCAGCCTACG TGGCCCTGTC GCGTATGAAG
CAGCATGTGC AGGTGTACAC CGATAACCGT CAGGGCTGGA CGGATGCCAT TAACAATGCC
GTACAGAAAG GAACGGCCCA CGATGTATTT GAGCCGAAAC CGGACCGGGA GGTCATGAAT
GCAGAGCGGC TGTTCAGTAC GGCGCGGGAA CTGCGGGACG TGGCGGCAGG GCGTGCTGTT
CTCCGTCAGG CGGGGCTGGC CGGGGGAGAC AGTCCTGCAC GGTTTATTGC TCCGGGACGT
AAATATCCGC AGCCGTATGT GGCACTGCCG GCGTTTGACC GTAACGGCAA GTCCGCCGGT
ATCTGGCTGA ACCCACTGAC CACGGATGAC GGAAACGGGC TGCGGGGATT CAGTGGTGAA
GGCCGTGTGA AAGGCAGCGG GGATGCACAG TTTGTGGCCC TGCAGGGCAG CCGTAACGGA
GAAAGCCTGC TGGCTGATAA TATGCAGGAG GGTGTCCGGA TTGCCCGTGA TAATCCTGAC
AGTGGTGTGG TGGTAAGAAT CGCCGGTGAA GGCCGTCCGT GGAATCCCGG TGCCATAACC
GGTGGCCGCG TGTGGGGGGA TATCCCGGAC AACAGCGTCC AGCCGGGAGC CGGAAATGGC
GAACCGGTCA CGGCAGAGGT GCTGGCACAG CGGCAGGCTG AAGAGGCCAT CCGCCGTGAA
ACGGAACGCC GTGCAGATGA AATTGTCCGT AAAATGGCAG AGAACAAACC TGACCTGCCG
GATGGCAAAA CAGAGCAGGC TGTCAGGGAG ATTGCCGGGC AGGAGCGTGA CCGGGCTGCC
ATAACTGAAC GGGAAGCCGC GCTGCCGGAG AGTGTGCTGC GTGAATCACA ACGGGAGCGG
GAAGCGGTCC GGGAGGTTGC CCGGGAAAAT CTGCTGCAGG AGCGTCTGCA GCAGATGGAG
CGGGATATGG TTCGTGACCT GCAGAAAGAG AAAACCCTGG GTGGAGACTG A
 
Protein sequence
MMSIAQVRSA GSAGNYYTDK DNYYVLGSMG ERWAGRGAEQ LGLQGSVDKD VFTRLLEGRL 
PDGADLSRMQ DGSNKHRPGY DLTFSAPKSV SMMAMLGGDK RLIDAHNQAV DFAVRQVEAL
ASTRVMTDGQ SETVLTGNLV MALFNHDTSR DQEPQLHTHA VVANVTQHNG EWKTLSSDKV
GKTGFIENVY ANQIAFGRLY REKLKEQVEA LGYETEVVGK HGMWEMPGVP VEAFSGRSQT
IREAVGEDAS LKSRDVAALD TRKSKQHVDP EIKMTEWMQT LKETGFDIRA YRDAAEQRAY
TRTQTPGPAS QDGPDVQQAV TQAIAGLSER KVQFTYTDVL ARTVGILPPE AGVIERARAG
IDEAISREQL IPLDREKGLF TSGIHVLDEL SVRALSRDIM KQNRVTVHPE KSVPRTAGYS
DAVSVLAQDR PSLAIVSGQG GAAGQRERVA ELVMMAREQG REVQIIAADR RSQMNLKQDE
RLSGELITGR RQLLEGMAFT PGSTVIVDQG EKLSLKETLT LLDGAARHNV QVLITDSGQR
TGTGSALMAM KDAGVNTYRW QGGEQRPATI ISEPDRNVRY ARLAGDFAAS VKAGEESVAQ
VSGVREQAIL TQAIRSELKT QGVLGHQEVT MTALSPVWLD SRSRYLRDMY RPGMVMEQWN
PETRSHDRYV IDRVTAQSHS LTLRDAQGET QVVRISSLDS SWSLFRPEKM PVADGERLRV
TGKIPGLRVS GGDRLQVASV SEDAMTVVVP GRAEPASLPV ADSPFTALKL ENGWVETPGH
SVSDSAKVFA AVTQMAMDNA TLNGLARSGR DVRLYSSLDE TRTAEKLARH PSFTVVSEQI
KAHAGETSLE TSISLQKAGL HTPAQQAIHL ALPVLESKKL AFSMVDLLTE AKSFAAEGTG
FTELGGEINA QIKRGDLLYV DVAKGYGTGL LVSRASYEAE KSILRHILEG KEAVTPLMER
VPGELMEKLT SGQRAATRMI LETSDRFTVV QGYAGVGKTT QFRAVMSAVN MLPESMRPRV
VGLGPTHRAV GEMRSAGVDA QTLASFLHDT QLQQRSGETP DFSNTLFLLD ESSMVGNTDM
ARAYALIAAG GGRAVASGDT DQLQAIAPGQ PFRLQQTRSA ADVAIMKEIV RQTPELREAV
YNLINRDVER ALSGLERVKP SQVPRLEGAW APEHSVMEFS HSQEAKLAEA QQKAMLKGEA
FPDVPMTLYE AIVRDYTGRT PEAREQTLIV THLNEDRRVL NSMIHDAREK AGELGKEQVM
VPVLNTANIR DGELRRLSTW ETHRDALVLV DNVYHRIAGI SKDDGLITLE DAEGNTRLIS
PREAVAEGVT LYTPDTIRVG TGDRMRFTKS DRERGYVANS VWTVTAVSGD SVTLSDGQQT
RVIRPGQERA EQHIDLAYAI TAHGAQGASE TFAIALEGTE GNRKQMAGFE SAYVALSRMK
QHVQVYTDNR QGWTDAINNA VQKGTAHDVF EPKPDREVMN AERLFSTARE LRDVAAGRAV
LRQAGLAGGD SPARFIAPGR KYPQPYVALP AFDRNGKSAG IWLNPLTTDD GNGLRGFSGE
GRVKGSGDAQ FVALQGSRNG ESLLADNMQE GVRIARDNPD SGVVVRIAGE GRPWNPGAIT
GGRVWGDIPD NSVQPGAGNG EPVTAEVLAQ RQAEEAIRRE TERRADEIVR KMAENKPDLP
DGKTEQAVRE IAGQERDRAA ITEREAALPE SVLRESQRER EAVREVAREN LLQERLQQME
RDMVRDLQKE KTLGGD