Gene EcSMS35_1146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1146 
Symbol 
ID6145381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1164952 
End bp1172028 
Gene Length7077 bp 
Protein Length2358 aa 
Translation table11 
GC content50% 
IMG OID641616024 
Productputative invasin 
Protein accessionYP_001743213 
Protein GI170680815 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0236559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000373637 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTACGA AGAAGAGAAG TGGAGAAGAA ATAAATGACC GACAAATCTT ATGCGGGATG 
GGAATTAAAC TACGCCGCTT AACTGCGGGT ATCTGCCTGA TAACTCAACT TGCGTTCCCT
ATGGCTGCGG CAGCACAAGG TGTGGTAAAC ACCGCAACCC AACAACCAGT TCCTGCACAA
ATTGCCATTG CAAATGCCAA TACGGTGCCC TACACCCTTG GAGCGCTGGA ATCGGCCCAA
AGCGTTGCCG AACGTTTCGG TATTTCGGTG GCTGAGTTAC GCAAACTCAA CCAGTTTCGT
ACGTTTGCTC GAGGTTTTGA TAATGTCCGC CAGGGTGATG AACTGGATGT CCCGGCACAA
GTTAGTGAAA ATAATTTAAC CCCGCCACCG GGTAATAGCA GTGGCAACCT TGAGCAACAG
ATAGCCAGTA CTTCACAGCC AATCGGGTCT CTGCTCGCCG AAGATATGAA CAGCGAGCAA
GCGGCAAATA TGGCGCGTGG ATGGGCCTCT TCTCAGGCTT CAGGCGCAAT GACAGACTGG
TTAAGCCGCT TCGGTACCGC AAGAATCACG CTGGGCGTGG ATGAAGATTT TAGCCTGAAG
AACTCCCAGT TCGATTTTCT CCATCCGTGG TATGAAACGC CTGATAATCT CTTTTTCAGT
CAGCATACTC TCCATCGTAC TGACGAGCGT ACGCAGATTA ACAACGGCTT AGGTTGGCGT
CATTTCACTC CCACATGGAT GTCGGGCATC AACTTCTTTT TCGACCACGA TCTTAGCCGT
TACCACTCCC GCGCCGGCAT TGGCGCGGAG TACTGGCGCG ACTATCTAAA ATTAAGCAGT
AACGGCTATT TGCCACTGAC CAACTGGCGC AGCGCACCTG AGCTGGACAA CGATTATGAA
GCCCGCCCGG CCAATGGCTG GGATGTACGC GCAGAAGGCT GGCTACCCGC CTGGCCGCAC
CTTGGCGGTA AACTGGTCTA TGAACAGTAT TATGGCGATG AAGTGGCCCT GTTCGATAAA
GACGATCGGC AAAGTAATCC TCATACCATA ACCGCTGGAC TTAACTATAC CCCCTTCCCG
CTGATGACCT TCAGCGCGGA GCAACGCCAG GGTAAACAGG GCGAAAATGA CACCCGTTTT
GCCGTCGATT TTACCTGGCA ACCTGGCAGC GCAATGCAGA AACAGCTTGA CCCGAACGAA
GTCGCTGCAC GGCGTAGCCT TGCAGGCAGC CGTTATGATC TGGTGGATCG CAACAACAAC
ATCGTTCTGG AATACCGCAA AAAAGAACTG GTTCGCCTGA CCCTGACAGA CCCCGTGACA
GGGAAGTCAG GAGAAGTGAA ATCATTGGTT TCGTCGCTAC AAACCAAATA TGCCCTGAAA
GGCTATAACG TCGAAGCCAC CGCTCTGGAA GCTGCCGGTG GCAAAGTGGT CACAACGGGT
AAAGATATTC TGGTTACCCT GCCGGCTTAC CGGTTCACCA GTACGCCAGA AACCGATAAC
ACCTGGCCGA TTGAAGTCAC CGCTGAAGAT GTCAAAGGCA ATTTGTCGAA TCGTGAACAA
AGCATGGTGG TCGTTCAGGC ACCTACGCTA AGCCAGAAAG ATTCCTCGGT ATCGTTAAGT
ACCCAAACAT TGAGCGCGGA TTCCCATTCA ACCGCCACAC TGACTTTTAT TGCTCATGAT
GCAGCAGGTA ATCCTGTTAT CGGGCTGGTG CTTTCGACGC GTCACGAAGG AGTTCAGGAC
ATCACCCTTT CTGAATGGAA AGATAATGGT GACGGAAGCT ATACCCAGAT CCTGACCACA
GGAGCGATGT CTGGCACGCT GACGCTGATG CCACAGCTGA ATGGTGTGGA TGCGGCTAAA
GCCCCCGCCG TGGTGAATAT CATTTCTATT TCGTCATCCC GAACTCACTC GTCAATTAAA
ATTGATAAAG ACCGTTATCT CTCCGGCAAT CCTATCGAGG TGACGGTAGA ACTGAGAGAT
GAAAATGACA AACCTGTTAA GGAGCAAAAA CAGCAACTGA ATAACGCAGT CAGCATCGAC
AACGTGAAAC CTGGTGTCAC TACAGACTGG AAAGAAACCG CAGATGGCGT CTATAAGGCA
ACCTATACCG CCTATACCAA AGGCAGTGGG CTTACTGCGA AGCTATTAAT GCAAAACTGG
AATGAAGATT TGCATACCGC TGGTTTTATC ATCGACGCCA ACCCGCAGTC AGCAAAAATT
GCGACATTAT CTGCCAGCAA TAATGGTGTG CTCGCCAATG AGAATGCAGC AAACACCGTC
TCGGTCAATG TCGCTGATGA AGGAAGCAAC CCAATCAATG ATCATACCGT CACGTTTGCG
GTATTAAGCG GATCGGCAAC TTCCTTCAAC AATCAAAACA CCGCAAAAAC GGATGTTAAT
GGTCTGGCGA CTTTTGATCT GAAAAGTAGT AAGCAGGAAG ACAACACGGT TGAAGTCACC
CTTGAAAATG GCGTGAAACA AACGTTAATC GTCAGTTTTG TCGGCGACTC GAGTACCGCG
CAGGTTGAGC TGCAGAAGTC GAAAAATGAA GTGGTCGCTG ACGGCAATGA TAGCGCCACA
ATGACCGCGA CAGTCCGGGA TGCAAAAGGC AACCTGCTCA ATGACGTCAA GGTCACTTTC
AATGTTAATT CAGCAGAGGC GAAACTGAGC CAAACAGAAG TGAATAGCCA CGACGGGATC
GCCACAGCTA CGCTGACCAG TTTGAAAAAT GGTGATTATA GGGTTACGGC CTCTGTGAGC
TCTGGTTCTC AGGCTAATCA ACAGGTGATT TTTATCGGTG ATCAAAGTAC TGCTGCCCTG
ACCCTCAGTG TGCCTTCAGG TGATATCACC GTCACCAACA CAGCTCCGCT ACATATGACT
GCAACCTTGC AGGATAAAAA TGGCAACCCA TTAAAAGATA AAGAAATCAC CTTCTCTGTG
CCAAACGACG TCGCAAGTCG GTTCTCGATT AGCAACAGCG GAAAAGGCAT GACGGATAGC
AACGGGACTG CAATCGCCTC CCTGACCGGC ACGTTAGCGG GCACGCATAT GATCACGGCT
CGTCTGGCTA ACAGCAATGT CAGCGATACA CAGCCAATGA CGTTTGTGGC GGATAAAGAC
AGAGCGGTTG TCGTTCTGCA AACATCGAAA GCGGAAATCA TTGGGAATGG CGTGGATGAG
ACAACTCTGA CAGCAACAGT GAAAGATCCG TCGAATCATC CGGTGGCGGG AATAACGGTG
AACTTCACCA TGCCACAGGG CGTGGCGGCA AACTTTACCC TCGAAAATAA CGGCATTGCC
ATCACCCAAG CCAATGGGGA AGCGCATGTC ACGCTCAAAG GTAAAAAAGC GGGTACACAT
ACGGTTACCG CAACGCTGGG TAATAACAAT ACCAGTGATT CGCAGCCGGT AACATTTGTG
GCGGACAAAA CCTCGGCTCA GGTTGTCCTG CAGATGTCAA AAGATGAGAT CACAGGTAAT
GGCGTCGATA ACGCAACGCT AACTGCAACG GTTAAAGATC AGTTCGACAA TGAGGTGAAT
AATCTTCCGG TAACATTCAG CTCAGCCTCT TCAGGACTCA CCCTGACCCC GGGAGTAAGT
AATACCAATG AGTCTGGCAT CGCGCAGGCC ACTCTCGCAG GCGTTGCCTT TGGTGAGCAG
ACGGTCACTG CATCACTGGC TAATAATGGT GCCAGCGACA ACAAAACTGT GCATTTTATT
GGCGACACAG CAGCGGCAAA AATTATCGAG TTGACGCCTG TCCCAGACAG CATAATCGCC
GGTACCCCGC AGAACAGCTC CGGCAGCGTC ATCACCGCCA CAGTCGTTGA TAATAATGGC
TTTCCGGTGA AAGGTGTGAC TGTGAACTTC ACCAGCAGAA CAAACTCTGC CGAAATGACG
AATGGCGGCC AAGCCGTAAC GAACGAACAG GGTAAGGCTA CCGTCACTTA TACCAATACC
CGCTCCTCGA TAGAATCAGG AGCGAGACCG GATACCGTTG AGGCCAGTCT GGAAAATGGT
AGCTCCACGC TTAGCACATC AATTAATGTC AACGCTGATG CGTCTACGGC ACATCTCACC
TTGCTACAGG CACTTTTTGA TACAGTCTCC GCAGGCGACA CTACCAATCT GTATATTGAG
GTGAAGGATA ATTACGGCAA CGGTGTACCC CAGCAGGAGG TAACCCTCAG AGTATCACCA
AGTGAAGGCG TGACCCCCAG TAATAACGCT ATATATACTA CCAACCACGA CGGCAATTTT
TACGCAAGCT TTACCGCTAC AAAAGCCGGG GTTTATCAAG TGACGGCAAC CCTCGAAAAT
GGCGATTCGA TGCAACAAAC AGTGACCTAT GTGCCGAACG TCGCGAATGC CGAAATCACG
CTGGCAGCCT CGAAGGATCC GTTGATTGCC GACAATAACG ATCTCACGAC ACTAACAGCA
ACAGTCGCTG ATACAGAGGG CAATGCGATA GCCAACACTG AAGTAACATT TACTCTGCCG
GAAGATGTGA AGGCGAACTT CACGCTGAGC GATGGCGGTA AAGCGATTAC TGATGCTGAA
GGCAAAGCGA AAGTCACGCT GAAAGGTACA AAAGCAGGCG CTCATACTGT TACAGCATCG
ATGACTGGCG GTAAGAGTGA GCAGTTGGTG GTGAACTTTA TTGCGGATAC GCTCAGTGCG
CAGGTTAATC TTAACGTTAC CGAGGACAAC TTTATCGCCA ATAACGTTGG GATGACCACA
CTTCAGGCAA CAGTGACTGA TGGAAACGGC AACCCGTTAG CCAATGAGGC GGTGACATTC
ACGCTACCGG CAGACGTGAG CGCAAGCTTC ACTCTCGGAC AAGGCGGTTC CGCCATTACT
GATATCAACG GCAAGGCTGA AGTTACACTG AGCGGTACAA AATCCGGCAC CTACCCCGTG
ACAGTTAGCG TGAACAATTA TGGTGTCAGT GATACGAAAC AGGTGACTTT GATTGCCGAT
GCTGGTACCG CAACACTAGC CTCCTTAACC TCTGTATACT CATTCGTCGT CAGCACGACC
GAGGGCGCGA CCATGACTGC AAGCGTCACT GACGCTAACG GCAACCCGGT AGAAGGCATA
AAAGTTAATT TCCGCGGAAC CTCCGTCACG CTAAGCAGCA CCAGCGTTGA AACGGATGAT
CAGGGTTTCG CTGAAATTCT TGTGACAAGC ACCGAAGTCG GACTGAAAAC AGTTTCAGCC
TCTCTGGCAG ATAAACCTAC TGAAGTCATA TCGCGATTAC TGAATGCAAA AGCAGATATT
AATTCTGCAA CGATTACCAG TCTGGAGATA CCTGAAGGTC AGCTAATGGT CGCACAAGAC
GTAGCAGTTA AAGCTCACGT CAACGACCAG TTTGGCAACC CGATTCTTAA TGAATCTGTA
ACATTCAGTG CAGAGCCACC AGAGCACATG ACCATCAGCC AAAATATTGT CTCTACTGAT
ACGCATGGTA TAGCCGAGGT CTCCATGACG CCCGAAAGAA ACGGTTCGTA TATGGTGAAA
GCATCCCTGG CGAATGGAGC CTCACTTGAG AAACAACTGG AGGCTATTGA TGAAAAACTG
ACACTCACGG CGTCCAGTCC GCTTATCGGT GTCTATGCCC CTACAGGCAC TACTCTGACG
GCAACGCTAA CCTCTGCAAA TGGCACTCCA GTGGAGGGTC AGGTCATCAA CTTTAGCGTA
ACGCCAGAAG GGGCGACGTT AAGTGGCGGA AAAGTGAGAA CTAACTCTTC AGGTCAGGCT
CCGGTCGTTC TGACCAGCAA TAAAGTCGGT ACATATACGG TGACTGCATC GTTCCATAAC
GGCGTAACAA TACAGACACA GACAACCGTG AAAGTCACTG GCAACTCAAG CACCGCACAT
GTTGCTAGCT TTATCGCTGA TCCATCGACT ATCGCCGCCA CCAACAGTGA TTTAAGTACC
TTAAAGGCAA CGGTTGAGGA TGGCAGTGGT AACCTGATCG AAGGTCTCAC TGTGTACTTC
GCCTTAAAAA GCGGCTCTGC CACATTAACG TCATTAACAG CGGTGACCGA TCAAAACGGA
ATCGCGACAA CAAGCGTGAA AGGAGCGATG ACAGGTAGCG TCACGGTAAG CGCAGTCACG
ACCGCTGGTG GAATGCAAAC AGTAGATATA ACGCTGGTGG CTGGCCCGGC AGACACCTCG
CAGTCCGTCC TTAAGAGCAA TCGGTCATCA CTGAAAGGGG ACTATACCGA TAGTGCTGAA
TTACGTCTTG TTCTGCACGA TATATCAGGC AATCCGATCA AAGTTTCTGA AGGGATGGAA
TTTGTGCAAT CAGGTACTAA CGTGCCCTAT ATAAAAATTA GCGCAATTGA TTACAGTCTA
AATATCAACG GTGATTACAA AGCCACTGTT ACAAGCGGCG GAGAGGGTAT CGCAACGCTG
ATCCCTGTAT TGAATGGTGT TCATCAAGCT GGTCTGAGTA CCACAATACA ATTCACTCGC
GCAGAAGACA AAATAATGAG CGGTACAGTA TCAGTCAATG GTACTGACCT ACCGACAACT
ACATTCCCTT CGCAGGGGTT CACCGGGGCG TATTATCAGT TGAATAATGA CAACTTTGCC
CCAGGAAAAA CGGCGGCTGA TTATGAGTTT TCAAGCTCTG CCTCCTGGGT CGATGTTGAT
GCTACCGGTA AAGTGACATT TAAAAATGTC GGCAGCAATT GGGAAAGGAT TACGGCGACG
CCAAAATCAG GAGGCCCTAG CTATGTATAC GAAATCCGTG TGAAGAGTTG GTGGGTGAAC
GCCGGCGAGG CTTTCATGAT ATACAGCCTT GCTGAAAATT TTTGCAGCAG CAATGGCTAC
ACGCTCCCCA GAGCAAACTA TTTAAACCAC AGTAGTTCCC GAGGCATCGG GTCACTGTAC
AGTGAATGGG GAGATATGGG GCATTACACG ACTGACGCTG GTTTTCAATC AAATATGTAT
TGGTCATCTA GTCCCGCAAA CTCAAGCGAA CAATACGTAG TTTCCCTGGC AACAGGTGAT
CAAAGCGTAT TTGAAAAGCT TGGGTTTGCT TATGCGACAT GTTATAAAAA CCTGTGA
 
Protein sequence
MATKKRSGEE INDRQILCGM GIKLRRLTAG ICLITQLAFP MAAAAQGVVN TATQQPVPAQ 
IAIANANTVP YTLGALESAQ SVAERFGISV AELRKLNQFR TFARGFDNVR QGDELDVPAQ
VSENNLTPPP GNSSGNLEQQ IASTSQPIGS LLAEDMNSEQ AANMARGWAS SQASGAMTDW
LSRFGTARIT LGVDEDFSLK NSQFDFLHPW YETPDNLFFS QHTLHRTDER TQINNGLGWR
HFTPTWMSGI NFFFDHDLSR YHSRAGIGAE YWRDYLKLSS NGYLPLTNWR SAPELDNDYE
ARPANGWDVR AEGWLPAWPH LGGKLVYEQY YGDEVALFDK DDRQSNPHTI TAGLNYTPFP
LMTFSAEQRQ GKQGENDTRF AVDFTWQPGS AMQKQLDPNE VAARRSLAGS RYDLVDRNNN
IVLEYRKKEL VRLTLTDPVT GKSGEVKSLV SSLQTKYALK GYNVEATALE AAGGKVVTTG
KDILVTLPAY RFTSTPETDN TWPIEVTAED VKGNLSNREQ SMVVVQAPTL SQKDSSVSLS
TQTLSADSHS TATLTFIAHD AAGNPVIGLV LSTRHEGVQD ITLSEWKDNG DGSYTQILTT
GAMSGTLTLM PQLNGVDAAK APAVVNIISI SSSRTHSSIK IDKDRYLSGN PIEVTVELRD
ENDKPVKEQK QQLNNAVSID NVKPGVTTDW KETADGVYKA TYTAYTKGSG LTAKLLMQNW
NEDLHTAGFI IDANPQSAKI ATLSASNNGV LANENAANTV SVNVADEGSN PINDHTVTFA
VLSGSATSFN NQNTAKTDVN GLATFDLKSS KQEDNTVEVT LENGVKQTLI VSFVGDSSTA
QVELQKSKNE VVADGNDSAT MTATVRDAKG NLLNDVKVTF NVNSAEAKLS QTEVNSHDGI
ATATLTSLKN GDYRVTASVS SGSQANQQVI FIGDQSTAAL TLSVPSGDIT VTNTAPLHMT
ATLQDKNGNP LKDKEITFSV PNDVASRFSI SNSGKGMTDS NGTAIASLTG TLAGTHMITA
RLANSNVSDT QPMTFVADKD RAVVVLQTSK AEIIGNGVDE TTLTATVKDP SNHPVAGITV
NFTMPQGVAA NFTLENNGIA ITQANGEAHV TLKGKKAGTH TVTATLGNNN TSDSQPVTFV
ADKTSAQVVL QMSKDEITGN GVDNATLTAT VKDQFDNEVN NLPVTFSSAS SGLTLTPGVS
NTNESGIAQA TLAGVAFGEQ TVTASLANNG ASDNKTVHFI GDTAAAKIIE LTPVPDSIIA
GTPQNSSGSV ITATVVDNNG FPVKGVTVNF TSRTNSAEMT NGGQAVTNEQ GKATVTYTNT
RSSIESGARP DTVEASLENG SSTLSTSINV NADASTAHLT LLQALFDTVS AGDTTNLYIE
VKDNYGNGVP QQEVTLRVSP SEGVTPSNNA IYTTNHDGNF YASFTATKAG VYQVTATLEN
GDSMQQTVTY VPNVANAEIT LAASKDPLIA DNNDLTTLTA TVADTEGNAI ANTEVTFTLP
EDVKANFTLS DGGKAITDAE GKAKVTLKGT KAGAHTVTAS MTGGKSEQLV VNFIADTLSA
QVNLNVTEDN FIANNVGMTT LQATVTDGNG NPLANEAVTF TLPADVSASF TLGQGGSAIT
DINGKAEVTL SGTKSGTYPV TVSVNNYGVS DTKQVTLIAD AGTATLASLT SVYSFVVSTT
EGATMTASVT DANGNPVEGI KVNFRGTSVT LSSTSVETDD QGFAEILVTS TEVGLKTVSA
SLADKPTEVI SRLLNAKADI NSATITSLEI PEGQLMVAQD VAVKAHVNDQ FGNPILNESV
TFSAEPPEHM TISQNIVSTD THGIAEVSMT PERNGSYMVK ASLANGASLE KQLEAIDEKL
TLTASSPLIG VYAPTGTTLT ATLTSANGTP VEGQVINFSV TPEGATLSGG KVRTNSSGQA
PVVLTSNKVG TYTVTASFHN GVTIQTQTTV KVTGNSSTAH VASFIADPST IAATNSDLST
LKATVEDGSG NLIEGLTVYF ALKSGSATLT SLTAVTDQNG IATTSVKGAM TGSVTVSAVT
TAGGMQTVDI TLVAGPADTS QSVLKSNRSS LKGDYTDSAE LRLVLHDISG NPIKVSEGME
FVQSGTNVPY IKISAIDYSL NINGDYKATV TSGGEGIATL IPVLNGVHQA GLSTTIQFTR
AEDKIMSGTV SVNGTDLPTT TFPSQGFTGA YYQLNNDNFA PGKTAADYEF SSSASWVDVD
ATGKVTFKNV GSNWERITAT PKSGGPSYVY EIRVKSWWVN AGEAFMIYSL AENFCSSNGY
TLPRANYLNH SSSRGIGSLY SEWGDMGHYT TDAGFQSNMY WSSSPANSSE QYVVSLATGD
QSVFEKLGFA YATCYKNL