Gene Information |
Locus tag | EcSMS35_1146 |
Symbol | |
ID | 6145381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1164952 |
End bp | 1172028 |
Gene Length | 7077 bp |
Protein Length | 2358 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616024 |
Product | putative invasin |
Protein accession | YP_001743213 |
Protein GI | 170680815 |
COG category | |
COG ID | |
TIGRFAM ID | |
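The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1164952
END_BP = 1172028


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 7077, matching "Gene Length"
print(protein_length(length))  # 2358, matching "Protein Length"
```

The strand field does not enter this calculation: the span is the same on either strand, and only the reading direction changes.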
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0236559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00000000373637 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTACGA AGAAGAGAAG TGGAGAAGAA ATAAATGACC GACAAATCTT ATGCGGGATG GGAATTAAAC TACGCCGCTT AACTGCGGGT ATCTGCCTGA TAACTCAACT TGCGTTCCCT ATGGCTGCGG CAGCACAAGG TGTGGTAAAC ACCGCAACCC AACAACCAGT TCCTGCACAA ATTGCCATTG CAAATGCCAA TACGGTGCCC TACACCCTTG GAGCGCTGGA ATCGGCCCAA AGCGTTGCCG AACGTTTCGG TATTTCGGTG GCTGAGTTAC GCAAACTCAA CCAGTTTCGT ACGTTTGCTC GAGGTTTTGA TAATGTCCGC CAGGGTGATG AACTGGATGT CCCGGCACAA GTTAGTGAAA ATAATTTAAC CCCGCCACCG GGTAATAGCA GTGGCAACCT TGAGCAACAG ATAGCCAGTA CTTCACAGCC AATCGGGTCT CTGCTCGCCG AAGATATGAA CAGCGAGCAA GCGGCAAATA TGGCGCGTGG ATGGGCCTCT TCTCAGGCTT CAGGCGCAAT GACAGACTGG TTAAGCCGCT TCGGTACCGC AAGAATCACG CTGGGCGTGG ATGAAGATTT TAGCCTGAAG AACTCCCAGT TCGATTTTCT CCATCCGTGG TATGAAACGC CTGATAATCT CTTTTTCAGT CAGCATACTC TCCATCGTAC TGACGAGCGT ACGCAGATTA ACAACGGCTT AGGTTGGCGT CATTTCACTC CCACATGGAT GTCGGGCATC AACTTCTTTT TCGACCACGA TCTTAGCCGT TACCACTCCC GCGCCGGCAT TGGCGCGGAG TACTGGCGCG ACTATCTAAA ATTAAGCAGT AACGGCTATT TGCCACTGAC CAACTGGCGC AGCGCACCTG AGCTGGACAA CGATTATGAA GCCCGCCCGG CCAATGGCTG GGATGTACGC GCAGAAGGCT GGCTACCCGC CTGGCCGCAC CTTGGCGGTA AACTGGTCTA TGAACAGTAT TATGGCGATG AAGTGGCCCT GTTCGATAAA GACGATCGGC AAAGTAATCC TCATACCATA ACCGCTGGAC TTAACTATAC CCCCTTCCCG CTGATGACCT TCAGCGCGGA GCAACGCCAG GGTAAACAGG GCGAAAATGA CACCCGTTTT GCCGTCGATT TTACCTGGCA ACCTGGCAGC GCAATGCAGA AACAGCTTGA CCCGAACGAA GTCGCTGCAC GGCGTAGCCT TGCAGGCAGC CGTTATGATC TGGTGGATCG CAACAACAAC ATCGTTCTGG AATACCGCAA AAAAGAACTG GTTCGCCTGA CCCTGACAGA CCCCGTGACA GGGAAGTCAG GAGAAGTGAA ATCATTGGTT TCGTCGCTAC AAACCAAATA TGCCCTGAAA GGCTATAACG TCGAAGCCAC CGCTCTGGAA GCTGCCGGTG GCAAAGTGGT CACAACGGGT AAAGATATTC TGGTTACCCT GCCGGCTTAC CGGTTCACCA GTACGCCAGA AACCGATAAC ACCTGGCCGA TTGAAGTCAC CGCTGAAGAT GTCAAAGGCA ATTTGTCGAA TCGTGAACAA AGCATGGTGG TCGTTCAGGC ACCTACGCTA AGCCAGAAAG ATTCCTCGGT ATCGTTAAGT ACCCAAACAT TGAGCGCGGA TTCCCATTCA ACCGCCACAC TGACTTTTAT TGCTCATGAT GCAGCAGGTA ATCCTGTTAT CGGGCTGGTG CTTTCGACGC GTCACGAAGG AGTTCAGGAC ATCACCCTTT CTGAATGGAA AGATAATGGT GACGGAAGCT ATACCCAGAT CCTGACCACA 
GGAGCGATGT CTGGCACGCT GACGCTGATG CCACAGCTGA ATGGTGTGGA TGCGGCTAAA GCCCCCGCCG TGGTGAATAT CATTTCTATT TCGTCATCCC GAACTCACTC GTCAATTAAA ATTGATAAAG ACCGTTATCT CTCCGGCAAT CCTATCGAGG TGACGGTAGA ACTGAGAGAT GAAAATGACA AACCTGTTAA GGAGCAAAAA CAGCAACTGA ATAACGCAGT CAGCATCGAC AACGTGAAAC CTGGTGTCAC TACAGACTGG AAAGAAACCG CAGATGGCGT CTATAAGGCA ACCTATACCG CCTATACCAA AGGCAGTGGG CTTACTGCGA AGCTATTAAT GCAAAACTGG AATGAAGATT TGCATACCGC TGGTTTTATC ATCGACGCCA ACCCGCAGTC AGCAAAAATT GCGACATTAT CTGCCAGCAA TAATGGTGTG CTCGCCAATG AGAATGCAGC AAACACCGTC TCGGTCAATG TCGCTGATGA AGGAAGCAAC CCAATCAATG ATCATACCGT CACGTTTGCG GTATTAAGCG GATCGGCAAC TTCCTTCAAC AATCAAAACA CCGCAAAAAC GGATGTTAAT GGTCTGGCGA CTTTTGATCT GAAAAGTAGT AAGCAGGAAG ACAACACGGT TGAAGTCACC CTTGAAAATG GCGTGAAACA AACGTTAATC GTCAGTTTTG TCGGCGACTC GAGTACCGCG CAGGTTGAGC TGCAGAAGTC GAAAAATGAA GTGGTCGCTG ACGGCAATGA TAGCGCCACA ATGACCGCGA CAGTCCGGGA TGCAAAAGGC AACCTGCTCA ATGACGTCAA GGTCACTTTC AATGTTAATT CAGCAGAGGC GAAACTGAGC CAAACAGAAG TGAATAGCCA CGACGGGATC GCCACAGCTA CGCTGACCAG TTTGAAAAAT GGTGATTATA GGGTTACGGC CTCTGTGAGC TCTGGTTCTC AGGCTAATCA ACAGGTGATT TTTATCGGTG ATCAAAGTAC TGCTGCCCTG ACCCTCAGTG TGCCTTCAGG TGATATCACC GTCACCAACA CAGCTCCGCT ACATATGACT GCAACCTTGC AGGATAAAAA TGGCAACCCA TTAAAAGATA AAGAAATCAC CTTCTCTGTG CCAAACGACG TCGCAAGTCG GTTCTCGATT AGCAACAGCG GAAAAGGCAT GACGGATAGC AACGGGACTG CAATCGCCTC CCTGACCGGC ACGTTAGCGG GCACGCATAT GATCACGGCT CGTCTGGCTA ACAGCAATGT CAGCGATACA CAGCCAATGA CGTTTGTGGC GGATAAAGAC AGAGCGGTTG TCGTTCTGCA AACATCGAAA GCGGAAATCA TTGGGAATGG CGTGGATGAG ACAACTCTGA CAGCAACAGT GAAAGATCCG TCGAATCATC CGGTGGCGGG AATAACGGTG AACTTCACCA TGCCACAGGG CGTGGCGGCA AACTTTACCC TCGAAAATAA CGGCATTGCC ATCACCCAAG CCAATGGGGA AGCGCATGTC ACGCTCAAAG GTAAAAAAGC GGGTACACAT ACGGTTACCG CAACGCTGGG TAATAACAAT ACCAGTGATT CGCAGCCGGT AACATTTGTG GCGGACAAAA CCTCGGCTCA GGTTGTCCTG CAGATGTCAA AAGATGAGAT CACAGGTAAT GGCGTCGATA ACGCAACGCT AACTGCAACG GTTAAAGATC AGTTCGACAA TGAGGTGAAT AATCTTCCGG TAACATTCAG CTCAGCCTCT TCAGGACTCA CCCTGACCCC GGGAGTAAGT AATACCAATG 
AGTCTGGCAT CGCGCAGGCC ACTCTCGCAG GCGTTGCCTT TGGTGAGCAG ACGGTCACTG CATCACTGGC TAATAATGGT GCCAGCGACA ACAAAACTGT GCATTTTATT GGCGACACAG CAGCGGCAAA AATTATCGAG TTGACGCCTG TCCCAGACAG CATAATCGCC GGTACCCCGC AGAACAGCTC CGGCAGCGTC ATCACCGCCA CAGTCGTTGA TAATAATGGC TTTCCGGTGA AAGGTGTGAC TGTGAACTTC ACCAGCAGAA CAAACTCTGC CGAAATGACG AATGGCGGCC AAGCCGTAAC GAACGAACAG GGTAAGGCTA CCGTCACTTA TACCAATACC CGCTCCTCGA TAGAATCAGG AGCGAGACCG GATACCGTTG AGGCCAGTCT GGAAAATGGT AGCTCCACGC TTAGCACATC AATTAATGTC AACGCTGATG CGTCTACGGC ACATCTCACC TTGCTACAGG CACTTTTTGA TACAGTCTCC GCAGGCGACA CTACCAATCT GTATATTGAG GTGAAGGATA ATTACGGCAA CGGTGTACCC CAGCAGGAGG TAACCCTCAG AGTATCACCA AGTGAAGGCG TGACCCCCAG TAATAACGCT ATATATACTA CCAACCACGA CGGCAATTTT TACGCAAGCT TTACCGCTAC AAAAGCCGGG GTTTATCAAG TGACGGCAAC CCTCGAAAAT GGCGATTCGA TGCAACAAAC AGTGACCTAT GTGCCGAACG TCGCGAATGC CGAAATCACG CTGGCAGCCT CGAAGGATCC GTTGATTGCC GACAATAACG ATCTCACGAC ACTAACAGCA ACAGTCGCTG ATACAGAGGG CAATGCGATA GCCAACACTG AAGTAACATT TACTCTGCCG GAAGATGTGA AGGCGAACTT CACGCTGAGC GATGGCGGTA AAGCGATTAC TGATGCTGAA GGCAAAGCGA AAGTCACGCT GAAAGGTACA AAAGCAGGCG CTCATACTGT TACAGCATCG ATGACTGGCG GTAAGAGTGA GCAGTTGGTG GTGAACTTTA TTGCGGATAC GCTCAGTGCG CAGGTTAATC TTAACGTTAC CGAGGACAAC TTTATCGCCA ATAACGTTGG GATGACCACA CTTCAGGCAA CAGTGACTGA TGGAAACGGC AACCCGTTAG CCAATGAGGC GGTGACATTC ACGCTACCGG CAGACGTGAG CGCAAGCTTC ACTCTCGGAC AAGGCGGTTC CGCCATTACT GATATCAACG GCAAGGCTGA AGTTACACTG AGCGGTACAA AATCCGGCAC CTACCCCGTG ACAGTTAGCG TGAACAATTA TGGTGTCAGT GATACGAAAC AGGTGACTTT GATTGCCGAT GCTGGTACCG CAACACTAGC CTCCTTAACC TCTGTATACT CATTCGTCGT CAGCACGACC GAGGGCGCGA CCATGACTGC AAGCGTCACT GACGCTAACG GCAACCCGGT AGAAGGCATA AAAGTTAATT TCCGCGGAAC CTCCGTCACG CTAAGCAGCA CCAGCGTTGA AACGGATGAT CAGGGTTTCG CTGAAATTCT TGTGACAAGC ACCGAAGTCG GACTGAAAAC AGTTTCAGCC TCTCTGGCAG ATAAACCTAC TGAAGTCATA TCGCGATTAC TGAATGCAAA AGCAGATATT AATTCTGCAA CGATTACCAG TCTGGAGATA CCTGAAGGTC AGCTAATGGT CGCACAAGAC GTAGCAGTTA AAGCTCACGT CAACGACCAG TTTGGCAACC CGATTCTTAA TGAATCTGTA ACATTCAGTG CAGAGCCACC 
AGAGCACATG ACCATCAGCC AAAATATTGT CTCTACTGAT ACGCATGGTA TAGCCGAGGT CTCCATGACG CCCGAAAGAA ACGGTTCGTA TATGGTGAAA GCATCCCTGG CGAATGGAGC CTCACTTGAG AAACAACTGG AGGCTATTGA TGAAAAACTG ACACTCACGG CGTCCAGTCC GCTTATCGGT GTCTATGCCC CTACAGGCAC TACTCTGACG GCAACGCTAA CCTCTGCAAA TGGCACTCCA GTGGAGGGTC AGGTCATCAA CTTTAGCGTA ACGCCAGAAG GGGCGACGTT AAGTGGCGGA AAAGTGAGAA CTAACTCTTC AGGTCAGGCT CCGGTCGTTC TGACCAGCAA TAAAGTCGGT ACATATACGG TGACTGCATC GTTCCATAAC GGCGTAACAA TACAGACACA GACAACCGTG AAAGTCACTG GCAACTCAAG CACCGCACAT GTTGCTAGCT TTATCGCTGA TCCATCGACT ATCGCCGCCA CCAACAGTGA TTTAAGTACC TTAAAGGCAA CGGTTGAGGA TGGCAGTGGT AACCTGATCG AAGGTCTCAC TGTGTACTTC GCCTTAAAAA GCGGCTCTGC CACATTAACG TCATTAACAG CGGTGACCGA TCAAAACGGA ATCGCGACAA CAAGCGTGAA AGGAGCGATG ACAGGTAGCG TCACGGTAAG CGCAGTCACG ACCGCTGGTG GAATGCAAAC AGTAGATATA ACGCTGGTGG CTGGCCCGGC AGACACCTCG CAGTCCGTCC TTAAGAGCAA TCGGTCATCA CTGAAAGGGG ACTATACCGA TAGTGCTGAA TTACGTCTTG TTCTGCACGA TATATCAGGC AATCCGATCA AAGTTTCTGA AGGGATGGAA TTTGTGCAAT CAGGTACTAA CGTGCCCTAT ATAAAAATTA GCGCAATTGA TTACAGTCTA AATATCAACG GTGATTACAA AGCCACTGTT ACAAGCGGCG GAGAGGGTAT CGCAACGCTG ATCCCTGTAT TGAATGGTGT TCATCAAGCT GGTCTGAGTA CCACAATACA ATTCACTCGC GCAGAAGACA AAATAATGAG CGGTACAGTA TCAGTCAATG GTACTGACCT ACCGACAACT ACATTCCCTT CGCAGGGGTT CACCGGGGCG TATTATCAGT TGAATAATGA CAACTTTGCC CCAGGAAAAA CGGCGGCTGA TTATGAGTTT TCAAGCTCTG CCTCCTGGGT CGATGTTGAT GCTACCGGTA AAGTGACATT TAAAAATGTC GGCAGCAATT GGGAAAGGAT TACGGCGACG CCAAAATCAG GAGGCCCTAG CTATGTATAC GAAATCCGTG TGAAGAGTTG GTGGGTGAAC GCCGGCGAGG CTTTCATGAT ATACAGCCTT GCTGAAAATT TTTGCAGCAG CAATGGCTAC ACGCTCCCCA GAGCAAACTA TTTAAACCAC AGTAGTTCCC GAGGCATCGG GTCACTGTAC AGTGAATGGG GAGATATGGG GCATTACACG ACTGACGCTG GTTTTCAATC AAATATGTAT TGGTCATCTA GTCCCGCAAA CTCAAGCGAA CAATACGTAG TTTCCCTGGC AACAGGTGAT CAAAGCGTAT TTGAAAAGCT TGGGTTTGCT TATGCGACAT GTTATAAAAA CCTGTGA
|
Protein sequence | MATKKRSGEE INDRQILCGM GIKLRRLTAG ICLITQLAFP MAAAAQGVVN TATQQPVPAQ IAIANANTVP YTLGALESAQ SVAERFGISV AELRKLNQFR TFARGFDNVR QGDELDVPAQ VSENNLTPPP GNSSGNLEQQ IASTSQPIGS LLAEDMNSEQ AANMARGWAS SQASGAMTDW LSRFGTARIT LGVDEDFSLK NSQFDFLHPW YETPDNLFFS QHTLHRTDER TQINNGLGWR HFTPTWMSGI NFFFDHDLSR YHSRAGIGAE YWRDYLKLSS NGYLPLTNWR SAPELDNDYE ARPANGWDVR AEGWLPAWPH LGGKLVYEQY YGDEVALFDK DDRQSNPHTI TAGLNYTPFP LMTFSAEQRQ GKQGENDTRF AVDFTWQPGS AMQKQLDPNE VAARRSLAGS RYDLVDRNNN IVLEYRKKEL VRLTLTDPVT GKSGEVKSLV SSLQTKYALK GYNVEATALE AAGGKVVTTG KDILVTLPAY RFTSTPETDN TWPIEVTAED VKGNLSNREQ SMVVVQAPTL SQKDSSVSLS TQTLSADSHS TATLTFIAHD AAGNPVIGLV LSTRHEGVQD ITLSEWKDNG DGSYTQILTT GAMSGTLTLM PQLNGVDAAK APAVVNIISI SSSRTHSSIK IDKDRYLSGN PIEVTVELRD ENDKPVKEQK QQLNNAVSID NVKPGVTTDW KETADGVYKA TYTAYTKGSG LTAKLLMQNW NEDLHTAGFI IDANPQSAKI ATLSASNNGV LANENAANTV SVNVADEGSN PINDHTVTFA VLSGSATSFN NQNTAKTDVN GLATFDLKSS KQEDNTVEVT LENGVKQTLI VSFVGDSSTA QVELQKSKNE VVADGNDSAT MTATVRDAKG NLLNDVKVTF NVNSAEAKLS QTEVNSHDGI ATATLTSLKN GDYRVTASVS SGSQANQQVI FIGDQSTAAL TLSVPSGDIT VTNTAPLHMT ATLQDKNGNP LKDKEITFSV PNDVASRFSI SNSGKGMTDS NGTAIASLTG TLAGTHMITA RLANSNVSDT QPMTFVADKD RAVVVLQTSK AEIIGNGVDE TTLTATVKDP SNHPVAGITV NFTMPQGVAA NFTLENNGIA ITQANGEAHV TLKGKKAGTH TVTATLGNNN TSDSQPVTFV ADKTSAQVVL QMSKDEITGN GVDNATLTAT VKDQFDNEVN NLPVTFSSAS SGLTLTPGVS NTNESGIAQA TLAGVAFGEQ TVTASLANNG ASDNKTVHFI GDTAAAKIIE LTPVPDSIIA GTPQNSSGSV ITATVVDNNG FPVKGVTVNF TSRTNSAEMT NGGQAVTNEQ GKATVTYTNT RSSIESGARP DTVEASLENG SSTLSTSINV NADASTAHLT LLQALFDTVS AGDTTNLYIE VKDNYGNGVP QQEVTLRVSP SEGVTPSNNA IYTTNHDGNF YASFTATKAG VYQVTATLEN GDSMQQTVTY VPNVANAEIT LAASKDPLIA DNNDLTTLTA TVADTEGNAI ANTEVTFTLP EDVKANFTLS DGGKAITDAE GKAKVTLKGT KAGAHTVTAS MTGGKSEQLV VNFIADTLSA QVNLNVTEDN FIANNVGMTT LQATVTDGNG NPLANEAVTF TLPADVSASF TLGQGGSAIT DINGKAEVTL SGTKSGTYPV TVSVNNYGVS DTKQVTLIAD AGTATLASLT SVYSFVVSTT EGATMTASVT DANGNPVEGI KVNFRGTSVT LSSTSVETDD QGFAEILVTS TEVGLKTVSA SLADKPTEVI SRLLNAKADI NSATITSLEI PEGQLMVAQD VAVKAHVNDQ FGNPILNESV 
TFSAEPPEHM TISQNIVSTD THGIAEVSMT PERNGSYMVK ASLANGASLE KQLEAIDEKL TLTASSPLIG VYAPTGTTLT ATLTSANGTP VEGQVINFSV TPEGATLSGG KVRTNSSGQA PVVLTSNKVG TYTVTASFHN GVTIQTQTTV KVTGNSSTAH VASFIADPST IAATNSDLST LKATVEDGSG NLIEGLTVYF ALKSGSATLT SLTAVTDQNG IATTSVKGAM TGSVTVSAVT TAGGMQTVDI TLVAGPADTS QSVLKSNRSS LKGDYTDSAE LRLVLHDISG NPIKVSEGME FVQSGTNVPY IKISAIDYSL NINGDYKATV TSGGEGIATL IPVLNGVHQA GLSTTIQFTR AEDKIMSGTV SVNGTDLPTT TFPSQGFTGA YYQLNNDNFA PGKTAADYEF SSSASWVDVD ATGKVTFKNV GSNWERITAT PKSGGPSYVY EIRVKSWWVN AGEAFMIYSL AENFCSSNGY TLPRANYLNH SSSRGIGSLY SEWGDMGHYT TDAGFQSNMY WSSSPANSSE QYVVSLATGD QSVFEKLGFA YATCYKNL
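Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The gene sequence begins with ATG and ends with TGA, which suggests it is shown in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch of how one might clean a pasted sequence and run basic sanity checks; the helper names are hypothetical, and the start-codon set is a simplification (translation table 11 permits additional, rarer start codons).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}     # stop codons in translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}   # simplification: most common bacterial starts


def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 with a common start and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


# First three blocks of the gene sequence above, as an illustration.
fragment = clean_sequence("ATGGCTACGA AGAAGAGAAG TGGAGAAGAA")
print(len(fragment))  # 30
```

Applied to the full gene sequence, `len(clean_sequence(...))` should match the reported gene length of 7077 bp, and `gc_percent` should land near the reported 50% if the record is internally consistent.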