Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2821 |
Symbol | |
ID | 6972374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2621252 |
End bp | 2629114 |
Gene Length | 7863 bp |
Protein Length | 2620 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643386671 |
Product | hypothetical protein |
Protein accession | YP_002271145 |
Protein GI | 209398677 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0867101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCGG CAGCACAAGG TGTGGTAAAC GCCGCAACCC AACAACCAGT TCCTGCACAA ATTGCCATTG CAAATGCCAA TACGGTGCCC TACACCCTTG GAGCGCTGGA ATCGGCCCAA ATCGTTGCCG AACGTTTCGG TATTTCGGTG GCTGAGTTAC GCAAACTCAA CCAGTTTCGT ACGTTTGCTC GAGGTTTTGA TAATGTCCGC CAGGGTGATG AACTGGATGT CCCGGCACAA GTTAGTGAAA ATAATTTAAC CCCGCCACCG GGTAATAGCA GTGGCAACCT TGAGCAACAG ATAGCCAGTA CTTCACAGCA AATCGGGTCT CTGCTCGCCG AGGATATGAA CAGCGAGCAA GCGGCAAATA TGGCGCGTGG ATGGGCCTCT TCTCAGGCTT CAGGCGCAAT GACAGACTGG TTAAGCCGCT TCGGTACCGC AAGAATCACG CTGGGCGTGG ATGAAGATTT TAGCCTGAAG AACTCCCAGT TCGATTTTCT CCATCCGTGG TATGAAACGC CTGATAATCT CTTTTTCAGT CAGCATACTC TCCATCGTAC TGACGAGCGT ACGCAGATTA ACAACGGCTT GGGTTGGCGT CATTTCACTC CCACATGGAT GTCGGGCATC AACTTCTTTT TCGACCACGA TCTTAGCCGT TACCACTCCC GCGCCGGCAT TGGCGCGGAG TACTGGCGCG ACTATCTAAA ATTAAGCAGT AACGGCTATT TGCGACTGAC CAACTGGCGC AGCGCACCTG AACTGGACAA CGATTATGAA GCACGCCCGG CCAATGGCTG GGATGTACGC GCAGAAGGCT GGCTACCCGC CTGGCCGCAC CTTGGCGGTA AACTGGTCTA TGAACAGTAT TATGGCGATG AAGTGGCCCT GTTCGATAAA GATGATCGGC AAAGTAATCC TCATGCCATA ACCGCTGGAC TTAACTATAC CCCCTTCCCG CTGATGACCT TCAGCGCGGA GCAACGCCAG GGTAAACAGG GCGAAAATGA CACCCGTTTT GCCGTCGATT TTACCTGGCA ACCTGGAAGC GCGATGCAGA AACAGCTTGA CCCGAATGAA GTCGATGCAC GGCGTAGCCT TGCAGGCAGC CGTTTTGATC TGGTGGATCG CAACAACAAC ATCGTTCTGG AATATCGCAA AAAAGAACTG GTTCGCCTGA CCCTGACAGA CCCCGTGACA GGGAAGTCAG GAGAAGTGAA ATCACTGGTT TCGTCGCTAC AAACCAAATA TGCCCTGAAA GGCTATAACG TCGAAGCCAC CGCTCTGGAA GCTGCCGGTG GTAAAGTGGT TACAACGGGT AAAGATATTC TGGTTACCCT GCCGGCGTAC CGGTTCACCA GTACGCCAGA AACCGATAAC ACCTGGCCGA TTGAAGTCAC CGCTGAAGAT GTCAAAGGCA ATTTTTCGAA TCGTGAACAG AGCATGGTAG TCGTTCAGGC TCCTACGCTA AGCCAGAAAG ATTCCTCGGT ATCGTTAAGT AGCCAGACGT TGAGCGCGGA TTCCCATTCA ACCGCCACAC TGACTTTTAT TGCGCATGAT GCAGCAGGTA ATCCTGTTAT CGGGCTGGTG CTCTCGACGC GTCACGAAGG TGTTCAGGAC ATCACCCTTT CTGACTGGAA AGATAATGGT GACGGAAGCT ATACCCAGAT CCTGACCACA GGAGCGATGT CTGGCACGCT GACGCTGATG CCACAGCTGA ACGGTGTGGA TGCGGCTAAA GCCCCCGCCG TGGTGAATAT CATTTCTGTT TCGTCATCCC GGACTCACTC GTCAATTAAG ATTGATAAGG ACCGTTATCT CTCCGGGAAT CCTATCGAGG TGACGGTAGA ACTGAGAGAT GAAAATGACA AACCTGTTAA GGAGCAAAAA CAGCAACTGA ATACCGCAGT CAGCATCGAC AACGTGAAAC CTGGTGTCAC TACAGACTGG AAAGAAACCG CAGATGGCGT CTATAAGGCA ACCTATACCG CCTATACCAA AGGCAGTGGG CTTACTGCGA AGCTGTTAAT GCAAAACTGG AATGAAGATT TGCATACCGC TGGATTTATC ATCGACGCCA ACCCGCAGTC AGCGAAAATT GCGACATTAT CTGCCAGCAA TAATGGTGTG CTCGCCAATG AGAATGCAGC AAACACCGTC TCGGTCAATG TCGCTGATGA AGGAAGCAAC CCAATCAATG ATCATACCGT CACGTTTGCG GTATTAAGCG GATCGGCAAC TTCCTTTAAC AATCAAAACA CCGCAAAAAC GGATGTTAAT GGTCTGGCGA CTTTTGATCT GAAAAGTAGT AAGCAGGAAG ACAACACGGT TGAAGTCACC CTTGAAAATG GCGTGAAACA AACGTTAATC GTCAGTTTTG TCGGCGACTC GAGTACCGCG CAGGTTGATC TGCAGAAGTC GAAAAATGAA GTGGTCGCTG ACGGCAATGA CAGTGCCACA ATGACCGCGA CAGTTCGGGA TGCAAAAGGC AACCTGCTCA ATGACGTCAA GGTCACCTTC AATGTCAATT CAGCAGCAGC GAAACTGAGC CAAACCGAAG TGAATAGCCA CGACGGGATC GCCACAGCTA CGCTGACCAG TTTGAAAAAT GGTGATTATA CGGTTACGGC CTCTGTGAGC TCTGGTTCTC AGGCTAATCA ACAGGTGATT TTTATCGGTG ATCAAAGTAC TGCTGCCCTG ACCCTCAGTG TGCCTTCAGG TGATATCACC GTCACCAACA CAGCTCCGCT ACATATGACT GCAACCTTGC AGGATAAAAA TGGCAATCCA CTAAAAGATA AAGAAATCAC CTTCTCTGTG CCAAACGACG TCGCAAGTCG GTTCTCGATT AGCAACAGCG GAAAAGGCAT GACGGATAGC AACGGGACTG CAATCGCCTC CCTGACCGGC ACGTTAGCGG GCACGCATAT GATCACGGCT CGTCTGGCTA ACAGCAATGT CAGCGATACA CAGCCAATGA CGTTTGTGGC GGATAAAGAC AGAGCGGTTG TCGTTCTGCA AACATCGAAA GCGGAAATCA TTGGGAATGG CGTGGATGAG ACGACTCTGA CAGCAACAGT TAAAGATCCT TTTGATAACG TGGTTAAAAA TCTTTCAGTA GTCTTCCGCA CCTCCCCCGC AGACACGCAA CTGAGTCTGA ACGCGCGTAA TACTAATGAG AACGGTATTG CCGAAGTTAC CCTTAAGGGC ACGGTTTTGG GTGTTCATAC AGCCGAAGCC ATACTGCTTA ACGGCAACAG AGATACGAAA ATCGTCAATA TTGCGCCCGA TGCCAGCAAC GCGCAGGTCA CCCTGAACAT CCCTGCACAA CAGGTGGTGA CGAATAACAG TGACAGCGTG CAGCTGACGG CGACGGTGAA AGACCCGTCG AATCATCCGG TGGCGGGAAT AACGGTGAAC TTCACCATGC CACAGGACGT GGCGGCAAAC TTTACCCTTG AAAATAACGG TATTGCCATC ACTCAGGCCA ATGGCGAAGC GCATGTCACC CTCAAAGGCA AAAAAGCGGG CACGCATACT GTGACCGCCA CGCTGGGTAA CAATAATGCC AGCGATGCGC AACCAGTCAC CTTCGTGGCG GATAAGGACA GCGCGGTTGT CGTTCTGCAA ACATCGAAAG CGGAAATCAT TGGGAATGGC GTGGATGAGA CGACTCTGAC GGCAACAGTG AAAGATCCTT TTGATAACGC AGTAAAAGAT CTACAGGTCA CCTTCAGTAC CAACCCCGCA GATACTCAAC TTAGTCAGAG CAAAAGCAAT ACTAACGACA GTGGTGTGGC CGAAGTTACC TTTAAGGGCA CGGTTTTGGG TGTTCATACA GCCGAAGCCA CACTGCCTAA CGGCAACAAC GATACGAAGA TAGTCAATAT TGCGCCCGAT GCCAGCAACG CGCAGGTTAC GCTGAACATC CCTGCTCAAC AGGTGGTGAC GAATAACAGC GACAGCGTGC AGCTGACGGC GACGGTGAAA GATCCGTCGA ATCATCCGGT GGCGGGAATA ACGGTGAACT TCACCATGCC ACAGGACGTG GCGGCAAACT TTACCCTCGA AAATAACGGT ATTGCCATCA CCCAGGCCAA TGGGGAAGCG CATGTCACGC TCAAAGGTAA AAAAGCGGGT ACGCATACGG TTACCGCAAC GCTGAGTAAT AACAATACCA GTGATTCACA GCCGGTAACG TTTGTGGCGG ACAAAACCTC GGCTCTGGTT GTTCTTCAGA TATCAAAAAA TGAGATCACA GGTAATGGCG TCGATAGCGC AACGCTAACT GCAACGGTCA AAGATCAGTT CGACAATGAG GTGAACAATC TTCCGGTAAC ATTCAGCACA GCTTCTTCAG GCCTCACCCT GACCCCAGGG GAAAGTAATA CCAATGAGTC TGGCATCGCG CAGGCCACTC TCGCAGGCGT TGCCTTTGGT GAGCAGACGG TCACTGCATC ACTGGCTAAT AATGGTGCCA GCGACAACAA AACTGTGCAT TTTATTGGCG ACACAGCGGC GGCAAAAATT ATCGAGTTGA CGCCTGTCCC AGACAGCATA ATCGCAGGTA CCCCGCAGAA CAGCTCCGGC AGCGTCATCA CCGCCACAGT CGTTGATAAT AATGGCTTTC CGGTGAAAGG TGTGACTGTG AACTTCACCA GCAACGCAGC GACAGCCGAA ATGACGAATG GCGGTCAAGC CGTGACGAAC GAACAGGGTA AGGCTACCGT CACTTATACC AATACCCGCT CCTCGATAGA ATCAGGAGCG AGACCGGATA CCGTTGAGGC CAGTCTGGAA AATGGTAGCT CCACGCTTAG CACATCAATT AATGTCAACG CTGATGCGTC TACGGCACAT CTCACCTTGC TACAGGCACT TTTTGATACA GTCTCCGCAG GCGACACTAC CAATCTGTAT ATTGAGGTGA AGGATAATTA CGGCAACGGA GTACCCCAGC AGGAGGTAAC CCTCAGCGTT TCACCAAGTG AAGGTGTGAC CCCCAGTAAT AACGCTATAT ATACGACCAA TCACGACGGC AATTTTTACG CAAGCTTTAC CGCTACAAAA GCCGGGGTAT ACCAAGTGAC GGCAACCCTC GAAAATGGCG ATTCGATGCA ACAAACAGTG ACCTATGTGC CGAACGTAGC GAATGCTGAA ATCTCGCTGG CAGCCTCGAA GGATCCGGTA ATTGCCAACA ATAACGATCT CACGACACTA ACAGCAACAG TCGCTGATAC AGAGGGCAAT GCGATAGCCA ACAGTGAGGT AACATTTACT CTGCCGGAAG ATGTGAGGGC GAACTTCACG CTGGGCGATG GCGGTAAAGT GGTTACTGAT ACTGAAGGCA AAGCGAAAGT CACGCTGAAA GGTACAAAAG CAGGCGCTCA TACTGTTACA GCATCGATGG CTGGCGGTAA GAGTGAGCAG TTGGTGGTGA ACTTTATTGC GGATACACTC ACTGCGCAGG TTAATCTTAA CGTTACCGAG GACAATTTTA TCGCTAATAA CGTCGGGATG ACCAGGCTGC AGGCAACAGT GACTGATGGA AACGGCAACC CGTTAGCCAA TGAGGCGGTG ACATTCACGC TACCGGCAGA TGTGAGCGCA AGCTTTACTC TCGGACAAGG CGGTTCCGCC ATTACTGACA TCAACGGCAA GGCTGAAGTT ACACTGAGCG GTACAAAATC CGGCACCTAC CCCGTGACAG TTAGCGTGAA CAATTATGGT GTCAGTGATA CGAAACAGGT GACTTTGATT GCCGATGCTG GTACCGCAAA ACTAGCCTCC TTAACCTCTG TATACTCATT CGTCGTCAGC ACGACCGAGG GCGCGACCAT GACTGCAAGC GTCACTGACG CTAACGGCAA CCCGGTAGAA GGTATAAAAG TTAATTTCCG CGGAACTTCC GTCACGCTAA GCAGCACCAG CGTTGAAACG GATGATCGGG GTTTCGCTGA AATTCTTGTG ACAAGCACCG AGGTCGGACT GAAAACAGTT TCAGCCTCTC TGGCAGATAA ACCTACTGAA GTCATCTCGC GATTACTGAA TGCAAAAGCA GATATTAATT CTGCAACGAT TACCAGTCTG GAGATACCTG AAGGTCAGGT CATGGTCGCA CAAGACGTAG CAGTTAAAGC TCACGTCAAC GACCAGTTTG GCAATCCGAT TCTTAATGAA TCTGTAACAT TCAGTGCAGA ACCACCAGAG CACATGACCA TCAGCCAAAA TATTGTCTCT ACTGATACGC ATGGTATAGC CGAGGTCACT ATGACGCCCG AAAGAAACGG TTCGTATATG GTGAAAGCAT CCCTGGCGAA TGGATCCTCT TATGAGAAGG ATCTGGTGGT AATCGATCAA AAACTGACAC TCTCGGCGTC CAGCCCGCTT ATCGGTGTCA ATTCCCCAAC AGGTGCAACT CTGACGGCAA CGCTAACTTC TGCAAATGGC ACTCCAGTGG AGGGTCAGGT CATCAACTTT AGCGTAACGC CAGAAGGTGC GACGTTAAGT GGCGGAAAAG TGAGAACCAA CTCTTCAGGT CAGGCTCCAG TCGTTCTGAC CAGCAATAAA GTCGGTACAT ATACGGTGAC TGCATCGTTC CATAACGGCG TAACAATACA GACACAGACA ATCGTGAAAG TCACTGGCAA CTCAAGCACC GCCCATGTTG CTAGCTTTAT CGCTGATCCA TCGACTATAG CCGCCACCAA CAGTGATTTA AGTACCTTAA AGGCAACGGT TGAGGATGGC AGTGGTAACC TGATCGAAGG TCTCACTGTG TACTTCGCCT TAAAAAGCGG CTCTGCCACA TTAACGTCAT TAACAGCGGT GACAGATCAA AACGGAATCG CGACAACAAG CGTGAGAGGA GCGATAACGG GGAGCGTCAC GGTAAGCGCA GTCACGACCG CTGGTGGAAT GCAAACAGTA GATATAACGC TGGTGGCAGG CCCGGCAGAC GCCTCGCAGT CCGTCCTTAA GAACAATCGG TCATCATTGA AAGGAGACTT TACCGATAGT GCTGAGCTAC ATCTTGTTCT GCACGATATA TCAGGCAATC CGATCAAAGT TTCTGAAGGG CTGGAATTTG TGCAGTCAGG TACCAACGCG CCCTATGTGC AAGTTAGTGC AATTGACTAC AGTAAAAATT TCTCAGGCGA GTACAAAGCC ACTGTTACAG GCGGCGGAGA GGGTATCGCA ACGCTGATCC CTGTATTGAA TGGTGTTCAT CAAGCGGGTC TGAGTACCAC AATACAATTC ACTCGCGCAG AAGACAAAAT AATGAGCGGT ACAGTGTTAG TCAATGGTGC TAACCTACCG ACAACTACAT TCCCTTCGCA GGGGTTCACT GGGGCGTATT ATCAGTTGAA TAATGACAAC TTTGCCCCAG GAAAAACGGC GGCTGATTAT GAGTTTTCAA GCTCTGCCTC CTGGGTTGAT GTTGATGCTA CCGGTAAAGT GACATTTAAA AATGTCGGCA GCAAATGGGA GAGGATTACG GCGACGCCAA AAACAGGCGG CCCTAGCTAT ATATACGAAA TCCGAGTGAA GAGTTGGTGG GTGAACGCCG GCGATGCTTT CATGATATAC AGCCTTGCTG AAAATTTTTG CAGTAGCAAT GGCTACACAC TTCCCCTTGG AGACCATTTA AACCATAGTC GTTCCCGAGG CATCGGGTCA CTGTACAGTG AATGGGGAGA TATGGGGCAT TACACGACTG AAGCTGGTTT TCATTCAAAT ATGTATTGGT CATCGAGTCC CGCAAACTCA AACGAACAAT ACGTAGTTTC CCTGGCAACA GGTGATCAAA GCGTATTTGA AAAGCTTGGG TTTGCTTATG CGACATGTTA TAAAAACCTC TGA
|
Protein sequence | MAAAAQGVVN AATQQPVPAQ IAIANANTVP YTLGALESAQ IVAERFGISV AELRKLNQFR TFARGFDNVR QGDELDVPAQ VSENNLTPPP GNSSGNLEQQ IASTSQQIGS LLAEDMNSEQ AANMARGWAS SQASGAMTDW LSRFGTARIT LGVDEDFSLK NSQFDFLHPW YETPDNLFFS QHTLHRTDER TQINNGLGWR HFTPTWMSGI NFFFDHDLSR YHSRAGIGAE YWRDYLKLSS NGYLRLTNWR SAPELDNDYE ARPANGWDVR AEGWLPAWPH LGGKLVYEQY YGDEVALFDK DDRQSNPHAI TAGLNYTPFP LMTFSAEQRQ GKQGENDTRF AVDFTWQPGS AMQKQLDPNE VDARRSLAGS RFDLVDRNNN IVLEYRKKEL VRLTLTDPVT GKSGEVKSLV SSLQTKYALK GYNVEATALE AAGGKVVTTG KDILVTLPAY RFTSTPETDN TWPIEVTAED VKGNFSNREQ SMVVVQAPTL SQKDSSVSLS SQTLSADSHS TATLTFIAHD AAGNPVIGLV LSTRHEGVQD ITLSDWKDNG DGSYTQILTT GAMSGTLTLM PQLNGVDAAK APAVVNIISV SSSRTHSSIK IDKDRYLSGN PIEVTVELRD ENDKPVKEQK QQLNTAVSID NVKPGVTTDW KETADGVYKA TYTAYTKGSG LTAKLLMQNW NEDLHTAGFI IDANPQSAKI ATLSASNNGV LANENAANTV SVNVADEGSN PINDHTVTFA VLSGSATSFN NQNTAKTDVN GLATFDLKSS KQEDNTVEVT LENGVKQTLI VSFVGDSSTA QVDLQKSKNE VVADGNDSAT MTATVRDAKG NLLNDVKVTF NVNSAAAKLS QTEVNSHDGI ATATLTSLKN GDYTVTASVS SGSQANQQVI FIGDQSTAAL TLSVPSGDIT VTNTAPLHMT ATLQDKNGNP LKDKEITFSV PNDVASRFSI SNSGKGMTDS NGTAIASLTG TLAGTHMITA RLANSNVSDT QPMTFVADKD RAVVVLQTSK AEIIGNGVDE TTLTATVKDP FDNVVKNLSV VFRTSPADTQ LSLNARNTNE NGIAEVTLKG TVLGVHTAEA ILLNGNRDTK IVNIAPDASN AQVTLNIPAQ QVVTNNSDSV QLTATVKDPS NHPVAGITVN FTMPQDVAAN FTLENNGIAI TQANGEAHVT LKGKKAGTHT VTATLGNNNA SDAQPVTFVA DKDSAVVVLQ TSKAEIIGNG VDETTLTATV KDPFDNAVKD LQVTFSTNPA DTQLSQSKSN TNDSGVAEVT FKGTVLGVHT AEATLPNGNN DTKIVNIAPD ASNAQVTLNI PAQQVVTNNS DSVQLTATVK DPSNHPVAGI TVNFTMPQDV AANFTLENNG IAITQANGEA HVTLKGKKAG THTVTATLSN NNTSDSQPVT FVADKTSALV VLQISKNEIT GNGVDSATLT ATVKDQFDNE VNNLPVTFST ASSGLTLTPG ESNTNESGIA QATLAGVAFG EQTVTASLAN NGASDNKTVH FIGDTAAAKI IELTPVPDSI IAGTPQNSSG SVITATVVDN NGFPVKGVTV NFTSNAATAE MTNGGQAVTN EQGKATVTYT NTRSSIESGA RPDTVEASLE NGSSTLSTSI NVNADASTAH LTLLQALFDT VSAGDTTNLY IEVKDNYGNG VPQQEVTLSV SPSEGVTPSN NAIYTTNHDG NFYASFTATK AGVYQVTATL ENGDSMQQTV TYVPNVANAE ISLAASKDPV IANNNDLTTL TATVADTEGN AIANSEVTFT LPEDVRANFT LGDGGKVVTD TEGKAKVTLK GTKAGAHTVT ASMAGGKSEQ LVVNFIADTL TAQVNLNVTE DNFIANNVGM TRLQATVTDG NGNPLANEAV TFTLPADVSA SFTLGQGGSA ITDINGKAEV TLSGTKSGTY PVTVSVNNYG VSDTKQVTLI ADAGTAKLAS LTSVYSFVVS TTEGATMTAS VTDANGNPVE GIKVNFRGTS VTLSSTSVET DDRGFAEILV TSTEVGLKTV SASLADKPTE VISRLLNAKA DINSATITSL EIPEGQVMVA QDVAVKAHVN DQFGNPILNE SVTFSAEPPE HMTISQNIVS TDTHGIAEVT MTPERNGSYM VKASLANGSS YEKDLVVIDQ KLTLSASSPL IGVNSPTGAT LTATLTSANG TPVEGQVINF SVTPEGATLS GGKVRTNSSG QAPVVLTSNK VGTYTVTASF HNGVTIQTQT IVKVTGNSST AHVASFIADP STIAATNSDL STLKATVEDG SGNLIEGLTV YFALKSGSAT LTSLTAVTDQ NGIATTSVRG AITGSVTVSA VTTAGGMQTV DITLVAGPAD ASQSVLKNNR SSLKGDFTDS AELHLVLHDI SGNPIKVSEG LEFVQSGTNA PYVQVSAIDY SKNFSGEYKA TVTGGGEGIA TLIPVLNGVH QAGLSTTIQF TRAEDKIMSG TVLVNGANLP TTTFPSQGFT GAYYQLNNDN FAPGKTAADY EFSSSASWVD VDATGKVTFK NVGSKWERIT ATPKTGGPSY IYEIRVKSWW VNAGDAFMIY SLAENFCSSN GYTLPLGDHL NHSRSRGIGS LYSEWGDMGH YTTEAGFHSN MYWSSSPANS NEQYVVSLAT GDQSVFEKLG FAYATCYKNL
|
| |