Gene Information |
Locus tag | ECH74115_5809 |
Symbol | |
ID | 6969870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5450290 |
End bp | 5455224 |
Gene Length | 4935 bp |
Protein Length | 1644 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643389437 |
Product | hypothetical protein |
Protein accession | YP_002273829 |
Protein GI | 209400195 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
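The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5450290, 5455224)
print(length, protein_length(length))  # prints "4935 1644"
```

Both results agree with the Gene Length (4935 bp) and Protein Length (1644 aa) fields.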
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGG TCGGTATTAA TAACGAAAAC GAATTTTACT CTAACCACTA TTTGGGTGAG GTTTTCACCA GTGATATCCG CGATGTGCTG GAACCCTGGA TAGCCCAGGA AAATGCAGCG CGTGAAGCGG AGCGTGCCGC TCGTGAACAG GGCAAAGACG TGGAGCCGGG ATACCGCGCT CCGTGGAACC AGTTTAACAG TCTGGCGACT GAGTTTTTCC GCAAACTTGC CGAGCACGAA AAACAGCGTC AGATCCCGCA GCGTCTGGCC GATCAACGTA ATCGCTGGCA GCCATTGTTA AAGGCGCTGG GCTACGAAAT TACGCCACAG ATCCAAATGC TGGATGACGA TACACCGCTG CCGGTACTGG CGCGTTACAA CAGCACTGAC GGTAGCCCGT GGCTGTGGAT TGTTGAAGCA CACGATCAGG AAGAAGGAAC GCTGGATCCG CTGGCGCTCT CCTTACTGAC CGCGCAATTC CCGGCGGATA CCGACAAACA TAAGCACGAC AGCCTGCGCA AAAAAGCCAA CGGTGAATAT CGCAGCTGGC AGGATTTGCT CTCTACGGCG GTCTTCACCC AAAATGAACC GCCGCGTTTT GTGCTGCTGC TCGGTAACCG TCAGCTATTG TTGTTGGACC GTACTAAGTG GGCGCAAAAC CGTCTGCTAC GTTTTGATTT TGAAGAGATT TTAAGCCGTC GTGAAACGGA TACGCTGAAA GCGACTGCGG TGTTGTTACA TAAAGATTCT CTGCTGCCGG GCAGTGGGGC ACCTTATCTT GACTCGCTGG ATGACAATTC GCACAAACAT GCGTTTGGTG TTTCGGAAGA TCTGAAATAT GCCCTGCGCG AAAGCATTGA GTTGCTGGGC AACGAAGCGA TGCATTATCT GATCGACCGT GGCCTGGCAA ACTATACAGG TAATCGTGCG GTGGACCCGG ATGAACTGAG CCGCGAATGT CTGCGTTACA TGTACCGCCT GCTGTTCCTG TTCTACATTG AAGCGCGCCC GGAGCTGGGT TATGCGCCAA TGACCGCCAA AACCTATCTG CAAGGTTACA GCCTGGAAAC GTTGCGCGAT CTGGAGATGA TCCCGCTGAC CAGCGAAGAA GATCGCAACG GGCGCTACTT CCACGACAGC CTGAATATGC TGTTTAAACT GGTGCGCGAA GGCTACAACG GCGGCGTGAA AATGCAGAGT GACCTGGAGA GCGGCGACCG GATCACCATC CATAGTCATC AGTTCAGCGT CCCGCGTCTG GAAAGTCATC TGTTTGATGC CAACAACACG CGCATTCTTA ACCGCGTGGT ATTCCGTAAC GAAACCCTGC AACAGATTAT CCAGGCGATG TCGTTAAGCC GCCCGGCCAA AGGGCGCTTT AACCGCCGCG GACGTATTTC TTATCGCCAG TTGGGTATCA ACCAGTTGGG TGCGGTGTAT GAGGCGCTGC TCTCCTATCG CGGATTCTTC GCCAGCGAAG ATCTCTACGA GGTGAAGAAA GCCGGGGAAG AGTTTAACGA GCTGGAGACG GGTTACTTCG TCAGTAAGGA TGAGATTAGC AAATACCACG AAGACGAGAA GGTCTACGAG AAAGACGGCA GTCTGCGCAT TCACCGCAAA GGCAGCTTTA TCTACCGTAT GGCCGGGCGC GACCGTGAGA AATCTGCCTC TTATTACACC CCGGAAGTGC TGACCCGCTC ACTGGTTAAA TATGCCCTGA AAGAACTGTT TAAGGAGCAG ATTGATCCCA TTAGCGATCC GCACGCCAAA GCTGATGCCA TCTTAAACCT CACCGTGTGC 
GAACCGGCGA TGGGCAGCGC GGCGTTCCTT AACGAAGCCA TCAACCAGCT GGCGGAAGCG TATCTGTTCC ACAAGCAGCA GGCGGAAGGT CGCCGTATTC CGCAGGATCG TTACACCCAG GAGTTACAGC GGGTGAAAAT GTACATTGCC GACAACAACG TTTTCGGCGT GGACTTAAAC CCGGTGGCGG TGGAACTGGC GGAAGTGTCG CTGTGGCTGA ACGCCATTAG TGGCGATGCG TTTGTACCGT GGTTTGGTTA CCAGCTGCAC TGCGGTAACT CGCTGGTGGG CGCGCGCCGT CAGGTGTTCA ACAAGAGCGA ACTGACCTAC AAAAAAGCCA AAGATCCGAG CTGGCTTAAC AGCGAGCCGG TCGAACTGGC GATGAACACG CCGCGTGAAG AGACGCAGAT TTTCCACTTC CTGCTGCCCG ACGGCGGTAT GGCTAACTAC AGCGATAAAA CTGTTAAGCA GCGTTATCCG GATGACTTCA AAGCGCTGGA CAGCTGGCGC AAAGAGTTTA TTAAAAGCTT TGCCGGGCAT GAGATTGCTG ATGTGCAGCG TATCAGCGAA AAGGTGGAAG CACTGTGGAA CACCTATCGC CAGCAACTTA AAGCAGAACG TCTGAAAACC GCCGACAGCT ACCCGGTGTG GCCGGCAGAA AACAGCGAGC AGACGCGTTC TTCGCTGAGC AGTAAAGATG AAACCTTTAG CGGTCGTCTT GAAGATAACA GCGCCTACCA GAAGCTGCGT TGGGTAATGG ACTACTGGTG CGCGCTGTGG TTCTGGCCGA TCGACAAAGC CGATGAGCTA CCGGATCGCG GCACCTGGTT GTTTGAGATT GAAACCCTGC TTGACGGGAT TGTAATCACG GAAAAAGTCA CTGAAGTTGC GGAGCACACC ACCGGCGATC TGTTTGCCGA AGAAGGCCTG CTGCGAGAAG AGTCTTCGCT GTTTTCTGTT GCTGGTCGTC TGAAAACCGA GGTGTTGTTC CGTCATTTGC CGCGTCTGGC GATTGTCGAT GCCCTGAGAA AGCAGCACCG TTTCTTCCAC TGGGATCTGG AGTTCTGCGA CCTGTTTGCC GAGCGCGGCG GTTTTGACCT GATGCTCGGA AACCCGCCGT GGCTGAAAGT GGAATGGCAG GAAGCGGGCG TGCTGGGTGA TTACGAGCCG GAATTTGTGC TGCGTAAGCT GAGCGCCTCG AAGCTGGCAA CGTTGCGTAT TGATACCTTT AACCAGATCC CGGCGCTGGA AGCGGCCTGG CGCAGCGAGT ATGAAGGCTG TGAAGGGATG CAAAACTTCC TGAATGCGCA GCAGAACTAC CCGGTACTGC GCGGGGTGCA GACCAACTTG TATAAATGCT TTCTGCCGCA GGCCTGGCGA TTAGGGGCGC AGAAAGGCGT GGCAGGTTTC CTGCACCCGG AAGGGATTTA TGATGACCCG AAAGGCGGGC AATTACGTGC GGCGGTATAT CCGAGGCTGA GGGCGCATTT TCAGTTTCAG AATGAGTTAA ATTTGTTTGT TGAAGTTGAT CACCATGCGA AGTTTAGCAG CAATATTTAT TCTGCTAGCC CTAGCACAGT GGGATTTGAA CATATATCTA ATTTGTATGC TCCGCAAACT ATTGATGCAT GTTTTGAACA TTCTGGCAGT GGGGACATTC CCGGTCTCAA AGACGAGATT GAGAGCGAGG GAAAATTAAA AGTTGTATGG AACACATCTG GCCACCGTTC TCGATTAATA AGTATCGCCA CTCATGAGCT AGAATTATTT GCTCGTCTAT ATGACAGCGA AGGAACGCCA GCCTGGCAGG 
CACGTTTGCC AGCCTTACAT GCTAAACAAC TTGTTGCTGT ACTGGAAAAG TTTGCTAATC AGCCGAATAG ATTAGGTGAT TTGCAGGGGC AGTATTTTTC AACGGTTATG TTCGATGAAA CATATGCTCA GAGGGATGGG ACAATTTTAC GGCAGACTCA ATTCCCTCAA GATTCATCAC AATGGGTACT GTCTGGCCCT CATTTCTTTG TTGGGACGCC GTTCTACAAG ACTCCGCGCG AAAACTGTAC GCTTAACAGC GATTATGACT GCCTGGACTT GCTAACTCTG CCTGACGACT ATCTGCCGCG CACTAACTAC ATTCCGGCAT GTGATGCACA GGAGTATGCA AAACGTACTC CATGCGTTAC ATGGACTGAA CTGGCTGAAG ATGAACCGAA GAAGGTAACA GATTATTATC GCTTAGCTAT CAGAGCCATG TTGGCTCAAT CGGGGGAACG TACACTAATT AGTGCTATTT ATCCGCCAGA AATAAGTCAC ATGAACGCAG TACGTTCTTA CTGCTATAGC TCACAGAATC TGTTACTCGA ACATTCAGGT ATGTGTTTTT CTTTACCTTT TGATTTTATT TGTAAATCTA CTGGCAAGGC AAACTTACAT CAGATGCTTG ATGGTTTCTC ATACGTATTA TTCAATCCGA GACAAAAGGC ATTATTATAC TGCTTAGTAT TATCATTAAA TTCTGTAAAT GATGTATATG CTGGCCTTTG GCAATCCTGC TACACCCCAG ACTTCAACAC CCAGCGTTGG AGCCGCGATC TCCCGCAGCT CCCCCAGGAT TTCTTCGCCA AACTGACCCC AGAGTGGCAG CGTAACTGCG CTTTACGCTC TGACTACAGT CGTCGTCAGG CGCTGGTGGA AATCGACGTA TTGGTGGCGC AGGCGCTGGG GTTAACTCTC GAAGAGCTGC TTACCATTTA TCGCGTTCAG TTCCCGGTGA TGCGCCAGTA CGAAGCGGAT ACCTGGTACG ATCAAAACGG TCGCATTATC TTTACCCCAA GCAAAGGGCT GGTGGGCGTT GGCTTGCCTC GCACCGCGCG TAAAGCTGAC CTGAAAAACG GCTTTGTCTT TAACGTCGAC AGCCCGGAGT GGACCGGCGG TGACTGCACC GATCAAGCTA TCGGTTGGGA TGATGTCAAA CATCTTAAAA CCGGTACCGT CAGCGTCACC TTTGATGATT ATACCCGCAG CGACGAAGGT GAGCGCCGTA CCGTCACCTG GCAGGCTCCG TTTATCAAGC CAGATCGCGA AGATGACTAC AAAGTGGCCT GGGCGTTCTT TGCACAAGAT AAGGAGAGCG CCTGA
|
Protein sequence | MALVGINNEN EFYSNHYLGE VFTSDIRDVL EPWIAQENAA REAERAAREQ GKDVEPGYRA PWNQFNSLAT EFFRKLAEHE KQRQIPQRLA DQRNRWQPLL KALGYEITPQ IQMLDDDTPL PVLARYNSTD GSPWLWIVEA HDQEEGTLDP LALSLLTAQF PADTDKHKHD SLRKKANGEY RSWQDLLSTA VFTQNEPPRF VLLLGNRQLL LLDRTKWAQN RLLRFDFEEI LSRRETDTLK ATAVLLHKDS LLPGSGAPYL DSLDDNSHKH AFGVSEDLKY ALRESIELLG NEAMHYLIDR GLANYTGNRA VDPDELSREC LRYMYRLLFL FYIEARPELG YAPMTAKTYL QGYSLETLRD LEMIPLTSEE DRNGRYFHDS LNMLFKLVRE GYNGGVKMQS DLESGDRITI HSHQFSVPRL ESHLFDANNT RILNRVVFRN ETLQQIIQAM SLSRPAKGRF NRRGRISYRQ LGINQLGAVY EALLSYRGFF ASEDLYEVKK AGEEFNELET GYFVSKDEIS KYHEDEKVYE KDGSLRIHRK GSFIYRMAGR DREKSASYYT PEVLTRSLVK YALKELFKEQ IDPISDPHAK ADAILNLTVC EPAMGSAAFL NEAINQLAEA YLFHKQQAEG RRIPQDRYTQ ELQRVKMYIA DNNVFGVDLN PVAVELAEVS LWLNAISGDA FVPWFGYQLH CGNSLVGARR QVFNKSELTY KKAKDPSWLN SEPVELAMNT PREETQIFHF LLPDGGMANY SDKTVKQRYP DDFKALDSWR KEFIKSFAGH EIADVQRISE KVEALWNTYR QQLKAERLKT ADSYPVWPAE NSEQTRSSLS SKDETFSGRL EDNSAYQKLR WVMDYWCALW FWPIDKADEL PDRGTWLFEI ETLLDGIVIT EKVTEVAEHT TGDLFAEEGL LREESSLFSV AGRLKTEVLF RHLPRLAIVD ALRKQHRFFH WDLEFCDLFA ERGGFDLMLG NPPWLKVEWQ EAGVLGDYEP EFVLRKLSAS KLATLRIDTF NQIPALEAAW RSEYEGCEGM QNFLNAQQNY PVLRGVQTNL YKCFLPQAWR LGAQKGVAGF LHPEGIYDDP KGGQLRAAVY PRLRAHFQFQ NELNLFVEVD HHAKFSSNIY SASPSTVGFE HISNLYAPQT IDACFEHSGS GDIPGLKDEI ESEGKLKVVW NTSGHRSRLI SIATHELELF ARLYDSEGTP AWQARLPALH AKQLVAVLEK FANQPNRLGD LQGQYFSTVM FDETYAQRDG TILRQTQFPQ DSSQWVLSGP HFFVGTPFYK TPRENCTLNS DYDCLDLLTL PDDYLPRTNY IPACDAQEYA KRTPCVTWTE LAEDEPKKVT DYYRLAIRAM LAQSGERTLI SAIYPPEISH MNAVRSYCYS SQNLLLEHSG MCFSLPFDFI CKSTGKANLH QMLDGFSYVL FNPRQKALLY CLVLSLNSVN DVYAGLWQSC YTPDFNTQRW SRDLPQLPQD FFAKLTPEWQ RNCALRSDYS RRQALVEIDV LVAQALGLTL EELLTIYRVQ FPVMRQYEAD TWYDQNGRII FTPSKGLVGV GLPRTARKAD LKNGFVFNVD SPEWTGGDCT DQAIGWDDVK HLKTGTVSVT FDDYTRSDEG ERRTVTWQAP FIKPDREDDY KVAWAFFAQD KESA
|
| |
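The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned up and its GC content computed follows; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene, so its result is not expected to match the 52% reported for the whole record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence in this record.
fragment = clean_sequence("ATGGCGCTGG TCGGTATTAA")
print(len(fragment), gc_percent(fragment))  # prints "20 50.0"
```

Applying the same two functions to the complete gene sequence would give the per-gene value that the GC content field summarizes.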