Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5861 |
Symbol | |
ID | 6969406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5514767 |
End bp | 5517199 |
Gene Length | 2433 bp |
Protein Length | 810 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643389480 |
Product | type III restriction enzyme domain protein |
Protein accession | YP_002273872 |
Protein GI | 209398938 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAAC TGAACCTAAG TAACCTGACG GAAGCAGACA TCATTACCAA ATGCGTTATG CCAGCCATTC TCAATGCAGG CTGGGACAAC ACAACGCAAA TCAGACAGGA GGTCAAACTC CGGGACGGTA AAGTCATCGT ACGTGGTAAA GTTGCGGCAC GCAGAACGGT AAAATCTGCT GACATCGTGC TGTATCACAA ACCTGGCATT CCCTTAGCCG TGATTGAAGC AAAAGCCAAC AAACATGAAA TTGGCAAAGG GATGCAACAG GGCATTGAAT ATGCGCGCCT GCTGGACGTT CCTTTTGTTT TTGCCACCAA CGGTGATGGC TTTATCTTCC GCGATGCCAC CGCAGCCGAA GGTGAATGCC TTGAAAAGCA AATAACGCTG GATGACTTCC CCTCCCCCGC TGAACTCTGG CAAAAATTCT GCCTCTGGAA AGGATACACA CAAGCTCAGC TTCCGGTGAT TACTCAGGAC TATTACGACG ATGGCAGCGG TAAATCGCCA CGTTATTACC AGCTTCAGGC AATCAACAAA ACCATTGAAG CCGTCTCCAA CGGGCAAAAC CGCGTTCTGC TGGTCATGGC GACCGGAACA GGGAAAACCT ATACCGCATT CCAGATCATC TGGCGCCTGT GGAAATCAAA AAATAAAAAA CGCATTTTGT TCCTTGCCGA TCGCAATATT CTGGTCGACC AAACCAAAAA TAATGATTTC CAGCCATTTG GTACGGCAAT GACCAAAGTC AGCGGACGCA CCATTGATCC CGCTTATGAA ATTCACCTCG CGCTCTATCA GGCTATAACT GGCCCGGAGG AAGACCAAAA AGCGTTTAAA CAAGTCGCAC CAGATTTCTT CGATCTGATC GTGATCGACG AATGCCATCG CGGCAGCGCA TCCGAAGACA GCGCCTGGCG AGAAATCCTT GATTATTTTA GTTCCGCCAC CCAAATTGGC TTAACCGCCA CGCCAAAAGA GACGCATGAA GTCTCCAGCA CGGATTACTT CGGCGATCCG GTTTACGTCT ACTCACTAAA AGAAGGGATC GAAGACGGCT TCCTCGCCCC TTATAAAGTT GTCCGTGTTG ATATTGATGT TGATCTGCAA GGCTGGCGCC CAACCAAAGG GCAAACTGAC TTAAACGGCG AAGTGATCGA CGATCGTATC TATAACCAGA AAGATTTCGA TCGCACGATG GTAATCGACG AACGCACAGA ACTGGTTGCC AGAACCATTA CCGACTACCT CAAGCGTACC AATCCGATGG ATAAAACCAT CGTCTTCTGT AACGACATCG ATCATGCAGA ACGTATGCGC CGCGCCCTGG TTAATCTCAA CCCGGAGCAG GTGAAAAAGA ACGACAAATA CGTCATGAAA ATCACCGGCG ATGATGAAAT TGGCAAAGCT CAGTTGGATA ACTTCATCAA CCCGAAAAAA CCGTACCCGG TTATCGCGAC CACTTCAGAG CTGATGACCA CCGGTGTGGA TGCTAAAACC TGCAAACTGG TAGTACTGGA CCAGAACATC CAGTCGATGA CCAAATTCAA GCAGATTATC GGTCGTGGTA CACGCATCGA CGAACGTTAC GGCAAACTCT GGTTTACCAT CCTCGACTTT AAAAAAGCCA CCGAACTGTT TGCCGATGAG CGTTTCGATG GCATTCCCGA AAAAGTCATG GATACCACAC CAGAGGATAT CGCCGATCCA GAATCTGATT TTGAAGAGAA ACTCGAAGAA ATCAGCGAAC ATGACGACGA ACAGGTAACA GGCGTTGATG AACCGCCTGC GCCACCATAC CAGGTTAAAG ATACCGATGA TGTCGGCCCA CTTCCGGAAG AAGACGAGAA GAAAATCCGC AAGTTTCACG TCAACGGTGT AGCAGTGGGC GTTATTGCCC AGCGTGTTCA GTATTACGAC GCCGACGGTA AACTGGTTAC CGAATCCTTT AAAGATTACA CCCGCAAAAC ACTGCTCAAA GAATATGCCT CGCTGGATGA CTTTACCCGC AAGTGGCAGG ACGCCGATCG CAAAGAAGCG ATCATTCACG AGCTGGAGCA ACAGGGGATC ATCTGGGAAG TACTGGCAGA AGAAGTCGGT AAAGATCTCG ACCCGTTCGA CATGCTTTGC CACGTAGTGT ATGGTCAGCC GCCGTTAACC CGCAAAGAGC GCGCCGAGAA CGTGCGCAAG CGGAACTACT TCACAAAATA CTCTGAAGCA GCGCAAGCCG TGCTCGATAA TCTGCTGGAT AAATACGCCG ATGCGGGCGT ACAGGAGATC GAAAGTATTC AGGTGCTGAA ACTTAAGCCA TTCGACAGCA TGGGCACCTT ACCGGAGATT ATTAAAACCG GATTTGGCGA CCGTAACGGG TATAATCAGG CGCTCAGCGA GCTGGAAAAC GAAATCTACC AATTACCGCC CCGCTCTGCT TAA
|
Protein sequence | MAELNLSNLT EADIITKCVM PAILNAGWDN TTQIRQEVKL RDGKVIVRGK VAARRTVKSA DIVLYHKPGI PLAVIEAKAN KHEIGKGMQQ GIEYARLLDV PFVFATNGDG FIFRDATAAE GECLEKQITL DDFPSPAELW QKFCLWKGYT QAQLPVITQD YYDDGSGKSP RYYQLQAINK TIEAVSNGQN RVLLVMATGT GKTYTAFQII WRLWKSKNKK RILFLADRNI LVDQTKNNDF QPFGTAMTKV SGRTIDPAYE IHLALYQAIT GPEEDQKAFK QVAPDFFDLI VIDECHRGSA SEDSAWREIL DYFSSATQIG LTATPKETHE VSSTDYFGDP VYVYSLKEGI EDGFLAPYKV VRVDIDVDLQ GWRPTKGQTD LNGEVIDDRI YNQKDFDRTM VIDERTELVA RTITDYLKRT NPMDKTIVFC NDIDHAERMR RALVNLNPEQ VKKNDKYVMK ITGDDEIGKA QLDNFINPKK PYPVIATTSE LMTTGVDAKT CKLVVLDQNI QSMTKFKQII GRGTRIDERY GKLWFTILDF KKATELFADE RFDGIPEKVM DTTPEDIADP ESDFEEKLEE ISEHDDEQVT GVDEPPAPPY QVKDTDDVGP LPEEDEKKIR KFHVNGVAVG VIAQRVQYYD ADGKLVTESF KDYTRKTLLK EYASLDDFTR KWQDADRKEA IIHELEQQGI IWEVLAEEVG KDLDPFDMLC HVVYGQPPLT RKERAENVRK RNYFTKYSEA AQAVLDNLLD KYADAGVQEI ESIQVLKLKP FDSMGTLPEI IKTGFGDRNG YNQALSELEN EIYQLPPRSA
|
| |