Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0339 |
Symbol | |
ID | 5595008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 345798 |
End bp | 349415 |
Gene Length | 3618 bp |
Protein Length | 1205 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640919524 |
Product | putative restriction enzyme |
Protein accession | YP_001457110 |
Protein GI | 157159792 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00710778 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACCA ATAACATTAA AAAATATGCC CCACAGGCCC GTAACGACTT CCGCGATGCG GTGATCCAGA AGCTAACGAC GCTTGGGATC GCTGCAGATA AAAAAGGCAA TTTGCAGATT GCCGAGGCCG AAACCATTGG CGAGACCGTG CGTTACGGTC AGTTTGATTA CCCGTTATCG ACCCTTCCCC GCCGCGAACG GCTGGTAAAA CGCGCCCGTG AGCAGGGTTT TGAGGTGCTG GTTGAGCACT GCGCCTACAC CTGGTTTAAC CGCTTATGTG CAATTCGCTA TATGGAGCTG CACGGTTATC TTGATCACGG CTTCCGTATG TTGTCTCACC CGGAGACTCC GACCGCGTTT GAGGTGCTGG ATCATGTGCC GGAAGTGGCA GAAGCTCTGC TACCAGAAAG TAAGGCGCAG TTGGTTGAAA TGAAGCTTTC CGGTAATCAG GACGAAGCTC TGTACCGTGA ACTGCTGCTG GGGCAGTGCC ACGCCCTGCA CCACGCGATG CCGTTCCTGT TTGAAGCGGT AGATGACGAA GCGGAACTGC TGCTGCCGGA TAACCTGACC CGTACTGACT CCATTCTACG TGGGCTGGTT GATGATATTC CGGAAGAAGA CTGGGAGCAG GTAGAGGTTA TCGGCTGGCT GTATCAGTTC TATATTTCGG AAAAGAAAGA TGCCGTGATT GGCAAAGTGG TGAAGAGCGA AGATATTCCT GCCGCCACCC AGCTGTTTAC GCCGAACTGG ATTGTGCAGT ATCTGGTACA GAACTCCGTT GGCCGCCAGT GGTTGCAGAC CTACCCGGAC TCACCGCTGA AAGACAAAAT GGAGTACTAC ATTGAGCCTG CGGAGCAAAC GCCGGAAGTG CAGGCGCAGC TGGCGGCGAT TACCCCGGCC AGCATTGAGC CTGAAAGCAT TAAAGTGCTG GATCCGGCCT GCGGTTCCGG GCATATCCTG ACAGAAGCCT ATAACGTGCT GAAGGCCATT TACGAAGAGC GCGGTTACCG TACCCGTGAC ATTCCGCAGC TGATTCTGGA AAACAATATT TTCGGCCTGG ACATCGATGA TCGTGCGGCG CAGCTTTCCG GATTTGCGAT GCTGATGCTG GCACGCCAGG ATGACCGTCG TATTCTGGGC CGTGGTGTGA GGCTGAATAT TGTCTCCCTA CAGGAAAGTA AGCTGGATAT TGCTGAGGTT TGGACCAAGC TGAACTTCCA TCAGCATATG CAACGCGGCA GCATGGGAGA TATGTTTACC CAAGGGACGG CACTGGCCAA TACCGACAGC GCGGAATATA AGCTACTGAT GCGTACTCTG GCATTGTTCA CTAGTGCAAA AACGCTGGGA TCACTGATTC AGGTACCGCA GGAAGACGAA GCGGCGTTGA AAGCGTTTCT GGAGAGATTG TATCGTCTGG CTGTTGAAGG TGATATTCAG CAGAAAGAGG CTGCGGCAGA GCTGATCCCA TATATTCAGC AGGCGTGGAT ACTGGCGCAA CGTTATGATG CTGTGGTCGC GAACCCGCCG TATATGGGGG GGAAAGGGAT GAATGGTGAT CTGAAAGAGT TTGCTAAAAA ACAATTCCCG GACAGCAAGT CGGATTTGTT TGCGATGTTT ATGCAACATG CATTTTCTTT ACTCAAAGAA AACGGTTTTA ATGCGCAAGT GAATATGCAG TCATGGATGT TCCTGTCTAG CTATGAAGCG CTTCGTGGAT GGTTGCTGGA TAATAAAACC TTTATTACGA TGGCACATTT GGGGGCTCGA GCATTTGGTC AAATCTCTGG AGAAGTTGTT CAAACTACCG CATGGGTGAT TAAAAATAAC CATTCAGGAT TTTATAAACC TGTATTTTTC CGTTTAGTTG ACGATAATGA AGAACATAAA AAAAACAATC TCTTGAATCG GATGAATTGC TTTAAAAACA CTCTGCAGAA TGACTTTAAA AAAATACCTG GTTCACCCAT TGCTTACTGG GCAACATTGG CGTTTATTAA TTCCTTTCTT AAATTGCCTG CTCTTGGAAC TCGTGCTGTG AAAGGACTAG ATACAAACGG GTCCATTGAT GTATTTTTGC GCAGATGGCC GGAAGTCAGC ATAAATTCCT TTGATGCATT AGGGAAAGGT AACTCAAAAT GGTTTCCGAT TGCAAAAGGG GGTGAGTTGA GGAAGTGGTT CGGAAATCAT GAGTATATCA TAAACTATGA AAATGATGGC ATTGAATTAA GAAAAAACAA AGCAAACTTG AGAAATAAGG ATATGTACTT TCAGGAGGGG GGAACCTGGA CTGTTGTATC AACCACCGGT TTCTCAATGA GATATATGCC AAAAGGATTC CTTTTTGACC AAGGCGGTTC TGCTGTTTTT TGTGAAAATA ATGATGAGCT ATCGATATAT AACATCCTGG CCTGCATGAA CTCTAAATAT ATCAACTACT CTGCAAGTTT AATTTGTCCT ACGCTTAACT TTACAACCGG TGATGTTAGG AAATTCCCTG TTATAAAAAA CAATCACCTT GAAGATTTAG CAAAAAAAGC AATTGAGATA TCAAAAGCAG ACTGGAACCA ATTCGAGACA TCTTGGGAGT TCAGTAAAAA CAAGTTAATT GAACACAAAG GAAACGTTGC TTATTCGTAT GCTAGCTATT GTAACTTTCA AGATAAACTG TATGAACAGC TAGTTAATAT TGAAAAAAAT ATTAATAACA TAATCGAAGA AATACTGGGT TTTAAAATAG AAACAACAGA GAATAGTGAG TTAATTACAT TAAACTCGAA CAAAATATAT CGTTATGGGC AAAGTGAAAC CAATGATACA TTCCTAAACA GGCATCGGAG TGACACCATT TCGGAACTCA TCTCATATTC AGTCGGCTGC CAAATGGGAC GCTATTCCCT CGATCGCGAA GGCCTCGTCT ACGCCCATGA AGGCAACAAA GGCTTTGCCG AACTTGCCGC TGAAGGTGCG TACAAAACCT TCCCGGCAGA CAATGACGGC ATCCTGCCGC TGATGGATGA CGAGTGGTTT GAGGACGACG TCACCTCTCG CGTCAAAGAG TTTGTCCGCA CCGTCTGGGG CGAAGAGCAC CTGCAGGAAA ACCTCGAATT TATCGCCGAA AGTCTCTGTT TATACGCGAT CAAGCCGAAA AAAGGCGAAT CTGCGCTGGA GACCATTCGT CGCTATCTTT CCACACAGTT CTGGAAAGAT CATATGAAGA TGTATAAAAA GCGCCCAATC TACTGGCTAT TCAGCTCCGG TAAAGAGAAA GCCTTCGAGT GCCTGGTGTA TCTGCATCGC TATAACGACG CCACGCTGTC GAGAATGCGT ACCGAATATG TGGTGCCGCT GCTGGCACGT TATCAGGCCA ATATCGATCG CCTGAACGAT CAACTTGATG AAGCTTCTGG CGGTGAAGCC ACACGCCTGA AACGCGAACG CGACAGCCTG ATCAAAAAAT TTAGCGAACT GCGTAGCTAT GACGACCGCC TGCGTCACTA TGCTGATATG AGAATCAGTA TTGATCTCGA CGATGGCGTT AAGGTTAACT ACGGCAAGTT TGGCGATCTG CTGGCAGATG TCAAAGCCAT CACCGGCAAT GCCCCAGAGG CGATCTAA
|
Protein sequence | MNTNNIKKYA PQARNDFRDA VIQKLTTLGI AADKKGNLQI AEAETIGETV RYGQFDYPLS TLPRRERLVK RAREQGFEVL VEHCAYTWFN RLCAIRYMEL HGYLDHGFRM LSHPETPTAF EVLDHVPEVA EALLPESKAQ LVEMKLSGNQ DEALYRELLL GQCHALHHAM PFLFEAVDDE AELLLPDNLT RTDSILRGLV DDIPEEDWEQ VEVIGWLYQF YISEKKDAVI GKVVKSEDIP AATQLFTPNW IVQYLVQNSV GRQWLQTYPD SPLKDKMEYY IEPAEQTPEV QAQLAAITPA SIEPESIKVL DPACGSGHIL TEAYNVLKAI YEERGYRTRD IPQLILENNI FGLDIDDRAA QLSGFAMLML ARQDDRRILG RGVRLNIVSL QESKLDIAEV WTKLNFHQHM QRGSMGDMFT QGTALANTDS AEYKLLMRTL ALFTSAKTLG SLIQVPQEDE AALKAFLERL YRLAVEGDIQ QKEAAAELIP YIQQAWILAQ RYDAVVANPP YMGGKGMNGD LKEFAKKQFP DSKSDLFAMF MQHAFSLLKE NGFNAQVNMQ SWMFLSSYEA LRGWLLDNKT FITMAHLGAR AFGQISGEVV QTTAWVIKNN HSGFYKPVFF RLVDDNEEHK KNNLLNRMNC FKNTLQNDFK KIPGSPIAYW ATLAFINSFL KLPALGTRAV KGLDTNGSID VFLRRWPEVS INSFDALGKG NSKWFPIAKG GELRKWFGNH EYIINYENDG IELRKNKANL RNKDMYFQEG GTWTVVSTTG FSMRYMPKGF LFDQGGSAVF CENNDELSIY NILACMNSKY INYSASLICP TLNFTTGDVR KFPVIKNNHL EDLAKKAIEI SKADWNQFET SWEFSKNKLI EHKGNVAYSY ASYCNFQDKL YEQLVNIEKN INNIIEEILG FKIETTENSE LITLNSNKIY RYGQSETNDT FLNRHRSDTI SELISYSVGC QMGRYSLDRE GLVYAHEGNK GFAELAAEGA YKTFPADNDG ILPLMDDEWF EDDVTSRVKE FVRTVWGEEH LQENLEFIAE SLCLYAIKPK KGESALETIR RYLSTQFWKD HMKMYKKRPI YWLFSSGKEK AFECLVYLHR YNDATLSRMR TEYVVPLLAR YQANIDRLND QLDEASGGEA TRLKRERDSL IKKFSELRSY DDRLRHYADM RISIDLDDGV KVNYGKFGDL LADVKAITGN APEAI
|
| |