Gene Information |
Locus tag | Hoch_2791 |
Symbol | |
ID | 8545179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 3830215 |
End bp | 3832116 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646387483 |
Product | type I restriction-modification system, M subunit |
Protein accession | YP_003267211 |
Protein GI | 262196002 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
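
The coordinate and length fields above agree with one another, and a few lines of arithmetic confirm it. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon not counted in the protein length. The record states neither assumption explicitly. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Hoch_2791.
length = gene_length(3830215, 3832116)
print(length)                  # 1902
print(protein_length(length))  # 633
```

Both results match the Gene Length (1902 bp) and Protein Length (633 aa) fields.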
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCACCA AGCGTCAGAT CCTGCACGAG ATCAGCAAGA ACGAGCTCTT GCCCTTCCTC GACCGCTGGC AGCTCGAGGT CGACGACCGC CGCGTCAAAG AGCAACTCGT CGAAGCCCTC GCGCGCTCCA AGCGCGCCCG CCTCGAGGAG CTGCTCGGCG AGCTATCGCG CGACACGCTC AAGGCCGTGT GCCGCGCGCT CGACCTCGAC GACACCGGCC GCGCCAAGCT CACCCTGATC GACCGCCTGC TCGGCCGCGA GTCCTCGCCC GCGTCGACCG ACGCCGACGC CGACGCCCCG TCCAAGTCCC CGGCCCGGTC CACATCCAAG CCCAAGCCCG CGCCGGTCAT CGACGACCAG GCCGAGGCCG AGCTCGCGGC CACCGAGGCC GAGCTCGCGG CCACCGAGCA GCTCACCACC GCGCAGCTCG AGCGCTACCT GTGGGCCGCG GCCGACATCC TGCGCGGCCA GATCGACTCA TCCGACTACA AGAACTACAT ATTTGGCCTG CTGTTCCTCA AGCGCCTGTC CGACGTCTTC GAGGAAGAGG CCGAAAAACT CACCGCCGAG GGGCTGCCCG CCGCCGTGGC CTGGAACGAC CCCGACGAGC ATCAGTTCTT CGTGCCCGAG CGCGCGCGCT GGTCCGAGAT CGCCAAGGTC GCCACCGGCA TCGGCGAGGC GCTCAACGTC GCCTGCGCGG CCCTCGAGGA GGCCAACAGC GGGCTCGACG GCGTGCTCGA GGGCATCGAC TTCAACGACG AGCGCCGCCT GGGCAACACC AAGAACCGCG ACGCCGTGCT CGCCCGCCTG GTGCAGCACT TTGGCCAGCT CAGCCTCAAG AACGCCGACC TCAGCGAGCC CGACATGCTC GGCCGCGCGT ACGAATACCT CATCGAGAAA TTCGCCGACG ACGCCGGCAA AAAGGGCGGC GAGTTCTACA CCCCGCGCAA GGTCGTGCAG CTCATCGTCG AGCTGCTCGC GCCCACCGCC GGCATGCGCA TCAGCGACCC CACCTGCGGT TCCGGCGGCA TGCTCATCGA GTGCGCCCAC TACGTCGAGC GCCAGGGCGG CAACCCGCGC AACCTGACGC TGCACGGCCA GGAGAAGAAC CTGGGCACCT GGGCCATCTG CAAGATGAAC ATGCTGCTGC ACGGCCTGCC CAGCGCGCGC ATCGAAAAAG GCGACACCAT CCGCGACCCG CGCCTGCTCG ATAACGGCGC GCTCCTGGTC TACGATCGCG TCATCGCCAA TCCGCCCTTT TCGCTCGACG AGTGGGGCGT CGAGGTCGCC GAGGGCGACG GCCACGGCCG CTTTCGCTTC GGCCTGCCGC CCAAGACCAA GGGCGACCTG GCGTTTTTGC AGCACATGGT CGCCACACTC AACGAGGGCG GCCGCCTCGG CGTGGTCATG CCCCACGGCG TGCTGTTCCG GGGCTCGTCC GAGGGCCGCA TCCGCAGCAA GCTGCTCGCC GAGGACCTGT TCGAGGCCGT CATCGGGCTG GCGCCCAACC TGTTCTATGG CACCGGCATC CCGGCCGCCG TGCTCGTGCT CAGCCGCGAC AAGGCCCGGG CGCGCAAAGG CAAGGTGTTG TTCGTCGACG CCTCGTCCGA GTTCGAGGCC GGCAGCGCGC AGAACTACCT GCGCGATGTC CACGTAACCA AGATCGCGCG CGCGTTTCAC GAGTATCGCG ACGTCGAGCG CTTCGCGCGC GTGGTTCCGC TGGCCGAGAT CGAGCAGAAC GAGGGCAACC TCAACATCAG CCGCTACGTG GACACCAGCC AGGAAGAGGA GCGCATCGAC 
GTGGCCGCCG CCGTGGCCCG GCTGCGCGAG CTAGAAGCCG CCCGCGACGA GGCCGAGGCG ACCATGCATC GGTTTCTGGA GGAGTTGGGT TATGGCGGAT GA
Protein sequence | MPTKRQILHE ISKNELLPFL DRWQLEVDDR RVKEQLVEAL ARSKRARLEE LLGELSRDTL KAVCRALDLD DTGRAKLTLI DRLLGRESSP ASTDADADAP SKSPARSTSK PKPAPVIDDQ AEAELAATEA ELAATEQLTT AQLERYLWAA ADILRGQIDS SDYKNYIFGL LFLKRLSDVF EEEAEKLTAE GLPAAVAWND PDEHQFFVPE RARWSEIAKV ATGIGEALNV ACAALEEANS GLDGVLEGID FNDERRLGNT KNRDAVLARL VQHFGQLSLK NADLSEPDML GRAYEYLIEK FADDAGKKGG EFYTPRKVVQ LIVELLAPTA GMRISDPTCG SGGMLIECAH YVERQGGNPR NLTLHGQEKN LGTWAICKMN MLLHGLPSAR IEKGDTIRDP RLLDNGALLV YDRVIANPPF SLDEWGVEVA EGDGHGRFRF GLPPKTKGDL AFLQHMVATL NEGGRLGVVM PHGVLFRGSS EGRIRSKLLA EDLFEAVIGL APNLFYGTGI PAAVLVLSRD KARARKGKVL FVDASSEFEA GSAQNYLRDV HVTKIARAFH EYRDVERFAR VVPLAEIEQN EGNLNISRYV DTSQEEERID VAAAVARLRE LEAARDEAEA TMHRFLEELG YGG
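
A sequence pasted from this record can be checked against the GC content and translation table fields with a short script. The sketch below is hedged: the helper names are hypothetical, whitespace between the 10-base blocks is ignored, and the start codons listed are only the ones commonly seen in bacterial genes, not the full set permitted by translation table 11. The stop codons TAA, TAG and TGA are the standard ones for that table.

```python
def clean(seq: str) -> str:
    """Uppercase the sequence and drop the spaces/newlines between blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases in seq."""
    bases = [b for b in clean(seq) if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


def looks_like_cds(seq: str,
                   starts=("ATG", "GTG", "TTG"),   # common bacterial starts (assumption)
                   stops=("TAA", "TAG", "TGA")) -> bool:
    """True if seq is a multiple of 3 long, begins with a start codon and ends with a stop."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in starts and s[-3:] in stops


# First block of the gene sequence above: 6 of its 10 bases are G or C.
print(gc_percent("ATGCCCACCA"))            # 60.0
# Toy CDS built from the first codon and the last four codons of the record.
print(looks_like_cds("ATG TATGGCGGAT GA"))  # True
```

Applied to the full gene sequence, `gc_percent` would be expected to land near the 68% reported in the GC content field, though that figure is not recomputed here.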