Gene Information |
Locus tag | Noc_0439 |
Symbol | |
ID | 3706610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 476041 |
End bp | 477645 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637736949 |
Product | Type I restriction-modification system M subunit |
Protein accession | YP_342493 |
Protein GI | 77163968 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
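The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; both assumptions agree with the values in this record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(476041, 477645)
print(length)                  # 1605
print(protein_length(length))  # 534
```

Both results match the Gene Length (1605 bp) and Protein Length (534 aa) fields.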
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00153392 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGGG ATCAACTCAG CCAGCTTGGC AAAACCCTTT GGGCCATTGC CGACGATCTG CGCGGCGCCA TGAATGCCGA TGACTTCCGC GACTATATGC TGTCGTTTTT GTTCTTGCGC TATCTGTCCG ATAACTACGA AGCGGCGGCG AAAAAAGAAC TGGGGCCGGA CTATCCCAAG CTGGCCGACG ACGACCGCCG CGCGCCCTTG GCGGTGTGGT ATGAGGATAA TGCCGAGGAT GTGGCCGCCT TTGAAAAACA GATGCGCCGC AAGATGCACT ATGTCATCCA CCCGGATTAT CTATGGAGCA GTATTGCCGA GCTGGCTCGC ACCCAGGATG AGGAGCTCTT GCAAACATTG GCGGGCGGCT TTAAACACAT TGAAAACGAG TCCTTCGCCA GCACTTTTCA GGGCTTGTTC TCGGAAATTA ATCTGCGTTC GGAAAAGTTG GGCAGAACCC TCGCGGACCA AAACAGAAAG CTTTGCACCA TCATCACTAA AATCGCCAAG GGCATCGCAC GGTTCTCCAC CGGCAGCGAT ATTCTGGGCG ATGCCTATGA ATATCTGATT GGCCAGTTCG CCGCTGGTTC CGGCAAAAAG GCCGGCGAGT TTTACACCCC CCAAAGCGTC TCCACCATCC TCTCCCGCAT TGTGACACTG GACAGTCAGG AGCCCTCCAC CGGCAAAAAG AAAAAGCTTA ATCGCGTGCT GGATTTTGCC TGTGGTTCCG GCTCGTTGCT GCTCAACGTG CGCAACCAAA TGGGGCCGCG GGGTATCGGC ATGATCTACG GCCAGGAAAA GAATATCACC ACCTATAACC TGGCGCGCAT GAATATGCTG CTCCATGGCA TGAAGGATAC CGAGTTCCAG ATCCACCACG GCGATACGCT GGAAAACGAC TGGGCTATTC TCAACGAGAG GAACCCCGCT AAAAAACTCC AATGCGATGC GGTCGTGGCC AACCCGCCGT TCAGTTATCG CTGGGAGCCC GACGAGGCCA TGGGCGAGGA TTTCCGCTTC GAAAGCCACG GCCTTGCCCC CAAATCAGCG GCGGATTTCG CCTTTCTACT CCACGGCCTT CACTTTTTGA GCGACGAGGG CACCATGGCC ATTGTCCTGC CCCACGGCGT CCTTTTTCGG GGCGGCGCGG AAGCGCGTAT CCGCACCAAA TTGCTGAAAG ACGGCCACAT CGATACCGTG ATCGGCCTGC CCTCCAATCT GTTTTTCTCC ACCGGCATAC CGGTATGTAT TCTGGTGCTG AAAAAGTGCA AAAAGCCCGA CGACGTACTG TTTATCAATG CCAGCGAGCA CTTTGAAAAA GGCAAGCGCC AAAACGCCCT ACGGCCCGTG GACATAGATA AAATTGTTGA TACCTATCAA TACCGCAAGG AAGAGGAACG CTACGCCCGG CGCGTCAGTA GGGAGGAGAT CGAGAAAAAC GATGACAACC TCAATATTTC CCGCTACGTC AGCACCGCTA AGCCTGAGCC AGAGGTGGAC CTAGATGACG TTCATAAAAA GCTCACCGCT ATTGAGCAGC GCATATCGCA AACAACAGAA ACCCATAATC AGTTTCTCAA AGCATTGGGG CTGCCAATAT TGTGA
|
Protein sequence | MTRDQLSQLG KTLWAIADDL RGAMNADDFR DYMLSFLFLR YLSDNYEAAA KKELGPDYPK LADDDRRAPL AVWYEDNAED VAAFEKQMRR KMHYVIHPDY LWSSIAELAR TQDEELLQTL AGGFKHIENE SFASTFQGLF SEINLRSEKL GRTLADQNRK LCTIITKIAK GIARFSTGSD ILGDAYEYLI GQFAAGSGKK AGEFYTPQSV STILSRIVTL DSQEPSTGKK KKLNRVLDFA CGSGSLLLNV RNQMGPRGIG MIYGQEKNIT TYNLARMNML LHGMKDTEFQ IHHGDTLEND WAILNERNPA KKLQCDAVVA NPPFSYRWEP DEAMGEDFRF ESHGLAPKSA ADFAFLLHGL HFLSDEGTMA IVLPHGVLFR GGAEARIRTK LLKDGHIDTV IGLPSNLFFS TGIPVCILVL KKCKKPDDVL FINASEHFEK GKRQNALRPV DIDKIVDTYQ YRKEEERYAR RVSREEIEKN DDNLNISRYV STAKPEPEVD LDDVHKKLTA IEQRISQTTE THNQFLKALG LPIL
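The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before computing a length or a GC fraction. The sketch below shows one way to do that; the helper names are hypothetical and the example input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence, used only as a short illustration.
prefix = clean_sequence("ATGACCCGGG ATCAACTCAG CCAGCTTGGC")
print(len(prefix))          # 30
print(gc_fraction(prefix))  # 0.6
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.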
|
| |