Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2904 |
Symbol | |
ID | 3707421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3284539 |
End bp | 3285816 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637739381 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_344880 |
Protein GI | 77166355 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATA CGGTTCCTGA GGGGTGGGAA GTTAAGCCGC TAGGAAAACT CGTAGACGTT CGATCTAGTA ATATTGACAA GAAGACTGAA ACGTCGGAAA TCCCGGTTCG TTTGTGTAAC TACACTGATG TGTATTACAA CAACAGGATC ACGTCTGCAA TTGATTTTAT GGCGGCGAGT GCGAAACAGC GGGAAATAGA CCGCTTCTCG CTAGAAAAAG GAGATGTGAT AATCACGAAG GATTCTGAAA CTCCTGATGA CATAGCAGTC CCATCGTATG TGAGTGATGA TCTTTCTGGG GTGGTTTGTG GCTATCATTT AACCTTATTG AAGCCAGATC AAGATGAATC CGACGGTGAA TTCCTTTCCC ATCTATTCCA GTTGCCAAGC GTTCAGCACT ACTTTTACAT ACTGGCAAAT GGAATAACTC GCTTTGGTCT GACTGCGGAT GCTATCAATG AGGCCCCACT TCTCACGCCC CCTCTCCCCG AACAACAAAA AATCGCCGCC ATCCTGTCCT CCGTCGATGA CGTGATTGAA AAAACACGCG CCCAGATCCA CAAGCTGAAA GATCTGAAAA CCGCCATGAT GCAGGAATTG TTGACCAAAG GGATTGGGCA CACGGAATTC AAGGACTCGC CGGTGGGAAG GATTCCGGTG GGGTGGAGTA TTTGCAGCGC GGGGGAAGTC GCTGTTGCCA TAATGGTTGG GGTCGTCGTT AAACCAGCGC AATACTATGT TGAATCAGGC GTTCCTGCAT TGCGCTCCGC AAATGTTCGT GAAAACGGTT TAACCATGGA TAACTTGAAA TATTTTTCAG AAGACTCAAA TGAAATACTC AAAAAAAGCC GGCTAATAAA GGGTGACCTT TTGACAGTCA GAACAGGTTA TCCCGGCACG ACAGCGGTAG TTACTGATGA ATTTGAAGGC TGTAACTGCA TAGATGTTGT CATTACTCGT CCATCTTCGC GTATTGACTC AGACTTTTTT TGTTTATGGG TGAATTCTGA CCACGGAAAA GGGCAAGTCT TGAAGGCACA AGGTGGACTT GCTCAGCAGC ACTTTAACGT CAGTGATATG AAAAACCTTA CAGTGGTAGT TCCTTCACTA ACTGAGCAAA AAGCTATCTT CAATGCTGTT AATTCAGTAA CTAAGAAAAT AGCCTTAACT GAAAAACGCC TTACTCTCTT GCTCGATACC AAAAAAGCCC TGATGCAAGA CCTGCTCACC GGCAAAGTCC GCGTCAACGT CGAACAAGAG GAACCAGTGA TCGCCTGA
|
Protein sequence | MSDTVPEGWE VKPLGKLVDV RSSNIDKKTE TSEIPVRLCN YTDVYYNNRI TSAIDFMAAS AKQREIDRFS LEKGDVIITK DSETPDDIAV PSYVSDDLSG VVCGYHLTLL KPDQDESDGE FLSHLFQLPS VQHYFYILAN GITRFGLTAD AINEAPLLTP PLPEQQKIAA ILSSVDDVIE KTRAQIHKLK DLKTAMMQEL LTKGIGHTEF KDSPVGRIPV GWSICSAGEV AVAIMVGVVV KPAQYYVESG VPALRSANVR ENGLTMDNLK YFSEDSNEIL KKSRLIKGDL LTVRTGYPGT TAVVTDEFEG CNCIDVVITR PSSRIDSDFF CLWVNSDHGK GQVLKAQGGL AQQHFNVSDM KNLTVVVPSL TEQKAIFNAV NSVTKKIALT EKRLTLLLDT KKALMQDLLT GKVRVNVEQE EPVIA
|
| |