Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2903 |
Symbol | |
ID | 3707420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3282822 |
End bp | 3284546 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637739380 |
Product | Type I restriction-modification system M subunit |
Protein accession | YP_344879 |
Protein GI | 77166354 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCACA ATATCACCCT ACAACAACTG GAATCCTTCC TGTGGGAGGC CGCCGATATT CTCCGGGGCA ACATGGACGC CTCCGAGTAC AAGGATTATA TCTTCGGCAT GATGTTCCTC AAGCGCCTGT CCGATGCCTT TGAGGAAGCC CAGGAAGGGG TTATCCAGTA CTACCTGGGC AAGGGCAAAA CTGACGCCGA GGCCCGGGAG CTGGCCAACG ATGAGGACGA ATACGACAAG ACCTTCTACA TTCCGCCCAT CGCCCGCTGG GGTGCCCTGA AAGACCTGAA ACACGATATT GGCACCGAGC TGAACAAGGC CACGGAAGCC ATCGAGGAAG TCAACCCTTC CCTGGAAGGG GTGCTGGTGT CCATCGACTT CAATATCAAG AACAAGCTCT CGGATAAGAA ACTGCGGGAT CTGCTCCGTC ACTTCAGCCG CCATCGGCTT CGCAATGAAG ACTTCGAGCA CCCCGATCTA CTGGGCACCG CCTACGAGTA CCTGATAAAA ATGTTTGCCG ATAGCGCCGG CAAGAAGGGC GGCGAGTTTT ACACCCCTTC CGAGGTGGTG CGGCTGCTGG TGGCCCTGCT CAAGCCCCAG GCCGGGATGC GCATCTACGA TCCCACCGCC GGGTCCGGCG GGATGCTGGT GCAGACCCGC AACTATCTGG CCCGTCATGG TGAAAACCCG GCCAATCTGT CCCTGTTCGG TCAGGAGATG AACCTGAACA CCTGGGCCAT CTGCAAGATG AATATGTTTT TGCACGGAGT CTACAGCGCC GACATCCGCA AGGGGGATAC CCTGCGGGAA CCCCAGCATA CCCAGGGCGG TGAACTGATG ACCTTTGACC GGGTGATCGC CAATCCCCCC TTTTCCCTGA AAAAGTGGGG CAAGGATGAA GCGGACAAGG ACGCCTACGG GCGCTTCCCC TACGGTACAC CCCCCAAGGA CGCCGGGGAT TTGGCCTTTG TCCAGCACAT GATCGCCAGC CTGAACGCCG AAGGCATGAT GGGGGTGGTC ATGCCCCACG GGGTGCTGTT CCGGGGCGCC AGCGAAAAAG CCATCCGCCA GGGCATCCTG AAGGATGATC TACTGGAAGC GGTGATCGGC CTGCCCGCCG CTTTGTTCTA CGGCACCGGC ATCCCCGCCT GCCTGCTGAT CCTCAACAAA AACAAACCGG CGGAACGCAC AGGCAAGGTA TTGTTCATCA ACGGCGAGCT GGAATTTCAG GAAGGCAAGA ACCAGAACAA ACTGCGCCCG CAGGATATGG ACAAGATCGT TCGGACCTTC GATGACTACA GGGAGATCAA GCGGTATTCC AAGGTGGTCA GTTTGGCGGA CATTGCCGGG AACGATGATA ACCTGAATAT TCGCCGCTAC GCGGACACCT CGCCGCCCCC GGAAATCTTC GATGTCCGCG CCATTCTCCA CGGCGGTATT CCCGTGCGGG AAGTGGAAAG CGAGTATATC CGGGAAGAAA TACTGGAAGA CTTTGATGTG ACCATGGTCT TTGTGAGGCG GGATGAGCGC TACTTTGAGT TCAAGCCCGA GATCGAGTCC AAGGAAGCCA TCCGGGAAGC GGCTGGGGAA GTTGACGCCA AGGTGATCCA ACAACTGGAA CGCTGGTGGG ACAAGTACCG GGTGTCCCTG CATGAACTGG ACGCCCAGGT GGCGGCAGCT GAGGAAGTGA TGAAAGGCTA TCTGAAGGAG CTAGGGTATG AGTGA
|
Protein sequence | MSHNITLQQL ESFLWEAADI LRGNMDASEY KDYIFGMMFL KRLSDAFEEA QEGVIQYYLG KGKTDAEARE LANDEDEYDK TFYIPPIARW GALKDLKHDI GTELNKATEA IEEVNPSLEG VLVSIDFNIK NKLSDKKLRD LLRHFSRHRL RNEDFEHPDL LGTAYEYLIK MFADSAGKKG GEFYTPSEVV RLLVALLKPQ AGMRIYDPTA GSGGMLVQTR NYLARHGENP ANLSLFGQEM NLNTWAICKM NMFLHGVYSA DIRKGDTLRE PQHTQGGELM TFDRVIANPP FSLKKWGKDE ADKDAYGRFP YGTPPKDAGD LAFVQHMIAS LNAEGMMGVV MPHGVLFRGA SEKAIRQGIL KDDLLEAVIG LPAALFYGTG IPACLLILNK NKPAERTGKV LFINGELEFQ EGKNQNKLRP QDMDKIVRTF DDYREIKRYS KVVSLADIAG NDDNLNIRRY ADTSPPPEIF DVRAILHGGI PVREVESEYI REEILEDFDV TMVFVRRDER YFEFKPEIES KEAIREAAGE VDAKVIQQLE RWWDKYRVSL HELDAQVAAA EEVMKGYLKE LGYE
|
| |