Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0546 |
Symbol | |
ID | 3706738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 586862 |
End bp | 589873 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637737054 |
Product | DNA methylase containing a Zn-ribbon |
Protein accession | YP_342596 |
Protein GI | 77164071 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGCCC CCCAACAGGA GCAGACGCTA TGCCTTGAGG CACCGCCGCT CAAGAATACC CCCGCCCTCC TGGAGCGGGT CTTCCCGGCC CAGAAGATTT CCGCCGAAGC CCAGAAGGAA AGAAAGGCCG GTTCGGGGAA AACCCTCACC GCCCTGGGCT CCTACTGGAA AGGCCGCAAG CCCCTCATCC TGGTGCGGGC TCTTATTCTG GGTTCGTTAC TTCCCGCCAC GGATGATCCA GAAACGGATT TAGCGATCTT CGAGCAATTG ATGGCCCTGG ATGAGGCGTC CTTCGGGCGG CGGGAACCAA AACTAAGCGC GGCCCAGGTG GCGGCAAGAA TCACGCTGCC CCGTCCCTGG GATTATTTCG ATTATAGTTT CAAGGACGCG ACGGTTGAGC CTACGAAAAT AGAGGAGCTG ACCTTTCCCC TCCGGGCCGG GGACATCCCC GGGCTGTCCC TGCGCTGGAA GCGGGCTATT CCCCTGGCGG ACAAGCAAAC ACTGCTGGCG GCGGCCCTAA AGGAGCTGCC CTACCCGGAC AAAGTGGCCC TTTGCAAGCG ACCGGAGGAG TGCGATCCGG CAACCCTCTA CGGCCCCATC TGGGACTCCG TCAATCAACA TCTGGGGCGC TTTGGCGTCC AGGCCCATAG CCACGAAGAG CTGGTAGCCC AGTTGGGTAT GCTTCGCTTT GGGCACCGGC CCCGGGTGGG AGACACCTTC TGCGGCAGCG GTTCCATTCC CTTCGAGGCC GCCCGCCTGG GCTGCGAGGT GTATGCCTCG GATCTTAACC CCATTGCCTG CATGCTCACC TGGGGAGCGC TCAATATTAT TGGCGCTTCC CCTGAGCGGC GAGATGAGAT CGCCCAGGCC CAGCAAGCGG TGGCCGCAGC GGTGAACCAG GAAATCACCG CCCTCGGCAT TGAGCACAAC AGCCAAGGCG ATCGAGCGAA AGCCTATTTG TATTGTCTGG AGACTCGCTG CCCGGAAACC GGCTGGCAGG TGCCTCTAGC GCCCAGTTGG GTGATTTCTA AAACCCGCCA GGTTTATGCC AAGCTGATTC CAAATCCGCG GGAAAAACGC TTTGAAATTG ACATTGTCAG CGGCGCTTCC CCAGAGGAGA TGGCAGCCGC TGAGCAAGGC ACGGTCCAGC AGGGGCAGAT GGTGTATACG CTGGAAGGGA AAACCTACCG CACCTCCATC AAAACCCTGC GGGGCGACTA TCGAGACGCC CAGGGCGTTA ACCGCAACCG CCTGCGGCAG TGGGAGAAGC ACGATTTCAG GCCCCAGCCG GAGGATGTCT TTCAGGAGCG CCTTTACTCC ATTCAGTGGA TCACCCAGGA AACGCTGGGG AAATCCCGGC AGCAGACCTA TTTCGCCCCG GTCACCGAAG AAGATCGGGC GCGGGAGCGA CAAGTGGAGC AGATCGTGGC GGAAAATCTG GCCTCCTGGC AAGAGCAAGG ACTCGTGCCC GATATGGCCA TTGAACCGGG TAAAGAGACC ACGAGGCTTC AACGGGAGCG CGGCTGGCGG TATTGGCATC AATTGTTTAA TGCACGGCAG CTACTTATTT CTTCGCTTTT CTGCAAGCAT CGACACCCCG TATCTGCGAT TTGTCTTTTA AAGGCGGCTG ACTGGAATAA TCGGCTATGC CGGTGGGAGC CTTATTGGGC TAAGTCACAA CAAGTCTTTT ACAATCAAGC ATTAAATACC TTTTATAATT ATGGGACTCG GGCGTATGAC ATGCACATGC AGGCGTACGA TTTGCCTATG AGGCGATCTC AAACATTAGA CGTATCCAAT TATGTTGAGA TGTTGGATTG CCGCTCAATT ACTGCGGTGG CAGATCTGTG GATCACCGAT CCGCCCTACG GGGATGCGGT TCACTACCAC GAAATCACTG AGTTCTTTAT TGCCTGGCTG CGGAAAAACC CGCCCGCCCC CTTCAATGAA TGGATCTGGG ACTCCCGCCG GGCGCTGGCC ATTCAAGGCG CTAGCGACAA GTTCCGCCGC GATATGGTGG AAGCTTACCA AGCCATGACC GAACACATGC CGGATAACGG CCGTCAATGC GTCATGTTTA CCCATCAGGA CAGCCGGGTG TGGTCCGATA TGGCCGCTAT CTTCTGGGCG GCGGGTCTCC AGGTCATCAA CGCCTGGTAC ATTGCCACCG AAACCAGCTC CGAGTTGAAA AAGGGCGGTT ATGTCCAGGG CACCGTGATT CTGCTCCTGG GGAAACGGCC GCCCGGCCAG CGGGCGGGCT TTACCCCCCG TCTTCTGCCC CAGGTGCGCA AGGAAGTCAA CGCCCAAATC CAGGACATGA TGCATCTTAA CGCGCGGACT CAGGAACAGA TGGGGGCGCC CGTATTCACC GACTCCGATC TCCAGATGGC GGGCTACGCG GCGGCCCTGA AGGTCCTCAC CGGCTACACG GAAATCAACG GCGAGGAAGT CACCCGCCTG GCGCTGCGTC CCCGGCGGAA GGGGGAGAAA ACGGTGGTGA GTGAGATGGT TCAGCAAGCG GCGGCAACCG CCAACAGCCT CCTCGTCCCT GAGGGCCTGC CCAAAGCGAC TTGGGAGGTG ATTAGTGGCA TCCAGCGCTT CTACCTGCGG ATGGTGGCCC TGGAAACCAC CGGAGCCAGC AAGCTGGATA ATTATCAAAA CTTCGCCAAA ACCTTCCGGG TGGACAACTA CCAAGCGGTG ATGGCAAGCC TAAAACCCAA CAGAGCCCGG CTGAAAGGAG CCCAGGATTT CAAGCCCCGG GAGCTGGCCG GAACCGAAAT CGGCGAGACT CTCCTGGGGC AGGTGCTGGT GGCGCTCCAG GAACTTTTGG GGGAGAAGGA ACCACCGATC GTCATGGACA ATCTCCGGGA GGCCCTGCCG GATTATTTTC AGCAACGCCC CCACCTCCAG GCCATGGCGC AATTTCTCGG TGACCAGCTC GCCCAGCGGC GTCCCCAGGA AGCACGAGCC GCCGAGATCA TCGCCAGTCG GGTGCGCAAC GAGCGCCTGT GA
|
Protein sequence | MLAPQQEQTL CLEAPPLKNT PALLERVFPA QKISAEAQKE RKAGSGKTLT ALGSYWKGRK PLILVRALIL GSLLPATDDP ETDLAIFEQL MALDEASFGR REPKLSAAQV AARITLPRPW DYFDYSFKDA TVEPTKIEEL TFPLRAGDIP GLSLRWKRAI PLADKQTLLA AALKELPYPD KVALCKRPEE CDPATLYGPI WDSVNQHLGR FGVQAHSHEE LVAQLGMLRF GHRPRVGDTF CGSGSIPFEA ARLGCEVYAS DLNPIACMLT WGALNIIGAS PERRDEIAQA QQAVAAAVNQ EITALGIEHN SQGDRAKAYL YCLETRCPET GWQVPLAPSW VISKTRQVYA KLIPNPREKR FEIDIVSGAS PEEMAAAEQG TVQQGQMVYT LEGKTYRTSI KTLRGDYRDA QGVNRNRLRQ WEKHDFRPQP EDVFQERLYS IQWITQETLG KSRQQTYFAP VTEEDRARER QVEQIVAENL ASWQEQGLVP DMAIEPGKET TRLQRERGWR YWHQLFNARQ LLISSLFCKH RHPVSAICLL KAADWNNRLC RWEPYWAKSQ QVFYNQALNT FYNYGTRAYD MHMQAYDLPM RRSQTLDVSN YVEMLDCRSI TAVADLWITD PPYGDAVHYH EITEFFIAWL RKNPPAPFNE WIWDSRRALA IQGASDKFRR DMVEAYQAMT EHMPDNGRQC VMFTHQDSRV WSDMAAIFWA AGLQVINAWY IATETSSELK KGGYVQGTVI LLLGKRPPGQ RAGFTPRLLP QVRKEVNAQI QDMMHLNART QEQMGAPVFT DSDLQMAGYA AALKVLTGYT EINGEEVTRL ALRPRRKGEK TVVSEMVQQA AATANSLLVP EGLPKATWEV ISGIQRFYLR MVALETTGAS KLDNYQNFAK TFRVDNYQAV MASLKPNRAR LKGAQDFKPR ELAGTEIGET LLGQVLVALQ ELLGEKEPPI VMDNLREALP DYFQQRPHLQ AMAQFLGDQL AQRRPQEARA AEIIASRVRN ERL
|
| |