Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0059 |
Symbol | |
ID | 3705935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 59989 |
End bp | 62832 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637736584 |
Product | hypothetical protein |
Protein accession | YP_342131 |
Protein GI | 77163606 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAA AACAAGAAAG CCTGTTCCAG ACCAGACTGG TTCCCGCCGA AGCGGGCTCC GGCAGGCTAT TCGACGAAGA ACTGGTGGCT GGTTCGGATG GGCCTGTAAA GTGTCTTGGT CTTGAGTTTG AAAACGACGA AGCCCGTCGC ACCCACTTTA CCGAGGAGCT GCGCAAGAAG CTGCAGGACC CGGAGTTCCG CAAGATCGAA GGTTTTCCCA TCGGCAGCGA CGAGGACATC CTGAACCTGA GTGATCCGCC ATATTACACC GCCTGCCCGA ACCCATGGAT CGCCGACTTC ATCACCGAGT GGGAGGCGCA AAAGCCGGAA CAGCCCGAAG GCTATCACTA TCACCGTGAG CCTTTTGCCG CCGATGTCAG CGAAGGAAAA AACGACCCGA TTTACAATGC CCACTCTTAT CATACCAAAG TGCCGCACAA GGCAATCATG CGGTACATCC TCCACTACAC GGAGCCAGGA GACATCGTTT TTGACGGCTT CTGTGGTACT GGCATGACAG GAGTGGCCGC GCAGATGTGT GGCGACCGTG AAGTGGTCAT GTCGCTCGGC TACCAGTTGA AGCCGGACGG GACCATTTTG CAGGAAGAGA TGGACGAAGA TGGCAAAAAG GTTTGGCGAC CATTTTCAAA ACTGGGCGTC CGCAGAGCCG TACTGAACGA CCTTTCACCG GCGGCGACGT TCCTTGCGCA TAACTACAAC CTTTATGTTG ATACGGCATC ATTTGAAAAT GAAGCTAAGA AATTCATCAA AATAATCGAG CGAGAATGTG GATGGATGTA TGAAACAACT CATACAGACG GGGTGACCAA GGCGAAAGTG AACTACACAA TCTGGTCAGA TGTATTTGTC TGTCCTGATT GCACTAATGA GATTGTATTC TGGGATGTTG CAGTCGACAA AGAATCTGAG ACAGTAAACG ATGAGTTTCA ATGTCCTAAT TGCCAGACCA GCTTAACAAA GCGCAATATG GACCGTGCTT GGGTTACCAC TTATGACCGA TTCTTGGGAG AGACGATTCG GCAGGCCAAG CAGATACCTG CTTTGATCAA TTATAGCGTT GGTGGGAAAC GATATGAGAA GAAACCTGAC GAGGGTGACT TCGCAATTTT GGAGAAGATA GAAAACGAAG GATTAGACGG ATGGTTTCCC ATCAATCGCA TGATTGAGGG GCATGAGTCT AGGAGAAATG ATCCGGTTGG TATCACGCAC ACTCATCACT TTTATTCACC TCGAAATCTA TCAGTGATGT CGAAGATTTT GGATTTGGCT GACAAATCTG AATTTACTCC CTTCAAATTC GGTTTTCTGA ATACCTCTTG GCACGCCACG CAGATGAGGC AATACAACCC AGGTGGCGGG CACAGACCTA GAACGGGTAC TTTGTATATG CCTTCCATCC ATAGTGAAGG TAACATGATT CCCGTTTACA AGAAAAAGCT GAACCAGCTT GTTCAGTTTT ATAAAGTCAA GTCTCATCGA AATAGGGTTG CCATTATTCA GACCATGTCG TCTACGGTGG AATCATCTAT CGAAGCGGGT TTGGACTATG TATTCATTGA TCCGCCTTTC GGCGCAAATC TGAACTACTC AGAACTAAAT TCAATATGGG AGGCATGGCT TAAAGTAAGT ACGAATAATG CCGAGGAGGC AATTGAGAAT AGGTCACAGA ACAAGGGGAT TGATGAATAC CGATCTCTTA TGACTCAGTG CTTTCGACAG GCATACAACC AATTGAAACC TGGGCGCTGG ATGACTGTAG AGTTTTCTAA TACAAGCGCT GGTATTTGGA ACAATATTCA GACGGCAATA TCAGACGCTG GATTTATTGT CGCGAATGTC TCTGTTCTAA ATAAAAAGCA GGGGTCGATA ATGGCTTACA CTACCCCCAC TGCTGTCAAA CAAGACCTCG TTATCTCAGC CTACAAACCC AACGGCGGTT TTGAAGAGCG CTTTCAGAAA GAGGCGCAAA CCGAAGAAGG TGTGTGGGAT TTTGTCCGAA CCCACCTAAA GTATCTGCCG GTCACCAAGC AACAGGGAGC CTTGCTGCAG TTCGTCCCGG AACGCGATCC GCGCATCCTG TTCGACCAAA TGGTGGCCTA CTACGTCCGC AAGGGCTACC CCGTGCCGAT CTCGAGCCAG GAGTTCCAGA TTGGGCTGTC GCAGCGTTTC ATCGAGCGCG ACGGCATGTT TTTCCTGCCG GATCAGGTGG CCGAATACGA CCGCAAGAAG ATGACCTCGG GCGAACTTAA GCAGATGTCC ATGTTCGTCT CTGACGAGGC ATCCGCCATC CAGTGGCTCC ACCAGCTCAT CAAGGAAAAG CCGCAGACCT TCTCCGACAT CAATCCGCAG TTTATGCAGC AGCTCGGCGG CTGGAGCAAA AACGAGGCCC AGCTCGACCT GCGTGAACTG CTGAACCAAA ACTTCCTCAG CTACGACGGC AAAGGCCCGG TACCCGAGCA GATCCACGCC TACCTCTCCA CCAATTGGAA AGAACTGCGC AACCTGCCCA AGGACGACCC GACTCTGGTC GCCAAGGCCC GCGACCGCTG GTACGTGCCC GATCCAAATA AGGCGGGCGA CCTGGAGAAA CTGCGCGAGA AGGCACTGCT CAAGGAGTTC GAGGAATACA AAGAGGTCAA AAAGAAACTC AAGGTCTTCC GCCTGGAAGC TGTCCGCGCC GGATTCAAGA AAGCCTGGCA GGAACGCGAC TACGCCGTCA TCGTCGCCGT GGCCGACAAG ATCCCCAACA ACGTCCTGGA AGAAGATCCC AAGCTGCTCA TGTGGTACGA CCAGGCGGTA ACAAGAATGG GAGGCGGTGA CTAG
|
Protein sequence | MKPKQESLFQ TRLVPAEAGS GRLFDEELVA GSDGPVKCLG LEFENDEARR THFTEELRKK LQDPEFRKIE GFPIGSDEDI LNLSDPPYYT ACPNPWIADF ITEWEAQKPE QPEGYHYHRE PFAADVSEGK NDPIYNAHSY HTKVPHKAIM RYILHYTEPG DIVFDGFCGT GMTGVAAQMC GDREVVMSLG YQLKPDGTIL QEEMDEDGKK VWRPFSKLGV RRAVLNDLSP AATFLAHNYN LYVDTASFEN EAKKFIKIIE RECGWMYETT HTDGVTKAKV NYTIWSDVFV CPDCTNEIVF WDVAVDKESE TVNDEFQCPN CQTSLTKRNM DRAWVTTYDR FLGETIRQAK QIPALINYSV GGKRYEKKPD EGDFAILEKI ENEGLDGWFP INRMIEGHES RRNDPVGITH THHFYSPRNL SVMSKILDLA DKSEFTPFKF GFLNTSWHAT QMRQYNPGGG HRPRTGTLYM PSIHSEGNMI PVYKKKLNQL VQFYKVKSHR NRVAIIQTMS STVESSIEAG LDYVFIDPPF GANLNYSELN SIWEAWLKVS TNNAEEAIEN RSQNKGIDEY RSLMTQCFRQ AYNQLKPGRW MTVEFSNTSA GIWNNIQTAI SDAGFIVANV SVLNKKQGSI MAYTTPTAVK QDLVISAYKP NGGFEERFQK EAQTEEGVWD FVRTHLKYLP VTKQQGALLQ FVPERDPRIL FDQMVAYYVR KGYPVPISSQ EFQIGLSQRF IERDGMFFLP DQVAEYDRKK MTSGELKQMS MFVSDEASAI QWLHQLIKEK PQTFSDINPQ FMQQLGGWSK NEAQLDLREL LNQNFLSYDG KGPVPEQIHA YLSTNWKELR NLPKDDPTLV AKARDRWYVP DPNKAGDLEK LREKALLKEF EEYKEVKKKL KVFRLEAVRA GFKKAWQERD YAVIVAVADK IPNNVLEEDP KLLMWYDQAV TRMGGGD
|
| |