Gene Noc_0059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0059 
Symbol 
ID3705935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp59989 
End bp62832 
Gene Length2844 bp 
Protein Length947 aa 
Translation table11 
GC content51% 
IMG OID637736584 
Producthypothetical protein 
Protein accessionYP_342131 
Protein GI77163606 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAA AACAAGAAAG CCTGTTCCAG ACCAGACTGG TTCCCGCCGA AGCGGGCTCC 
GGCAGGCTAT TCGACGAAGA ACTGGTGGCT GGTTCGGATG GGCCTGTAAA GTGTCTTGGT
CTTGAGTTTG AAAACGACGA AGCCCGTCGC ACCCACTTTA CCGAGGAGCT GCGCAAGAAG
CTGCAGGACC CGGAGTTCCG CAAGATCGAA GGTTTTCCCA TCGGCAGCGA CGAGGACATC
CTGAACCTGA GTGATCCGCC ATATTACACC GCCTGCCCGA ACCCATGGAT CGCCGACTTC
ATCACCGAGT GGGAGGCGCA AAAGCCGGAA CAGCCCGAAG GCTATCACTA TCACCGTGAG
CCTTTTGCCG CCGATGTCAG CGAAGGAAAA AACGACCCGA TTTACAATGC CCACTCTTAT
CATACCAAAG TGCCGCACAA GGCAATCATG CGGTACATCC TCCACTACAC GGAGCCAGGA
GACATCGTTT TTGACGGCTT CTGTGGTACT GGCATGACAG GAGTGGCCGC GCAGATGTGT
GGCGACCGTG AAGTGGTCAT GTCGCTCGGC TACCAGTTGA AGCCGGACGG GACCATTTTG
CAGGAAGAGA TGGACGAAGA TGGCAAAAAG GTTTGGCGAC CATTTTCAAA ACTGGGCGTC
CGCAGAGCCG TACTGAACGA CCTTTCACCG GCGGCGACGT TCCTTGCGCA TAACTACAAC
CTTTATGTTG ATACGGCATC ATTTGAAAAT GAAGCTAAGA AATTCATCAA AATAATCGAG
CGAGAATGTG GATGGATGTA TGAAACAACT CATACAGACG GGGTGACCAA GGCGAAAGTG
AACTACACAA TCTGGTCAGA TGTATTTGTC TGTCCTGATT GCACTAATGA GATTGTATTC
TGGGATGTTG CAGTCGACAA AGAATCTGAG ACAGTAAACG ATGAGTTTCA ATGTCCTAAT
TGCCAGACCA GCTTAACAAA GCGCAATATG GACCGTGCTT GGGTTACCAC TTATGACCGA
TTCTTGGGAG AGACGATTCG GCAGGCCAAG CAGATACCTG CTTTGATCAA TTATAGCGTT
GGTGGGAAAC GATATGAGAA GAAACCTGAC GAGGGTGACT TCGCAATTTT GGAGAAGATA
GAAAACGAAG GATTAGACGG ATGGTTTCCC ATCAATCGCA TGATTGAGGG GCATGAGTCT
AGGAGAAATG ATCCGGTTGG TATCACGCAC ACTCATCACT TTTATTCACC TCGAAATCTA
TCAGTGATGT CGAAGATTTT GGATTTGGCT GACAAATCTG AATTTACTCC CTTCAAATTC
GGTTTTCTGA ATACCTCTTG GCACGCCACG CAGATGAGGC AATACAACCC AGGTGGCGGG
CACAGACCTA GAACGGGTAC TTTGTATATG CCTTCCATCC ATAGTGAAGG TAACATGATT
CCCGTTTACA AGAAAAAGCT GAACCAGCTT GTTCAGTTTT ATAAAGTCAA GTCTCATCGA
AATAGGGTTG CCATTATTCA GACCATGTCG TCTACGGTGG AATCATCTAT CGAAGCGGGT
TTGGACTATG TATTCATTGA TCCGCCTTTC GGCGCAAATC TGAACTACTC AGAACTAAAT
TCAATATGGG AGGCATGGCT TAAAGTAAGT ACGAATAATG CCGAGGAGGC AATTGAGAAT
AGGTCACAGA ACAAGGGGAT TGATGAATAC CGATCTCTTA TGACTCAGTG CTTTCGACAG
GCATACAACC AATTGAAACC TGGGCGCTGG ATGACTGTAG AGTTTTCTAA TACAAGCGCT
GGTATTTGGA ACAATATTCA GACGGCAATA TCAGACGCTG GATTTATTGT CGCGAATGTC
TCTGTTCTAA ATAAAAAGCA GGGGTCGATA ATGGCTTACA CTACCCCCAC TGCTGTCAAA
CAAGACCTCG TTATCTCAGC CTACAAACCC AACGGCGGTT TTGAAGAGCG CTTTCAGAAA
GAGGCGCAAA CCGAAGAAGG TGTGTGGGAT TTTGTCCGAA CCCACCTAAA GTATCTGCCG
GTCACCAAGC AACAGGGAGC CTTGCTGCAG TTCGTCCCGG AACGCGATCC GCGCATCCTG
TTCGACCAAA TGGTGGCCTA CTACGTCCGC AAGGGCTACC CCGTGCCGAT CTCGAGCCAG
GAGTTCCAGA TTGGGCTGTC GCAGCGTTTC ATCGAGCGCG ACGGCATGTT TTTCCTGCCG
GATCAGGTGG CCGAATACGA CCGCAAGAAG ATGACCTCGG GCGAACTTAA GCAGATGTCC
ATGTTCGTCT CTGACGAGGC ATCCGCCATC CAGTGGCTCC ACCAGCTCAT CAAGGAAAAG
CCGCAGACCT TCTCCGACAT CAATCCGCAG TTTATGCAGC AGCTCGGCGG CTGGAGCAAA
AACGAGGCCC AGCTCGACCT GCGTGAACTG CTGAACCAAA ACTTCCTCAG CTACGACGGC
AAAGGCCCGG TACCCGAGCA GATCCACGCC TACCTCTCCA CCAATTGGAA AGAACTGCGC
AACCTGCCCA AGGACGACCC GACTCTGGTC GCCAAGGCCC GCGACCGCTG GTACGTGCCC
GATCCAAATA AGGCGGGCGA CCTGGAGAAA CTGCGCGAGA AGGCACTGCT CAAGGAGTTC
GAGGAATACA AAGAGGTCAA AAAGAAACTC AAGGTCTTCC GCCTGGAAGC TGTCCGCGCC
GGATTCAAGA AAGCCTGGCA GGAACGCGAC TACGCCGTCA TCGTCGCCGT GGCCGACAAG
ATCCCCAACA ACGTCCTGGA AGAAGATCCC AAGCTGCTCA TGTGGTACGA CCAGGCGGTA
ACAAGAATGG GAGGCGGTGA CTAG
 
Protein sequence
MKPKQESLFQ TRLVPAEAGS GRLFDEELVA GSDGPVKCLG LEFENDEARR THFTEELRKK 
LQDPEFRKIE GFPIGSDEDI LNLSDPPYYT ACPNPWIADF ITEWEAQKPE QPEGYHYHRE
PFAADVSEGK NDPIYNAHSY HTKVPHKAIM RYILHYTEPG DIVFDGFCGT GMTGVAAQMC
GDREVVMSLG YQLKPDGTIL QEEMDEDGKK VWRPFSKLGV RRAVLNDLSP AATFLAHNYN
LYVDTASFEN EAKKFIKIIE RECGWMYETT HTDGVTKAKV NYTIWSDVFV CPDCTNEIVF
WDVAVDKESE TVNDEFQCPN CQTSLTKRNM DRAWVTTYDR FLGETIRQAK QIPALINYSV
GGKRYEKKPD EGDFAILEKI ENEGLDGWFP INRMIEGHES RRNDPVGITH THHFYSPRNL
SVMSKILDLA DKSEFTPFKF GFLNTSWHAT QMRQYNPGGG HRPRTGTLYM PSIHSEGNMI
PVYKKKLNQL VQFYKVKSHR NRVAIIQTMS STVESSIEAG LDYVFIDPPF GANLNYSELN
SIWEAWLKVS TNNAEEAIEN RSQNKGIDEY RSLMTQCFRQ AYNQLKPGRW MTVEFSNTSA
GIWNNIQTAI SDAGFIVANV SVLNKKQGSI MAYTTPTAVK QDLVISAYKP NGGFEERFQK
EAQTEEGVWD FVRTHLKYLP VTKQQGALLQ FVPERDPRIL FDQMVAYYVR KGYPVPISSQ
EFQIGLSQRF IERDGMFFLP DQVAEYDRKK MTSGELKQMS MFVSDEASAI QWLHQLIKEK
PQTFSDINPQ FMQQLGGWSK NEAQLDLREL LNQNFLSYDG KGPVPEQIHA YLSTNWKELR
NLPKDDPTLV AKARDRWYVP DPNKAGDLEK LREKALLKEF EEYKEVKKKL KVFRLEAVRA
GFKKAWQERD YAVIVAVADK IPNNVLEEDP KLLMWYDQAV TRMGGGD