Gene Noc_0546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0546 
Symbol 
ID3706738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp586862 
End bp589873 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content59% 
IMG OID637737054 
ProductDNA methylase containing a Zn-ribbon 
Protein accessionYP_342596 
Protein GI77164071 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGCCC CCCAACAGGA GCAGACGCTA TGCCTTGAGG CACCGCCGCT CAAGAATACC 
CCCGCCCTCC TGGAGCGGGT CTTCCCGGCC CAGAAGATTT CCGCCGAAGC CCAGAAGGAA
AGAAAGGCCG GTTCGGGGAA AACCCTCACC GCCCTGGGCT CCTACTGGAA AGGCCGCAAG
CCCCTCATCC TGGTGCGGGC TCTTATTCTG GGTTCGTTAC TTCCCGCCAC GGATGATCCA
GAAACGGATT TAGCGATCTT CGAGCAATTG ATGGCCCTGG ATGAGGCGTC CTTCGGGCGG
CGGGAACCAA AACTAAGCGC GGCCCAGGTG GCGGCAAGAA TCACGCTGCC CCGTCCCTGG
GATTATTTCG ATTATAGTTT CAAGGACGCG ACGGTTGAGC CTACGAAAAT AGAGGAGCTG
ACCTTTCCCC TCCGGGCCGG GGACATCCCC GGGCTGTCCC TGCGCTGGAA GCGGGCTATT
CCCCTGGCGG ACAAGCAAAC ACTGCTGGCG GCGGCCCTAA AGGAGCTGCC CTACCCGGAC
AAAGTGGCCC TTTGCAAGCG ACCGGAGGAG TGCGATCCGG CAACCCTCTA CGGCCCCATC
TGGGACTCCG TCAATCAACA TCTGGGGCGC TTTGGCGTCC AGGCCCATAG CCACGAAGAG
CTGGTAGCCC AGTTGGGTAT GCTTCGCTTT GGGCACCGGC CCCGGGTGGG AGACACCTTC
TGCGGCAGCG GTTCCATTCC CTTCGAGGCC GCCCGCCTGG GCTGCGAGGT GTATGCCTCG
GATCTTAACC CCATTGCCTG CATGCTCACC TGGGGAGCGC TCAATATTAT TGGCGCTTCC
CCTGAGCGGC GAGATGAGAT CGCCCAGGCC CAGCAAGCGG TGGCCGCAGC GGTGAACCAG
GAAATCACCG CCCTCGGCAT TGAGCACAAC AGCCAAGGCG ATCGAGCGAA AGCCTATTTG
TATTGTCTGG AGACTCGCTG CCCGGAAACC GGCTGGCAGG TGCCTCTAGC GCCCAGTTGG
GTGATTTCTA AAACCCGCCA GGTTTATGCC AAGCTGATTC CAAATCCGCG GGAAAAACGC
TTTGAAATTG ACATTGTCAG CGGCGCTTCC CCAGAGGAGA TGGCAGCCGC TGAGCAAGGC
ACGGTCCAGC AGGGGCAGAT GGTGTATACG CTGGAAGGGA AAACCTACCG CACCTCCATC
AAAACCCTGC GGGGCGACTA TCGAGACGCC CAGGGCGTTA ACCGCAACCG CCTGCGGCAG
TGGGAGAAGC ACGATTTCAG GCCCCAGCCG GAGGATGTCT TTCAGGAGCG CCTTTACTCC
ATTCAGTGGA TCACCCAGGA AACGCTGGGG AAATCCCGGC AGCAGACCTA TTTCGCCCCG
GTCACCGAAG AAGATCGGGC GCGGGAGCGA CAAGTGGAGC AGATCGTGGC GGAAAATCTG
GCCTCCTGGC AAGAGCAAGG ACTCGTGCCC GATATGGCCA TTGAACCGGG TAAAGAGACC
ACGAGGCTTC AACGGGAGCG CGGCTGGCGG TATTGGCATC AATTGTTTAA TGCACGGCAG
CTACTTATTT CTTCGCTTTT CTGCAAGCAT CGACACCCCG TATCTGCGAT TTGTCTTTTA
AAGGCGGCTG ACTGGAATAA TCGGCTATGC CGGTGGGAGC CTTATTGGGC TAAGTCACAA
CAAGTCTTTT ACAATCAAGC ATTAAATACC TTTTATAATT ATGGGACTCG GGCGTATGAC
ATGCACATGC AGGCGTACGA TTTGCCTATG AGGCGATCTC AAACATTAGA CGTATCCAAT
TATGTTGAGA TGTTGGATTG CCGCTCAATT ACTGCGGTGG CAGATCTGTG GATCACCGAT
CCGCCCTACG GGGATGCGGT TCACTACCAC GAAATCACTG AGTTCTTTAT TGCCTGGCTG
CGGAAAAACC CGCCCGCCCC CTTCAATGAA TGGATCTGGG ACTCCCGCCG GGCGCTGGCC
ATTCAAGGCG CTAGCGACAA GTTCCGCCGC GATATGGTGG AAGCTTACCA AGCCATGACC
GAACACATGC CGGATAACGG CCGTCAATGC GTCATGTTTA CCCATCAGGA CAGCCGGGTG
TGGTCCGATA TGGCCGCTAT CTTCTGGGCG GCGGGTCTCC AGGTCATCAA CGCCTGGTAC
ATTGCCACCG AAACCAGCTC CGAGTTGAAA AAGGGCGGTT ATGTCCAGGG CACCGTGATT
CTGCTCCTGG GGAAACGGCC GCCCGGCCAG CGGGCGGGCT TTACCCCCCG TCTTCTGCCC
CAGGTGCGCA AGGAAGTCAA CGCCCAAATC CAGGACATGA TGCATCTTAA CGCGCGGACT
CAGGAACAGA TGGGGGCGCC CGTATTCACC GACTCCGATC TCCAGATGGC GGGCTACGCG
GCGGCCCTGA AGGTCCTCAC CGGCTACACG GAAATCAACG GCGAGGAAGT CACCCGCCTG
GCGCTGCGTC CCCGGCGGAA GGGGGAGAAA ACGGTGGTGA GTGAGATGGT TCAGCAAGCG
GCGGCAACCG CCAACAGCCT CCTCGTCCCT GAGGGCCTGC CCAAAGCGAC TTGGGAGGTG
ATTAGTGGCA TCCAGCGCTT CTACCTGCGG ATGGTGGCCC TGGAAACCAC CGGAGCCAGC
AAGCTGGATA ATTATCAAAA CTTCGCCAAA ACCTTCCGGG TGGACAACTA CCAAGCGGTG
ATGGCAAGCC TAAAACCCAA CAGAGCCCGG CTGAAAGGAG CCCAGGATTT CAAGCCCCGG
GAGCTGGCCG GAACCGAAAT CGGCGAGACT CTCCTGGGGC AGGTGCTGGT GGCGCTCCAG
GAACTTTTGG GGGAGAAGGA ACCACCGATC GTCATGGACA ATCTCCGGGA GGCCCTGCCG
GATTATTTTC AGCAACGCCC CCACCTCCAG GCCATGGCGC AATTTCTCGG TGACCAGCTC
GCCCAGCGGC GTCCCCAGGA AGCACGAGCC GCCGAGATCA TCGCCAGTCG GGTGCGCAAC
GAGCGCCTGT GA
 
Protein sequence
MLAPQQEQTL CLEAPPLKNT PALLERVFPA QKISAEAQKE RKAGSGKTLT ALGSYWKGRK 
PLILVRALIL GSLLPATDDP ETDLAIFEQL MALDEASFGR REPKLSAAQV AARITLPRPW
DYFDYSFKDA TVEPTKIEEL TFPLRAGDIP GLSLRWKRAI PLADKQTLLA AALKELPYPD
KVALCKRPEE CDPATLYGPI WDSVNQHLGR FGVQAHSHEE LVAQLGMLRF GHRPRVGDTF
CGSGSIPFEA ARLGCEVYAS DLNPIACMLT WGALNIIGAS PERRDEIAQA QQAVAAAVNQ
EITALGIEHN SQGDRAKAYL YCLETRCPET GWQVPLAPSW VISKTRQVYA KLIPNPREKR
FEIDIVSGAS PEEMAAAEQG TVQQGQMVYT LEGKTYRTSI KTLRGDYRDA QGVNRNRLRQ
WEKHDFRPQP EDVFQERLYS IQWITQETLG KSRQQTYFAP VTEEDRARER QVEQIVAENL
ASWQEQGLVP DMAIEPGKET TRLQRERGWR YWHQLFNARQ LLISSLFCKH RHPVSAICLL
KAADWNNRLC RWEPYWAKSQ QVFYNQALNT FYNYGTRAYD MHMQAYDLPM RRSQTLDVSN
YVEMLDCRSI TAVADLWITD PPYGDAVHYH EITEFFIAWL RKNPPAPFNE WIWDSRRALA
IQGASDKFRR DMVEAYQAMT EHMPDNGRQC VMFTHQDSRV WSDMAAIFWA AGLQVINAWY
IATETSSELK KGGYVQGTVI LLLGKRPPGQ RAGFTPRLLP QVRKEVNAQI QDMMHLNART
QEQMGAPVFT DSDLQMAGYA AALKVLTGYT EINGEEVTRL ALRPRRKGEK TVVSEMVQQA
AATANSLLVP EGLPKATWEV ISGIQRFYLR MVALETTGAS KLDNYQNFAK TFRVDNYQAV
MASLKPNRAR LKGAQDFKPR ELAGTEIGET LLGQVLVALQ ELLGEKEPPI VMDNLREALP
DYFQQRPHLQ AMAQFLGDQL AQRRPQEARA AEIIASRVRN ERL