Gene Noc_2687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2687 
Symbol 
ID3704444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3042946 
End bp3044931 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content62% 
IMG OID637739169 
ProductN-6 DNA methylase 
Protein accessionYP_344670 
Protein GI77166145 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.965628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCG AGAATCATTC CCAGACGGCT GCTTTCCTAT GGTCCATAGC CGATCTGTTG 
CGCGGTGACT TCAAGCAGTC CCAGTATGGC CGCATCATTT TGCCGTTCAC CCTGCTGCGG
CGCATGGAGT GCGTGCTCGA GCCAACCAAG GACAAGGTCG TCCAGCAGGC GACAGTGCAT
CAGCACAAGC CCGACCACGT GCGTGAGATG CTGCTGCGGC GCGCCGCCGG TGATCTGCAA
TTCTTCAACG CCTCGCCGCT GACCCTGGGC ACCCTGTCCG ACACCCAGAC CGCCGCCGAC
CTGATGAGCT ACGCGCAGTC CTTCAGCACC GACGCTTGCG AAATCTTCGA GCATTTCGAG
TTCGAGAACT TCGTCCAGCA GCTTAGCAGC GCCAACCTGC TGTACCAGGT GGTGCAACGC
TTCGCCGCCA CCGATCTCAG CCCGGCGCGC ATCAGCAACT TCGGCATGGG CATCATCTTC
GAGGAGTTGA TACGCCGCTT CGCTGAAAGC TCCAATGAAA CCGCCGGGGA GCACTTCACC
CCGCGGGATA TCGTCCACCT GACCACCTCG CTGGTGATCA CCGGCCAGGA CGACAAGCTG
GCGCCCAACC GCATCGTCAC CATTTATGAC CCTACCGCAG GCACGGGCGG CTTTCTCTCC
GAAGGTGACG AATACATCCA GTCCATCAGC GAAAAGGTCT CGGTATCCCT CCACGGCCAG
GAGCTCAACC CCGAGTCCTA CGCCATCTGC AAGGCCGACA TGCTAATCAA GGGCCAGGAT
GTGGCCAATA TCAAGCTGGG CAACACCCTG TCCAACGACC AGCTAACCGG CCCGGAACAC
CGCTTCGACT TCATGCTCTC CAATCCACCC TTCGGCGTGG AATGGAAAAA GGTCCAGAAG
CAGATCACCG GCGAGCACAA GCACAAGGGC TTCAACGGCC GCTTCGGCCC CGGCCTGCCA
CGGGTGTCCG ACGGCTCATT GCTGTTCCTG CTGCACCTGG TCAGCAAAAT GCGCGATCCT
CGCGACGGCG GTTCGCGCAT CGGTATTATC TTGAACGGCT CGCCCCTGTT CACCGGCGGC
GCTGGCTCCG GCGAATCCGA GATCCGCCGC TACCTGCTGC AGCACGATCT GGTCGAGGCC
ATCGTCGCCC TGCCCACCGA CATGTTCTAC AACACCGGCA TCGCCACCTA TGTGTGGCTA
CTGTCCAACC ACAAGCCCGC CGAGCGCAGG GGCAAGGTGC AGCTCATCGA CGGCAGCCAG
CACTTTGCCA AGATGCGCAA ATCCCTGGGC AGCAAGCGCC AGTACGTCAC CGCCGAGCAG
ATCAACGAGC TGGTATGCCT CTACGGCGCC TTCGAGGAAA CCCCCCAGAG CAAAATCTTC
CCCATCAACG CCTTCGGCTA CCGCCGCATC ACCGTCGAGC GCCCCCTGCG GCTCAACTTC
CAGGCCAGCG CCGAGCGCAT TGACAACGTG CTGCAGGAAA AAGCCATCCA GAAGCTGGAT
GACACCGCCC GCCAGCAACT CGCCGACGCC CTGGGAGCCA TGGACCCAAG CCCGTTGTAT
CGCAACCGGG AGCAGTTCGC CAAGCTCCTG AAAAAGACCC TAACCGCCCA CGGTGTCAGC
CTCAGTACGC CGGAGCAAAA GGCCCTATTG AACGGCCTCG GCAAGCGCGA CCCCAAGGCG
GATATCTGCA CAACCAAGGG CAAGCCCGAG CCGGACACCG GCCTGCGGGA CAACGAGAAC
GTGCCCCTGG GTGAATCCGT GTACGACTAC TTCCAGCGCG AAGTCATCCC CCACGTGCCC
GACGCCTGGA TCAACGAGAG CAAGCGTGAT GCGCTGGACG GCGAAGTCGG CATCGTCGGC
TTCGAGATCC CCTTCAACCG CCACTTCTAC GTATTCCAGC CGCCGCGCCC GCTGGAGGAA
ATCGACCGGG ACCTGAAAGC CTGCACTGAT CGCATCAAGC AAATGATTGA GGAGTTGTCG
GCATGA
 
Protein sequence
MNTENHSQTA AFLWSIADLL RGDFKQSQYG RIILPFTLLR RMECVLEPTK DKVVQQATVH 
QHKPDHVREM LLRRAAGDLQ FFNASPLTLG TLSDTQTAAD LMSYAQSFST DACEIFEHFE
FENFVQQLSS ANLLYQVVQR FAATDLSPAR ISNFGMGIIF EELIRRFAES SNETAGEHFT
PRDIVHLTTS LVITGQDDKL APNRIVTIYD PTAGTGGFLS EGDEYIQSIS EKVSVSLHGQ
ELNPESYAIC KADMLIKGQD VANIKLGNTL SNDQLTGPEH RFDFMLSNPP FGVEWKKVQK
QITGEHKHKG FNGRFGPGLP RVSDGSLLFL LHLVSKMRDP RDGGSRIGII LNGSPLFTGG
AGSGESEIRR YLLQHDLVEA IVALPTDMFY NTGIATYVWL LSNHKPAERR GKVQLIDGSQ
HFAKMRKSLG SKRQYVTAEQ INELVCLYGA FEETPQSKIF PINAFGYRRI TVERPLRLNF
QASAERIDNV LQEKAIQKLD DTARQQLADA LGAMDPSPLY RNREQFAKLL KKTLTAHGVS
LSTPEQKALL NGLGKRDPKA DICTTKGKPE PDTGLRDNEN VPLGESVYDY FQREVIPHVP
DAWINESKRD ALDGEVGIVG FEIPFNRHFY VFQPPRPLEE IDRDLKACTD RIKQMIEELS
A