Gene Noc_0439 details


Gene Information

Locus tag: Noc_0439
Symbol:
ID: 3706610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 476041
End bp: 477645
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 54%
IMG OID: 637736949
Product: Type I restriction-modification system M subunit
Protein accession: YP_342493
Protein GI: 77163968
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
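
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither is stated in the record, but both are consistent with its values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(476041, 477645)
print(length)                     # 1605
print(protein_length_aa(length))  # 534
```

Both results agree with the Gene Length and Protein Length fields of the record.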


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00153392
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCGGG ATCAACTCAG CCAGCTTGGC AAAACCCTTT GGGCCATTGC CGACGATCTG 
CGCGGCGCCA TGAATGCCGA TGACTTCCGC GACTATATGC TGTCGTTTTT GTTCTTGCGC
TATCTGTCCG ATAACTACGA AGCGGCGGCG AAAAAAGAAC TGGGGCCGGA CTATCCCAAG
CTGGCCGACG ACGACCGCCG CGCGCCCTTG GCGGTGTGGT ATGAGGATAA TGCCGAGGAT
GTGGCCGCCT TTGAAAAACA GATGCGCCGC AAGATGCACT ATGTCATCCA CCCGGATTAT
CTATGGAGCA GTATTGCCGA GCTGGCTCGC ACCCAGGATG AGGAGCTCTT GCAAACATTG
GCGGGCGGCT TTAAACACAT TGAAAACGAG TCCTTCGCCA GCACTTTTCA GGGCTTGTTC
TCGGAAATTA ATCTGCGTTC GGAAAAGTTG GGCAGAACCC TCGCGGACCA AAACAGAAAG
CTTTGCACCA TCATCACTAA AATCGCCAAG GGCATCGCAC GGTTCTCCAC CGGCAGCGAT
ATTCTGGGCG ATGCCTATGA ATATCTGATT GGCCAGTTCG CCGCTGGTTC CGGCAAAAAG
GCCGGCGAGT TTTACACCCC CCAAAGCGTC TCCACCATCC TCTCCCGCAT TGTGACACTG
GACAGTCAGG AGCCCTCCAC CGGCAAAAAG AAAAAGCTTA ATCGCGTGCT GGATTTTGCC
TGTGGTTCCG GCTCGTTGCT GCTCAACGTG CGCAACCAAA TGGGGCCGCG GGGTATCGGC
ATGATCTACG GCCAGGAAAA GAATATCACC ACCTATAACC TGGCGCGCAT GAATATGCTG
CTCCATGGCA TGAAGGATAC CGAGTTCCAG ATCCACCACG GCGATACGCT GGAAAACGAC
TGGGCTATTC TCAACGAGAG GAACCCCGCT AAAAAACTCC AATGCGATGC GGTCGTGGCC
AACCCGCCGT TCAGTTATCG CTGGGAGCCC GACGAGGCCA TGGGCGAGGA TTTCCGCTTC
GAAAGCCACG GCCTTGCCCC CAAATCAGCG GCGGATTTCG CCTTTCTACT CCACGGCCTT
CACTTTTTGA GCGACGAGGG CACCATGGCC ATTGTCCTGC CCCACGGCGT CCTTTTTCGG
GGCGGCGCGG AAGCGCGTAT CCGCACCAAA TTGCTGAAAG ACGGCCACAT CGATACCGTG
ATCGGCCTGC CCTCCAATCT GTTTTTCTCC ACCGGCATAC CGGTATGTAT TCTGGTGCTG
AAAAAGTGCA AAAAGCCCGA CGACGTACTG TTTATCAATG CCAGCGAGCA CTTTGAAAAA
GGCAAGCGCC AAAACGCCCT ACGGCCCGTG GACATAGATA AAATTGTTGA TACCTATCAA
TACCGCAAGG AAGAGGAACG CTACGCCCGG CGCGTCAGTA GGGAGGAGAT CGAGAAAAAC
GATGACAACC TCAATATTTC CCGCTACGTC AGCACCGCTA AGCCTGAGCC AGAGGTGGAC
CTAGATGACG TTCATAAAAA GCTCACCGCT ATTGAGCAGC GCATATCGCA AACAACAGAA
ACCCATAATC AGTTTCTCAA AGCATTGGGG CTGCCAATAT TGTGA
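
A GC content figure such as the one in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in as text with the 10-base blocks and line breaks shown above; the helper name `gc_content_percent` is hypothetical, and how the database rounds its own value is not stated.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 4 G + 3 C out of 10 bases.
print(gc_content_percent("ATGACCCGGG"))  # 70.0
```

Passing the whole gene sequence to the same function gives the gene-wide percentage for comparison with the GC content field.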
 
Protein sequence
MTRDQLSQLG KTLWAIADDL RGAMNADDFR DYMLSFLFLR YLSDNYEAAA KKELGPDYPK 
LADDDRRAPL AVWYEDNAED VAAFEKQMRR KMHYVIHPDY LWSSIAELAR TQDEELLQTL
AGGFKHIENE SFASTFQGLF SEINLRSEKL GRTLADQNRK LCTIITKIAK GIARFSTGSD
ILGDAYEYLI GQFAAGSGKK AGEFYTPQSV STILSRIVTL DSQEPSTGKK KKLNRVLDFA
CGSGSLLLNV RNQMGPRGIG MIYGQEKNIT TYNLARMNML LHGMKDTEFQ IHHGDTLEND
WAILNERNPA KKLQCDAVVA NPPFSYRWEP DEAMGEDFRF ESHGLAPKSA ADFAFLLHGL
HFLSDEGTMA IVLPHGVLFR GGAEARIRTK LLKDGHIDTV IGLPSNLFFS TGIPVCILVL
KKCKKPDDVL FINASEHFEK GKRQNALRPV DIDKIVDTYQ YRKEEERYAR RVSREEIEKN
DDNLNISRYV STAKPEPEVD LDDVHKKLTA IEQRISQTTE THNQFLKALG LPIL
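
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the amino-acid string of the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (the two tables differ in which codons may act as starts). It assumes the gene sequence is given in coding orientation, as its ATG start suggests; the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for codons enumerated with bases in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACCCGGG ATCAACTCAG CCAGCTTGGC"))  # MTRDQLSQLG
```

The output matches the first block of the protein sequence above; the same call on the full gene sequence yields the complete translation.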