Gene Nmar_0608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0608 
Symbol 
ID5773842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp544537 
End bp547557 
Gene Length3021 bp 
Protein Length1006 aa 
Translation table11 
GC content36% 
IMG OID641316243 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_001581942 
Protein GI161528116 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTTA GAATAGAGGA TAATGAGGAA GGGCGACAAG TTGAACTTCC TGCATTAGAG 
CTAATCACTG CATTAGGCTA CACATACATT CCAAATTATG AGCTAAACAT TCCTAAAGAA
AGGCCTGATC ACAGGCAAGT ACTGCTATAT CCTAGGCTCA GGGCTGCAAT AAAACGACTA
AACGACTTTG ATGATGATGG AATAGAACAG GCAATCGCCC AGATTCATGA AGACAACTTT
CCAATTGGAA TGCCGATGAT TGATGCAAAT GAAAAAATTC GAATCAAACT AACCGGTCGT
GCAGGAGAAA ACGCAATTGC ACAACCCATC ACCATAAAAC AGTATGGAAA GAAAGGAATT
GAATATCCTA CTGTAAAATT CTTTGATTTT GATCCTAAAA AAATTGGAAC TAAAGAAGAC
AAAAACGAGT ATCTTGTCAC CAACCAATTC AAACATCTTG GCAATCGAAC TGAAATTGAA
TGCGACATTG TTATTTTTGT AAATGGTATT CCCCTAGTCC TAATTGAATG CAAAAAACCA
ACAACTATTG ATCTTATGAA GACAGTATGG AAAGAAAATC TGGAAAAATA TCAAAGAGAT
GGCTCTCACA GTTTGGGTCA TGAAAAATTA TTTTTCTTTA ATCACGTAAT CATGGCAACT
AGCGCTATTC AAGCAAGATA TGGAACATTA AAGGCACTGC CAAACAAGTA TGCAAAATGG
ACTAGTCTTA CAAACATCAC GACAAAGGAA CTAGAAAAGT TGGTTGGAAG AAAACCCACA
CCACAAGACA TTATGCTTGC TGGAATGATA AAAAAAGAAA CACTGCTTGA TATGCTCAAA
AATTTTGTTC TGTATGAAAT TGAAGAACAT AAAAAAATAA AAAAAGTTGC AAAACATCAA
CAATACCGTG TAGTTACAAA GTCTGTAGAC AGAATAAGTC ATCATAAAAA AGTAGAAGAC
AAGGGTGGTG TCATTTGGCA CACTCAGGGT TCAGGAAAAT CTCTTTCAAT GGTATGGTTT
GCAACTCAAC TCTATTACAA ATTCAGCAGA CCGACAATCA TGGTAATTAC TGATAGGAGG
CAACTTAACA AGCAGATTTT TGACACCTTT AGAAATTGTG GATTCCCAGA GCCTGAAAAA
CCACGAAATC GAAGTCAATT AGCAAAAACC CTTCAGTATT CAAAGGGAAA AACAATCATG
GTAAACCTCC AAAAATTTGA CAAACCTGAA AAATTTGTAG AGACAAAAGA AAAGATCTAT
GTACTTGTAG ATGAGGCACA CCGGTCTCAA TACAAGTGGA CTGCTGGTTA TATGCGTAAG
GCCATGCCTA ACGCTGTATT TTTTGCATTT ACTGGTACTC CTCTCGATAG AGAAAATAAA
AACACATACC GAAGATTTGG ACCATTAATT GACAAGTATT CCTTTACAGA ATCTAAAGAA
GATGGCGCAA CAGTCAAAGT AGAACATATG GGATTGCTTC CAGAGATTGA AATTGAGGGT
GGTAACTCAC TTAACAATAT TTTTGATAAT TTATTTGGTC ATTTACCAAA AGCCCAACAA
GCGGAGATTA GAAGAAAATA TGCAACAAAA AAGAAGATTG CATCATCTCC TGCAAGAATA
CGAAAGATTT GTGAAAAAAT TGTAGATCAT TACACAAAAA AGATTCTTCC AAATGGATAC
AAGGCAATGA TTGTTGCACC AACTAGAGCA GCAGCAGTAA CATACAAACG AGAACTAGAA
GATCTTACAA AACTTCCTGC CAGAATTATT ATGGATTCTA AAAAAGATGA AGTTGGTCCT
GATGAATTCA GTTGGGCAGA TTATTATCTT CCACAAAATG AAGCATTGAA AAAAGCAGAG
GAATTTACAA ATCCTGATGA TCCCACAAAA TTTTTGATTG TTGTAGATAT GTTGCTTGTT
GGGTATGATG CACCAATTGT TCAAGTGATG TATCTTGATA GACCTCTACG AGAACATGCA
TTGCTTCAAG CAATAGCTCG AGTAAATAGA ATTTATGATG AACACAAGGA TAGGGGATTC
ATTATCGACT TTTGGGGAGT TACAAGAGAC TTGCAAAATG CACTAAAGAT GTTTGAAGAA
CAAGATGTTC AAGGTGCATT GGATAATGTT GATGATGATT TGTTAGAGTT AGACGTACGA
CACAAAGATT TCATGGAAAA AATCCAAGGA ATAGATTCCA AAGATCATAA CGAGATTGCA
AAAAGATTCG AAGATGAAGA TGAACAAGAG GAATTTGAAT ATGCGTTCAA GCGTTTTGCA
AAGGCACTAG AAGCTGTTCT TCCAGACAAA GAAGCAGTCC CCTATGAAAA AGACTTGAAG
AAAGGGTATG AGATTAGGGG TAAAATCCGT GCCTGGTTTT ATGGTGACAG GGCTGATCTA
AGCGAATATG GCAAGAAAGT ACAAAAGATA ATCGACAAGC ATATTCGAGC TATAGGAATT
TCAGAAATTT CTGGTTCCAA AGAAATTACA TATGACAACT TTTTGGGATT TATTGCAAAA
TTCAAAGGCG ATAACAGGGC AAAAGCAGCT CTGATAAAGA AAAGAGCAAC ACTCGTAATT
CGTGAAATGA GTCCTGACAA CCCTGCATTT TATCAAAAAC TCAAAGAAAG ACTGGAAAAA
TTAATCCAAG ACGAAAAGGA ACGAAGAGTA GATGATGCAC AATACTTTGA AGGAATAAGG
GATATCTTTG AAGAGGCACT ATCTGGTCCT GAAAAATTAC AAAAACAAAC AGGAATCAAA
GACCGATTTC AACTTGCCAT TTACATGCTA TTAGAAGAAA AACATTCAGA TTTAAAACAA
AACAAAAAAT ATTCTGAAGA GATATTCAAG AAAGTCAAAA AGGCTGCATC TGTAAAAGAC
TGGAGAGACA AGGAACCGCA AGAAAACGAA ATTGAATTGG CTGTAATTGA TACACTTGAT
AAAAAAATAT TTGATGAAAA AACTCGAGAC AAATTAGCTT CTGAAGTATA CAAAATGGCA
GTGAATAACA ATGAATGGTA A
 
Protein sequence
MNFRIEDNEE GRQVELPALE LITALGYTYI PNYELNIPKE RPDHRQVLLY PRLRAAIKRL 
NDFDDDGIEQ AIAQIHEDNF PIGMPMIDAN EKIRIKLTGR AGENAIAQPI TIKQYGKKGI
EYPTVKFFDF DPKKIGTKED KNEYLVTNQF KHLGNRTEIE CDIVIFVNGI PLVLIECKKP
TTIDLMKTVW KENLEKYQRD GSHSLGHEKL FFFNHVIMAT SAIQARYGTL KALPNKYAKW
TSLTNITTKE LEKLVGRKPT PQDIMLAGMI KKETLLDMLK NFVLYEIEEH KKIKKVAKHQ
QYRVVTKSVD RISHHKKVED KGGVIWHTQG SGKSLSMVWF ATQLYYKFSR PTIMVITDRR
QLNKQIFDTF RNCGFPEPEK PRNRSQLAKT LQYSKGKTIM VNLQKFDKPE KFVETKEKIY
VLVDEAHRSQ YKWTAGYMRK AMPNAVFFAF TGTPLDRENK NTYRRFGPLI DKYSFTESKE
DGATVKVEHM GLLPEIEIEG GNSLNNIFDN LFGHLPKAQQ AEIRRKYATK KKIASSPARI
RKICEKIVDH YTKKILPNGY KAMIVAPTRA AAVTYKRELE DLTKLPARII MDSKKDEVGP
DEFSWADYYL PQNEALKKAE EFTNPDDPTK FLIVVDMLLV GYDAPIVQVM YLDRPLREHA
LLQAIARVNR IYDEHKDRGF IIDFWGVTRD LQNALKMFEE QDVQGALDNV DDDLLELDVR
HKDFMEKIQG IDSKDHNEIA KRFEDEDEQE EFEYAFKRFA KALEAVLPDK EAVPYEKDLK
KGYEIRGKIR AWFYGDRADL SEYGKKVQKI IDKHIRAIGI SEISGSKEIT YDNFLGFIAK
FKGDNRAKAA LIKKRATLVI REMSPDNPAF YQKLKERLEK LIQDEKERRV DDAQYFEGIR
DIFEEALSGP EKLQKQTGIK DRFQLAIYML LEEKHSDLKQ NKKYSEEIFK KVKKAASVKD
WRDKEPQENE IELAVIDTLD KKIFDEKTRD KLASEVYKMA VNNNEW