Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0608 |
Symbol | |
ID | 5773842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 544537 |
End bp | 547557 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641316243 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001581942 |
Protein GI | 161528116 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTTA GAATAGAGGA TAATGAGGAA GGGCGACAAG TTGAACTTCC TGCATTAGAG CTAATCACTG CATTAGGCTA CACATACATT CCAAATTATG AGCTAAACAT TCCTAAAGAA AGGCCTGATC ACAGGCAAGT ACTGCTATAT CCTAGGCTCA GGGCTGCAAT AAAACGACTA AACGACTTTG ATGATGATGG AATAGAACAG GCAATCGCCC AGATTCATGA AGACAACTTT CCAATTGGAA TGCCGATGAT TGATGCAAAT GAAAAAATTC GAATCAAACT AACCGGTCGT GCAGGAGAAA ACGCAATTGC ACAACCCATC ACCATAAAAC AGTATGGAAA GAAAGGAATT GAATATCCTA CTGTAAAATT CTTTGATTTT GATCCTAAAA AAATTGGAAC TAAAGAAGAC AAAAACGAGT ATCTTGTCAC CAACCAATTC AAACATCTTG GCAATCGAAC TGAAATTGAA TGCGACATTG TTATTTTTGT AAATGGTATT CCCCTAGTCC TAATTGAATG CAAAAAACCA ACAACTATTG ATCTTATGAA GACAGTATGG AAAGAAAATC TGGAAAAATA TCAAAGAGAT GGCTCTCACA GTTTGGGTCA TGAAAAATTA TTTTTCTTTA ATCACGTAAT CATGGCAACT AGCGCTATTC AAGCAAGATA TGGAACATTA AAGGCACTGC CAAACAAGTA TGCAAAATGG ACTAGTCTTA CAAACATCAC GACAAAGGAA CTAGAAAAGT TGGTTGGAAG AAAACCCACA CCACAAGACA TTATGCTTGC TGGAATGATA AAAAAAGAAA CACTGCTTGA TATGCTCAAA AATTTTGTTC TGTATGAAAT TGAAGAACAT AAAAAAATAA AAAAAGTTGC AAAACATCAA CAATACCGTG TAGTTACAAA GTCTGTAGAC AGAATAAGTC ATCATAAAAA AGTAGAAGAC AAGGGTGGTG TCATTTGGCA CACTCAGGGT TCAGGAAAAT CTCTTTCAAT GGTATGGTTT GCAACTCAAC TCTATTACAA ATTCAGCAGA CCGACAATCA TGGTAATTAC TGATAGGAGG CAACTTAACA AGCAGATTTT TGACACCTTT AGAAATTGTG GATTCCCAGA GCCTGAAAAA CCACGAAATC GAAGTCAATT AGCAAAAACC CTTCAGTATT CAAAGGGAAA AACAATCATG GTAAACCTCC AAAAATTTGA CAAACCTGAA AAATTTGTAG AGACAAAAGA AAAGATCTAT GTACTTGTAG ATGAGGCACA CCGGTCTCAA TACAAGTGGA CTGCTGGTTA TATGCGTAAG GCCATGCCTA ACGCTGTATT TTTTGCATTT ACTGGTACTC CTCTCGATAG AGAAAATAAA AACACATACC GAAGATTTGG ACCATTAATT GACAAGTATT CCTTTACAGA ATCTAAAGAA GATGGCGCAA CAGTCAAAGT AGAACATATG GGATTGCTTC CAGAGATTGA AATTGAGGGT GGTAACTCAC TTAACAATAT TTTTGATAAT TTATTTGGTC ATTTACCAAA AGCCCAACAA GCGGAGATTA GAAGAAAATA TGCAACAAAA AAGAAGATTG CATCATCTCC TGCAAGAATA CGAAAGATTT GTGAAAAAAT TGTAGATCAT TACACAAAAA AGATTCTTCC AAATGGATAC AAGGCAATGA TTGTTGCACC AACTAGAGCA GCAGCAGTAA CATACAAACG AGAACTAGAA GATCTTACAA AACTTCCTGC CAGAATTATT ATGGATTCTA AAAAAGATGA AGTTGGTCCT GATGAATTCA GTTGGGCAGA TTATTATCTT CCACAAAATG AAGCATTGAA AAAAGCAGAG GAATTTACAA ATCCTGATGA TCCCACAAAA TTTTTGATTG TTGTAGATAT GTTGCTTGTT GGGTATGATG CACCAATTGT TCAAGTGATG TATCTTGATA GACCTCTACG AGAACATGCA TTGCTTCAAG CAATAGCTCG AGTAAATAGA ATTTATGATG AACACAAGGA TAGGGGATTC ATTATCGACT TTTGGGGAGT TACAAGAGAC TTGCAAAATG CACTAAAGAT GTTTGAAGAA CAAGATGTTC AAGGTGCATT GGATAATGTT GATGATGATT TGTTAGAGTT AGACGTACGA CACAAAGATT TCATGGAAAA AATCCAAGGA ATAGATTCCA AAGATCATAA CGAGATTGCA AAAAGATTCG AAGATGAAGA TGAACAAGAG GAATTTGAAT ATGCGTTCAA GCGTTTTGCA AAGGCACTAG AAGCTGTTCT TCCAGACAAA GAAGCAGTCC CCTATGAAAA AGACTTGAAG AAAGGGTATG AGATTAGGGG TAAAATCCGT GCCTGGTTTT ATGGTGACAG GGCTGATCTA AGCGAATATG GCAAGAAAGT ACAAAAGATA ATCGACAAGC ATATTCGAGC TATAGGAATT TCAGAAATTT CTGGTTCCAA AGAAATTACA TATGACAACT TTTTGGGATT TATTGCAAAA TTCAAAGGCG ATAACAGGGC AAAAGCAGCT CTGATAAAGA AAAGAGCAAC ACTCGTAATT CGTGAAATGA GTCCTGACAA CCCTGCATTT TATCAAAAAC TCAAAGAAAG ACTGGAAAAA TTAATCCAAG ACGAAAAGGA ACGAAGAGTA GATGATGCAC AATACTTTGA AGGAATAAGG GATATCTTTG AAGAGGCACT ATCTGGTCCT GAAAAATTAC AAAAACAAAC AGGAATCAAA GACCGATTTC AACTTGCCAT TTACATGCTA TTAGAAGAAA AACATTCAGA TTTAAAACAA AACAAAAAAT ATTCTGAAGA GATATTCAAG AAAGTCAAAA AGGCTGCATC TGTAAAAGAC TGGAGAGACA AGGAACCGCA AGAAAACGAA ATTGAATTGG CTGTAATTGA TACACTTGAT AAAAAAATAT TTGATGAAAA AACTCGAGAC AAATTAGCTT CTGAAGTATA CAAAATGGCA GTGAATAACA ATGAATGGTA A
|
Protein sequence | MNFRIEDNEE GRQVELPALE LITALGYTYI PNYELNIPKE RPDHRQVLLY PRLRAAIKRL NDFDDDGIEQ AIAQIHEDNF PIGMPMIDAN EKIRIKLTGR AGENAIAQPI TIKQYGKKGI EYPTVKFFDF DPKKIGTKED KNEYLVTNQF KHLGNRTEIE CDIVIFVNGI PLVLIECKKP TTIDLMKTVW KENLEKYQRD GSHSLGHEKL FFFNHVIMAT SAIQARYGTL KALPNKYAKW TSLTNITTKE LEKLVGRKPT PQDIMLAGMI KKETLLDMLK NFVLYEIEEH KKIKKVAKHQ QYRVVTKSVD RISHHKKVED KGGVIWHTQG SGKSLSMVWF ATQLYYKFSR PTIMVITDRR QLNKQIFDTF RNCGFPEPEK PRNRSQLAKT LQYSKGKTIM VNLQKFDKPE KFVETKEKIY VLVDEAHRSQ YKWTAGYMRK AMPNAVFFAF TGTPLDRENK NTYRRFGPLI DKYSFTESKE DGATVKVEHM GLLPEIEIEG GNSLNNIFDN LFGHLPKAQQ AEIRRKYATK KKIASSPARI RKICEKIVDH YTKKILPNGY KAMIVAPTRA AAVTYKRELE DLTKLPARII MDSKKDEVGP DEFSWADYYL PQNEALKKAE EFTNPDDPTK FLIVVDMLLV GYDAPIVQVM YLDRPLREHA LLQAIARVNR IYDEHKDRGF IIDFWGVTRD LQNALKMFEE QDVQGALDNV DDDLLELDVR HKDFMEKIQG IDSKDHNEIA KRFEDEDEQE EFEYAFKRFA KALEAVLPDK EAVPYEKDLK KGYEIRGKIR AWFYGDRADL SEYGKKVQKI IDKHIRAIGI SEISGSKEIT YDNFLGFIAK FKGDNRAKAA LIKKRATLVI REMSPDNPAF YQKLKERLEK LIQDEKERRV DDAQYFEGIR DIFEEALSGP EKLQKQTGIK DRFQLAIYML LEEKHSDLKQ NKKYSEEIFK KVKKAASVKD WRDKEPQENE IELAVIDTLD KKIFDEKTRD KLASEVYKMA VNNNEW
|
| |