Gene Information |
Locus tag | Noc_2909 |
Symbol | |
ID | 3707426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3287993 |
End bp | 3291121 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637739386 |
Product | Type I site-specific deoxyribonuclease HsdR |
Protein accession | YP_344885 |
Protein GI | 77166360 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
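The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminating stop codon; the record does not state either convention, but the listed values are consistent with both. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3287993
end_bp = 3291121
gene_length_bp = 3129
protein_length_aa = 1042

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted as a residue.
codons = span // 3
residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("residues match protein length:", residues == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 3,129 bp is 1,043 codons, of which 1,042 encode amino acids.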
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGTCC TACCGGCAGG TTTTATATCC GTGCGTACAA CTACGGAAAA TTTCGGGAGA AGATTAGGCG TGTTTCAGGA CGAACTCGAT AAAGTAGAAA CCCCCGCCAT TGCCCAGTTG CAACGGCTAG GCTGGCGCTA TGTCCGTGGC GTTGAACTGT CACCGGAGGC TGCGGGCGCG GAACGGGCCT ACTACCGGGA CGTGGTGTTG GTTGGCCGTC TGGAAGGGGC TATCCGGCGC ATCAACCCTT GGCTCAGTGA GGAAAACCTG CGTAAGGTAG CGCGGGAAAT CACCCACCCC AACCATGTGG GTTTGATGGA ATACAACCAT GCCATTTACC AGATGTTGGT CAATTACCTG TCTATCGAAC AGGACTTGGG CAAGGGACGC AAGGGGCAGA CGGTCAAAAT TATTGATTTT GAAAATCCCG GTAACAATGA ATTCTTGTGC GTTAACCAGT TCAAGGTTGA AGGACTCAAT CAGAATATCA TTCCCGATAT CGTCTGCTTT GTGAATGGTT TGCCGTTGGC GGTGATTGAA TGCAAATCCC CTTACGTGGC GGATGCCATT AGCGAAGGTA TTAAGCAACT TCGCCGCTAT GCCAACCTTC GCTATCCGGA AACCGATGAA GGGGCGCAAA AGCTATTCTG GTACAACCAG CTAATGATCA CCACCTGCCG GGATCAAGCC AAGGTGGGCA CCATCAGTTC CAGTGCCCAG CATTACGGGG AATGGAAAGA CGCCTACCCC TTTACCGATG GGGACATCCG TGCACATCCC TTCAGCCCCG GTGGCAAATA CGAGGTACGG GAAATGGCGC CACCGCTCTG GTATGCCGGT GAGTTTGAAC AGGCCAGCCC CGTCACGCCC CAACAGCGCC TGCTGGCGGG TATGTTAGAT CCCGGCAACT TTCTCGACCT GCTCCAAAAC TTCACCATCT TCGAAGCCGT TGAGGGTCGC CTGGTCAAGA AGGTGGCCCG CTATCAGCAA TACCGCGCCA TGAACAAGGT CATCAAACGC CTTAAAAGCG GCACGGATCG TAAGGAAAAG TCCGGGGTGG TGTGGCATAC CCAGGGGTCG GGCAAGTCCC TGACCATGGT GATGCTGGCG GTGAAGATGC GCCGTGATCC GGCGTTACAG CAATACAAGC TGGTGTTCGT CACCGACCGC ACCCAACTGG ATACCCAGTT GTCCAATACC TTTCGCGGCG CCCAGAACGA AACGGTCTAC AACGCGGGCT CCGTGGCGGA GTTGAAAACC CTGTTGAGCC GCGATTCGTC CGATATCGTC ACCGCCACGG TGCAGAAATT CCAGGACGCG GAAGCGGCAG GCGGCTTCAA AGACCTGAAC CCCAGCGGCA AGATCATTGT GCTGGCGGAC GAGGCCCACC GGACCCAGTT TGGCGGCTTG GCCATGACTA TCAATGCCGC CTTGCCCAAG GCCCCCAAGA TTGGTTTTAC CGGCACACCG CTATTGAAGA CCCAAAAAAT GGATCAAGCC TTCGGGGGCT ATATTGACCA GTACAAGATC AACGAAGCCG TGGAAGACGG CGCCACAGTG CGCATCATCT ACGAAGGTCG ACAGGTGCAA AGCGATGTGG TGGGCGATTC ATTGGATGCC CTGTTTGAAG CATACTTCCA AGGGTGCAGT GATGAAGAAA AACGGGCAAT CAAACAAAAA TATGGGGTAG AATGGGCGGT ACGGGAGGCC CCGGCAAGGA TTCGCTGGGT CTGTATTGAT CTGCTGAAGC ACTACCGTGA ACACATCCAG CCCAACGGTT TCAAGGCCAT GATTGTGGTG 
GGCAGTCGCC ATGCCGCCAC GGTATTCAAG CAAACCCTGG ATGAGCTGGA CGCGCCGCCG TCGGAGGTGA TTATTTCCGG CAAACACAAT GACCCGGCAA CCCTTGCCCA GTACACAGAC CGGGTCCACC AGAAGCAGGC GATTCAAAAC TTTACTAAGC CCCTGGGGGA AGACCCCACC GCCTTTCTCA TCGTCAAGGA CATGCTGCTC ACCGGCTTCG ATGCGCCGAT AGCGCAGGTC ATGTACATGG ATCGCAGCCT GAAAGATCAC GCGCTGATGC AAGCCATTGC CCGGGTGAAC CGTACCTGCA AGGGGAAGCA GGCGGGGTTT ATCGTAGATT ACCATGGCTT GTCTGATGAC CTGACCGAAG CCCTCAACCA GTTCAGCAGC GAAGATGTGC AAGGCACCTA CCATACGCTG AAGGACGAAA TACCCAAGCT GAAAGCTGCT CATACCCGTG TGGCCGCCAT CTTTGCCGGG GTGAAAGGCG CGGATGTGGA TGATTATGTG CTGCGCCTGA AGGATGAAGA CACCCGCCAG CAATTTGAAC GGGGCTTCAA ACGCTTCGCC AAGCAAATGG ACGTGATACT GCCGGATGTG GCTGCCAAGC CCTATGTGCC AGACCTGAAG TTCTGGGGCA AGGTACAGAA CGCTGCTCGT AACCGCTACC GCGACCCTGG TTTGAATATT CTCGACGCCG GTGAAAAGGT GCGCAAGCTG GTGGAAGAGC ACATCATCAG CACCGGCGTA GACCCCAAGA TACCACCGGT TGATCTGATG GCGGCAAATT TCAGGGAATC GGTAGAGCAG ATCAAGTCGC CGGAATCCCG TGCCTCTGAA ATTGAAAGCG CCATCAAGCA CCATCTTATC GTTAACCTTG AGGAAGACCC CGAGTTCTAT AAGTCGCTGA GCCTGCGTCT ACGGGAGATC ATCGAGAAAA CCAATGGCAA ATGGGAGCAG CAATTGGAAT TGCTCCTTCA GATGGTCGAT AACATCGAAA CCGAACATAA GCAGGCAGCG GATGAGGTTG GTCTCACCAA AACGGAATTC GCCTTCTATA ATATTCTCAT GGCTGAGGTC ACTCGGCATG GTGGTGATGG ATTAGTCGGC GATGAAGTTC ATGAGGATAT CAAAGCAACC AGCCAATTCC TGGTGAAGAC CTTTGATGAG GCAACCCAGA TCGTTGATTT CTTCCACAAG CCCGATGAAG TGAAGCGGAT GAAAAAGGAA ATCAAGCGGG CGATACTGGA TTGTTCCTAC GCGGATAAAG CGCTTGTGAC CGTTGTGCAA GAGCGCTTTA TGGACTTGGC TAAGCGAAAG TTCGGATAA
|
Protein sequence | MVVLPAGFIS VRTTTENFGR RLGVFQDELD KVETPAIAQL QRLGWRYVRG VELSPEAAGA ERAYYRDVVL VGRLEGAIRR INPWLSEENL RKVAREITHP NHVGLMEYNH AIYQMLVNYL SIEQDLGKGR KGQTVKIIDF ENPGNNEFLC VNQFKVEGLN QNIIPDIVCF VNGLPLAVIE CKSPYVADAI SEGIKQLRRY ANLRYPETDE GAQKLFWYNQ LMITTCRDQA KVGTISSSAQ HYGEWKDAYP FTDGDIRAHP FSPGGKYEVR EMAPPLWYAG EFEQASPVTP QQRLLAGMLD PGNFLDLLQN FTIFEAVEGR LVKKVARYQQ YRAMNKVIKR LKSGTDRKEK SGVVWHTQGS GKSLTMVMLA VKMRRDPALQ QYKLVFVTDR TQLDTQLSNT FRGAQNETVY NAGSVAELKT LLSRDSSDIV TATVQKFQDA EAAGGFKDLN PSGKIIVLAD EAHRTQFGGL AMTINAALPK APKIGFTGTP LLKTQKMDQA FGGYIDQYKI NEAVEDGATV RIIYEGRQVQ SDVVGDSLDA LFEAYFQGCS DEEKRAIKQK YGVEWAVREA PARIRWVCID LLKHYREHIQ PNGFKAMIVV GSRHAATVFK QTLDELDAPP SEVIISGKHN DPATLAQYTD RVHQKQAIQN FTKPLGEDPT AFLIVKDMLL TGFDAPIAQV MYMDRSLKDH ALMQAIARVN RTCKGKQAGF IVDYHGLSDD LTEALNQFSS EDVQGTYHTL KDEIPKLKAA HTRVAAIFAG VKGADVDDYV LRLKDEDTRQ QFERGFKRFA KQMDVILPDV AAKPYVPDLK FWGKVQNAAR NRYRDPGLNI LDAGEKVRKL VEEHIISTGV DPKIPPVDLM AANFRESVEQ IKSPESRASE IESAIKHHLI VNLEEDPEFY KSLSLRLREI IEKTNGKWEQ QLELLLQMVD NIETEHKQAA DEVGLTKTEF AFYNILMAEV TRHGGDGLVG DEVHEDIKAT SQFLVKTFDE ATQIVDFFHK PDEVKRMKKE IKRAILDCSY ADKALVTVVQ ERFMDLAKRK FG
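The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before their length or base composition can be compared with the record's fields. The following is a minimal sketch of that clean-up, not the database's own method; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two blocks of the gene sequence shown above (illustrative input only).
example = "ATGGTGGTCC TACCGGCAGG"
dna = clean_sequence(example)
print(len(dna), gc_percent(dna))
```

Applied to the full gene sequence, the same two helpers give values that can be compared against the "Gene Length" and "GC content" fields.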