Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1804 |
Symbol | |
ID | 3705321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2039740 |
End bp | 2042946 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637738287 |
Product | Type I site-specific deoxyribonuclease HsdR |
Protein accession | YP_343804 |
Protein GI | 77165279 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAG AGCAACTGGA ACAACAATGC CTGGCCTGGT TTGCCGAGGG CGGTTGGGAA CTGGCCCACG GCTCCGATCT GGCGCCTGGG CGTGCTGATT ACCGCCAAGT ATGGTTACTG GCCGATCTGG AAGCGGCCAT TCGCCGCATC AACCCCCACT TGCCGGAAAG CTGTATCGAG CAGGTAGTGG CGGTGGTTGG TAAGCCCGAA AGCCTGGATA CCGTGGTCAG CAATCGAGCC TTTCACCGGT TACTGCTGGA AGGGGTGCCG GTTGAATACA AGACCCTATC CTCCCTCTCC CCCTGGGAGA GGGCCGGGGT GAGGGAAAGG GAAGAGAAAA TAGTCCACGA CCGGGCGTTG CTGATCGATT TCGATGATCT GAACGCCAAC CGCTTCCGGG CCATCAATCA GTTCACCCTC TTGGGGAGCA AGCAACTGCG CCGCCCGGAT ATTATTTGCT TTATCAATGG CCTGCCCTTG GCGGTGCTGG AGCTGAAAAG CCCCCATGCC GAGAATGTGG ACATCTGGGA TGCCTTCCAT CAGCTTCAGA CTTACAAGGA CGAAATCCCC GAGCTGTTCG TCTTTAACGA GGCGCTGGTA ATCAGCGACG GCTACCATGC CCGGGTGGGT TCGCTTACGG CCAACCAGGA GCGCTTTATG CCCTGGCGCA CTCTCAAGCA CGAGGACGAC AAGCCCCTGC TGGACTGGCA GTTGGAAACC CTGGTGCGGG GTTTCTTCGA TCGGGAATTG TTCCTGGATT ACCTTCGTTA TTTCGTCATT TTCGAGACGG ATTCCGGTCG CCTGAGCAAG AAGATTGCCG GTTATCACCA GTTCCACGCG GTGCGGGAAG CGGTGAAGGC CACCGTGATT GCCGCCCAGG AGCCCAGGCA GCGCTGGGCC GGTGAAAAGC GCGCCACCTA CGCCGATGAC CTGGTGCCGG GCAGCAAAAA GGCCGGCGTG GTCTGGCACA CCCAGGGGTC CGGCAAGAGT CTTTCCATGT GCTGCTACGC GGGCAAGCTG CTGCAACAGC CCGAGATGAA CAACCCGACC CTGATGGTGG TCACCGACCG CAACGATCTG GACGGCCAAC TCTTCGCCAC CTTCAGCGCC GCCAAGGAAC TGCTGAAGCA GGAACCGGTG CAGGCGGAAG ACCGGGATAC CCTGCGCCGC TTGCTGGCCG AGCGGGCATC CGGTGGCATT ATCTTCACCA CGGTGCAGAA ATTCGCCCTG CTGGATGGGG AGAACGATCA TCCCATTCTC AACGACCGCC ATAATATCGT GGTGATTTCC GACGAGGCTC ACCGCAGTCA GTACGGCCTT AAGGCCACCC TGAAGAAGGA TGGCCGCTAC ACCTTCGGCT ACGCCAAGCA CATGCGCGAT GCCCTGCCCA ATGCCTCCTT TATCGGTTTT ACCGGTACCC CCATTGCCAA TGAAGATAAG GATACCCGCG CCGTGTTCGG CGATTATGTG TCCATCTATG ACATTCAGGA TGCGGTGGAC GATGGGGCTA CCGTGCCCAT CTATTACGAA TCCCGGCTGG CCAAATTGGA TATCAACCGG GAGCTGATTG AGAAATTATC CGACCAAGTG GAAGCAGTGG TGGAGGATGA GGAAGACCTC GGCCAGCGGG AAAAAACCAA GGGCGAGTGG AGCCGCCTGG AAAAGCTGGT GGGGTCTGGG CCGCGGCTTA AGCAGGTGGC TGCCGATCTG GTGCGGCACT TTGAAATCCG CTCTCAGTCC ATGGACGGTA AAGCCATGAT CGTGGCCATG AGCCGGGAGA TTTGCGTGCA TCTGTATAAT GAGATTGTCG CCCTGCGCCC GGACTGGCAC GACCCGGACC CGGAGAAAGG GGCCATCAAG ATTGTGATGA CTGGCTCCGC CTCTGACAGG CCCTTGTTGC AACCGCACCT TTACAACCAG CAGACCAAGA AACGACTGGA GAAGCGCTTC AAGGACATCT ATGATCCCCT CAAGCTGGTG ATTGTGCGGG ATATGTGGCT CACCGGCTTT GACGCCCCTT GCTGCCATAC CATGTATGTG GACAAGCCCA TGAAAGGCCA TAACCTGATG CAGGCCATTG CCCGCGTCAA CCGGGTGTTC AAGAACAAGC CCGGCGGGCT GGTGGTGGAC TATATCGGTA TCGCCAATGC GCTCAAGCAA GCCCTGAAAA CCTATACCGA CGCCAAGGGC AAGGGCGAGC CGACCCACAG CGCGGAAGAA GCCTTTGCCG TGCTGCTGGA GAAGCTGGAC ATTATCCACG GGCTGTTTGC CAAGACACCC CAAAATGCTG GCTTTGATTA CAGCAGCTTT GAGCATGAGG CGACCCGATT GCTGATTCCC ACCGCCAACT ATATATTGAG CCTTGAGGGC GGTAAGAAGC GTTTCCTCGA TACGATTCTT GCTGTGAATA TGGCCTACTC TTTGTGTGGC ACCCTGGAGG AGGCCCGGGC CTATCATAAG GAGGTCGCTT TCCTATCGGC GGTGAAGGCT GCCCTTACCA AGCACACCCG CGTGGACAAG AAATTGACCC AGGAGGAAAA AAATTCCGCC CTCAAGCAGA TCCTGGACAA TGCCCTGGTG GCGGAAGGCG TGACCGACGT GTTTGCGTTG TGCGGATTAG ATAAACCTAA CATCGGCCTG CTCTCGGAGG AATTCCTCGA AGACGTGCGG CGGATGCCTT ACAAGAATTT CGCCGTGGAG CTACTGGAAA AGCTGCTGAA AGACAACATC AAGGCCAAAA CCCGCAATAA CGTGGTGCAG GAGAAGAAAT ACGCTGATCG GCTGCAAGAG ACCCTGCGCC AATACAACAA CCGGGGCATT GAAACCGCCC AGGTGATAGA AGAGCTGATC GCCATGGCCA AGCAATTCCA GGCGGAACTG GAGCGCGACG AAGCCCTGGG CCTGAACCCG GATGAAGTAG CCTTCTACGA TGCCCTGGCC AACAATGAGA GTGCGGTGCG GGAGTTGGGT GATGAGACGC TGAAGAAAAT CGCCGTGGAA ATCACTGACA AGCTGCGCAG GTCCACTACC GTGGACTGGC AGGTGCGGGA AAGCATCAGG GCAAAATTGC GGATTCTGGT GCGCCGAACG TTGCAACGGT ACAAATATCC GCCGGACAAG GCCCCGGAAG CGGTAGAGCT GATTTTGCAG CAAGCCGAGG TACTATCGGA TGAAAAGCGC AACGCGCTAA CAAGAAAAAA CGGGTAA
|
Protein sequence | MTEEQLEQQC LAWFAEGGWE LAHGSDLAPG RADYRQVWLL ADLEAAIRRI NPHLPESCIE QVVAVVGKPE SLDTVVSNRA FHRLLLEGVP VEYKTLSSLS PWERAGVRER EEKIVHDRAL LIDFDDLNAN RFRAINQFTL LGSKQLRRPD IICFINGLPL AVLELKSPHA ENVDIWDAFH QLQTYKDEIP ELFVFNEALV ISDGYHARVG SLTANQERFM PWRTLKHEDD KPLLDWQLET LVRGFFDREL FLDYLRYFVI FETDSGRLSK KIAGYHQFHA VREAVKATVI AAQEPRQRWA GEKRATYADD LVPGSKKAGV VWHTQGSGKS LSMCCYAGKL LQQPEMNNPT LMVVTDRNDL DGQLFATFSA AKELLKQEPV QAEDRDTLRR LLAERASGGI IFTTVQKFAL LDGENDHPIL NDRHNIVVIS DEAHRSQYGL KATLKKDGRY TFGYAKHMRD ALPNASFIGF TGTPIANEDK DTRAVFGDYV SIYDIQDAVD DGATVPIYYE SRLAKLDINR ELIEKLSDQV EAVVEDEEDL GQREKTKGEW SRLEKLVGSG PRLKQVAADL VRHFEIRSQS MDGKAMIVAM SREICVHLYN EIVALRPDWH DPDPEKGAIK IVMTGSASDR PLLQPHLYNQ QTKKRLEKRF KDIYDPLKLV IVRDMWLTGF DAPCCHTMYV DKPMKGHNLM QAIARVNRVF KNKPGGLVVD YIGIANALKQ ALKTYTDAKG KGEPTHSAEE AFAVLLEKLD IIHGLFAKTP QNAGFDYSSF EHEATRLLIP TANYILSLEG GKKRFLDTIL AVNMAYSLCG TLEEARAYHK EVAFLSAVKA ALTKHTRVDK KLTQEEKNSA LKQILDNALV AEGVTDVFAL CGLDKPNIGL LSEEFLEDVR RMPYKNFAVE LLEKLLKDNI KAKTRNNVVQ EKKYADRLQE TLRQYNNRGI ETAQVIEELI AMAKQFQAEL ERDEALGLNP DEVAFYDALA NNESAVRELG DETLKKIAVE ITDKLRRSTT VDWQVRESIR AKLRILVRRT LQRYKYPPDK APEAVELILQ QAEVLSDEKR NALTRKNG
|
| |