Gene Noc_1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1804 
Symbol 
ID3705321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2039740 
End bp2042946 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table11 
GC content57% 
IMG OID637738287 
ProductType I site-specific deoxyribonuclease HsdR 
Protein accessionYP_343804 
Protein GI77165279 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAG AGCAACTGGA ACAACAATGC CTGGCCTGGT TTGCCGAGGG CGGTTGGGAA 
CTGGCCCACG GCTCCGATCT GGCGCCTGGG CGTGCTGATT ACCGCCAAGT ATGGTTACTG
GCCGATCTGG AAGCGGCCAT TCGCCGCATC AACCCCCACT TGCCGGAAAG CTGTATCGAG
CAGGTAGTGG CGGTGGTTGG TAAGCCCGAA AGCCTGGATA CCGTGGTCAG CAATCGAGCC
TTTCACCGGT TACTGCTGGA AGGGGTGCCG GTTGAATACA AGACCCTATC CTCCCTCTCC
CCCTGGGAGA GGGCCGGGGT GAGGGAAAGG GAAGAGAAAA TAGTCCACGA CCGGGCGTTG
CTGATCGATT TCGATGATCT GAACGCCAAC CGCTTCCGGG CCATCAATCA GTTCACCCTC
TTGGGGAGCA AGCAACTGCG CCGCCCGGAT ATTATTTGCT TTATCAATGG CCTGCCCTTG
GCGGTGCTGG AGCTGAAAAG CCCCCATGCC GAGAATGTGG ACATCTGGGA TGCCTTCCAT
CAGCTTCAGA CTTACAAGGA CGAAATCCCC GAGCTGTTCG TCTTTAACGA GGCGCTGGTA
ATCAGCGACG GCTACCATGC CCGGGTGGGT TCGCTTACGG CCAACCAGGA GCGCTTTATG
CCCTGGCGCA CTCTCAAGCA CGAGGACGAC AAGCCCCTGC TGGACTGGCA GTTGGAAACC
CTGGTGCGGG GTTTCTTCGA TCGGGAATTG TTCCTGGATT ACCTTCGTTA TTTCGTCATT
TTCGAGACGG ATTCCGGTCG CCTGAGCAAG AAGATTGCCG GTTATCACCA GTTCCACGCG
GTGCGGGAAG CGGTGAAGGC CACCGTGATT GCCGCCCAGG AGCCCAGGCA GCGCTGGGCC
GGTGAAAAGC GCGCCACCTA CGCCGATGAC CTGGTGCCGG GCAGCAAAAA GGCCGGCGTG
GTCTGGCACA CCCAGGGGTC CGGCAAGAGT CTTTCCATGT GCTGCTACGC GGGCAAGCTG
CTGCAACAGC CCGAGATGAA CAACCCGACC CTGATGGTGG TCACCGACCG CAACGATCTG
GACGGCCAAC TCTTCGCCAC CTTCAGCGCC GCCAAGGAAC TGCTGAAGCA GGAACCGGTG
CAGGCGGAAG ACCGGGATAC CCTGCGCCGC TTGCTGGCCG AGCGGGCATC CGGTGGCATT
ATCTTCACCA CGGTGCAGAA ATTCGCCCTG CTGGATGGGG AGAACGATCA TCCCATTCTC
AACGACCGCC ATAATATCGT GGTGATTTCC GACGAGGCTC ACCGCAGTCA GTACGGCCTT
AAGGCCACCC TGAAGAAGGA TGGCCGCTAC ACCTTCGGCT ACGCCAAGCA CATGCGCGAT
GCCCTGCCCA ATGCCTCCTT TATCGGTTTT ACCGGTACCC CCATTGCCAA TGAAGATAAG
GATACCCGCG CCGTGTTCGG CGATTATGTG TCCATCTATG ACATTCAGGA TGCGGTGGAC
GATGGGGCTA CCGTGCCCAT CTATTACGAA TCCCGGCTGG CCAAATTGGA TATCAACCGG
GAGCTGATTG AGAAATTATC CGACCAAGTG GAAGCAGTGG TGGAGGATGA GGAAGACCTC
GGCCAGCGGG AAAAAACCAA GGGCGAGTGG AGCCGCCTGG AAAAGCTGGT GGGGTCTGGG
CCGCGGCTTA AGCAGGTGGC TGCCGATCTG GTGCGGCACT TTGAAATCCG CTCTCAGTCC
ATGGACGGTA AAGCCATGAT CGTGGCCATG AGCCGGGAGA TTTGCGTGCA TCTGTATAAT
GAGATTGTCG CCCTGCGCCC GGACTGGCAC GACCCGGACC CGGAGAAAGG GGCCATCAAG
ATTGTGATGA CTGGCTCCGC CTCTGACAGG CCCTTGTTGC AACCGCACCT TTACAACCAG
CAGACCAAGA AACGACTGGA GAAGCGCTTC AAGGACATCT ATGATCCCCT CAAGCTGGTG
ATTGTGCGGG ATATGTGGCT CACCGGCTTT GACGCCCCTT GCTGCCATAC CATGTATGTG
GACAAGCCCA TGAAAGGCCA TAACCTGATG CAGGCCATTG CCCGCGTCAA CCGGGTGTTC
AAGAACAAGC CCGGCGGGCT GGTGGTGGAC TATATCGGTA TCGCCAATGC GCTCAAGCAA
GCCCTGAAAA CCTATACCGA CGCCAAGGGC AAGGGCGAGC CGACCCACAG CGCGGAAGAA
GCCTTTGCCG TGCTGCTGGA GAAGCTGGAC ATTATCCACG GGCTGTTTGC CAAGACACCC
CAAAATGCTG GCTTTGATTA CAGCAGCTTT GAGCATGAGG CGACCCGATT GCTGATTCCC
ACCGCCAACT ATATATTGAG CCTTGAGGGC GGTAAGAAGC GTTTCCTCGA TACGATTCTT
GCTGTGAATA TGGCCTACTC TTTGTGTGGC ACCCTGGAGG AGGCCCGGGC CTATCATAAG
GAGGTCGCTT TCCTATCGGC GGTGAAGGCT GCCCTTACCA AGCACACCCG CGTGGACAAG
AAATTGACCC AGGAGGAAAA AAATTCCGCC CTCAAGCAGA TCCTGGACAA TGCCCTGGTG
GCGGAAGGCG TGACCGACGT GTTTGCGTTG TGCGGATTAG ATAAACCTAA CATCGGCCTG
CTCTCGGAGG AATTCCTCGA AGACGTGCGG CGGATGCCTT ACAAGAATTT CGCCGTGGAG
CTACTGGAAA AGCTGCTGAA AGACAACATC AAGGCCAAAA CCCGCAATAA CGTGGTGCAG
GAGAAGAAAT ACGCTGATCG GCTGCAAGAG ACCCTGCGCC AATACAACAA CCGGGGCATT
GAAACCGCCC AGGTGATAGA AGAGCTGATC GCCATGGCCA AGCAATTCCA GGCGGAACTG
GAGCGCGACG AAGCCCTGGG CCTGAACCCG GATGAAGTAG CCTTCTACGA TGCCCTGGCC
AACAATGAGA GTGCGGTGCG GGAGTTGGGT GATGAGACGC TGAAGAAAAT CGCCGTGGAA
ATCACTGACA AGCTGCGCAG GTCCACTACC GTGGACTGGC AGGTGCGGGA AAGCATCAGG
GCAAAATTGC GGATTCTGGT GCGCCGAACG TTGCAACGGT ACAAATATCC GCCGGACAAG
GCCCCGGAAG CGGTAGAGCT GATTTTGCAG CAAGCCGAGG TACTATCGGA TGAAAAGCGC
AACGCGCTAA CAAGAAAAAA CGGGTAA
 
Protein sequence
MTEEQLEQQC LAWFAEGGWE LAHGSDLAPG RADYRQVWLL ADLEAAIRRI NPHLPESCIE 
QVVAVVGKPE SLDTVVSNRA FHRLLLEGVP VEYKTLSSLS PWERAGVRER EEKIVHDRAL
LIDFDDLNAN RFRAINQFTL LGSKQLRRPD IICFINGLPL AVLELKSPHA ENVDIWDAFH
QLQTYKDEIP ELFVFNEALV ISDGYHARVG SLTANQERFM PWRTLKHEDD KPLLDWQLET
LVRGFFDREL FLDYLRYFVI FETDSGRLSK KIAGYHQFHA VREAVKATVI AAQEPRQRWA
GEKRATYADD LVPGSKKAGV VWHTQGSGKS LSMCCYAGKL LQQPEMNNPT LMVVTDRNDL
DGQLFATFSA AKELLKQEPV QAEDRDTLRR LLAERASGGI IFTTVQKFAL LDGENDHPIL
NDRHNIVVIS DEAHRSQYGL KATLKKDGRY TFGYAKHMRD ALPNASFIGF TGTPIANEDK
DTRAVFGDYV SIYDIQDAVD DGATVPIYYE SRLAKLDINR ELIEKLSDQV EAVVEDEEDL
GQREKTKGEW SRLEKLVGSG PRLKQVAADL VRHFEIRSQS MDGKAMIVAM SREICVHLYN
EIVALRPDWH DPDPEKGAIK IVMTGSASDR PLLQPHLYNQ QTKKRLEKRF KDIYDPLKLV
IVRDMWLTGF DAPCCHTMYV DKPMKGHNLM QAIARVNRVF KNKPGGLVVD YIGIANALKQ
ALKTYTDAKG KGEPTHSAEE AFAVLLEKLD IIHGLFAKTP QNAGFDYSSF EHEATRLLIP
TANYILSLEG GKKRFLDTIL AVNMAYSLCG TLEEARAYHK EVAFLSAVKA ALTKHTRVDK
KLTQEEKNSA LKQILDNALV AEGVTDVFAL CGLDKPNIGL LSEEFLEDVR RMPYKNFAVE
LLEKLLKDNI KAKTRNNVVQ EKKYADRLQE TLRQYNNRGI ETAQVIEELI AMAKQFQAEL
ERDEALGLNP DEVAFYDALA NNESAVRELG DETLKKIAVE ITDKLRRSTT VDWQVRESIR
AKLRILVRRT LQRYKYPPDK APEAVELILQ QAEVLSDEKR NALTRKNG