Gene Noc_0160 details


Gene Information

Locus tag: Noc_0160
Symbol:
ID: 3706193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 173812
End bp: 176919
Gene Length: 3108 bp
Protein Length: 1035 aa
Translation table: 11
GC content: 49%
IMG OID: 637736677
Product: type I site-specific restriction-modification system, R subunit (helicase)
Protein accession: YP_342223
Protein GI: 77163698
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
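
The coordinate and length fields above are mutually consistent. The sketch below checks this arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed numbers, not something the record states). The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 173812
end_bp = 176919

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# One codon per 3 bp; the last codon is the stop codon and adds no residue.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 3108 0 1035
```

The result agrees with the listed gene length (3108 bp) and protein length (1035 aa).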


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGCTA CTGGAGAGTT AGAGCTAAAT CTCATGCCTA CCCAGAATAG CCCGCCGAAT 
ATTGCTATGG ATAAGGTCCA GAAAACGGCG CTTCAACAAG TCATTATTGA TCACCTCCTT
AGTCATGGCT GGCAGAAGGG AAAAGCAGAG CTTTACGATC CCGTCTTGGC CCTGTACCCG
GAAGATCTTA TCGGCTTTAT CCGGGAGTGC CATTCCCAAC CGCTGCCAAA ACTTATCCAG
GACGCTCCCG ATCAAGCCGT CCAAATACTT CTTAGGCGGG CGGCTGAAGA AATGGACCAG
CGCGGAGCCT TAGAGGTGCT ACGGCATGGT TTTAAGGAAC AGGAGAATGA TATTAGCCTC
TGCCAGTTTC ACTCCTACCA GGAATTCCAT TCGGGAACCC TGGCCCGCTA TAAAAAAAAC
CGGCTGCGGG TGGTGCCTAA CCTGTCTTAT TCTCCCTATA CTCACGAAGA CTACATTCCC
TACCTCGATC TGGTATTTTT TGTTAACGGG GTGCCGGTAG CTACTTTAAA AGCGACCTCG
GAATCCCAGC CCTCCCTCTC TGCTATCATC CGTCAATACC AACGGGATTG TCCCCCTCAC
GATCCCCGTA TGGGCCATGA AGAACCGCTG CTGGCCTTTA GAAAACGCGC CCTAGTACAT
TTTGCCGTCA GCCAGGAAGA AGTTTGGGTA AGCACCCGCT TACAAGGTCT GGCAACCCAT
TTCACGCCTT TCAATCAAGG CCATAAAGGT GGCGCTGGTA ATCCTCCCAA TCCAGCAGGT
TACCCTACGG ATTATCTATG GCAAAAAATT TTTGAGTGCG ACACCTGGCT TGATCTGCTA
GGTCAATTTA TCCATTTCGA GAGAAAAAAC GCTGCCTGGA AAGAAGAAAA AGAATTCCCC
TCGGAAACAC TGGTCTTCCC CCGCTTTCAT CAATGGGAGA TGGTTACCCA ATTAGCAGGG
GCCGTCCGCC AGGAAGGGCC AGGGCAGCAG TATCTCATTC AACACGGCGC TGGTTCGGGG
CAACGCTATT CTATTGCTTG GACTGCCCAC CATGTAGCTG CATTAGCTGA CAGGAAGCAT
CAGAAAATTT TTACCGGGGT CATCGTGGTC ACCGACTGCC ATGAATTCCA CGCCCAGCTC
CAGAACACAC TCTACTCACC TAAGTATGAG AGAAAACAGA TATACCGGCT TACTCAAGAA
ATTGCTCAAA ACTGCCAAGC CGAATGGTTA GCAGCAACAT TGGCGGCAGC CGGCATCCAC
ATTATCACGG TTACGCCCCA AGCCTTTCCT TCCGTGTTGG CCCTCATCCA AAACCAAGTT
ATTTTTAAGG AACATACTTT CGCTGTTATT GCTCACCAGG CTCCTTTTCC TACCAGCAGA
AGTCTGTTCT CTAGGCTAGG GGAAGTGCTG TTATCTGAGT CGGCACAAAA GGAGATTACA
ACCAATACTG AAGATCTTTT GATCACCTCC CTAACAAACA GTCCCCCCTC GCCCCATATC
AGCTATTTTC TCTTCACGGC TGCTCCTCAA GAGAAAACCC TGGCGTTATT TGGACAACCG
ACTAGCCCGG CGCTTCCTTT CTCTGATAAC AACCAACCCG AGCCTTTCCA TCACTATACT
CTACAACAGG CCATTGAAGA AGGCTTCATG CTCAACCCGC TCAAACGCTA TATGACTTAC
GTCACGGCCG CCAGACTGGC CCAGCAGCAA CTTTCACTCA ATCAAGAAGA AAATCTTTCC
TGCCGTATTT TCCCTTGGAT GGGGCAGTCC CCTGACAACC TCAATATCGT ACAAAAAGCC
GAGATAATCA CCGAACATTT CCGCCACCAA GTGGTTCACC TGATGAATGG CCAAGCCAAA
GCCATACTCA CCACCCATTC CTGCCAAGCC GCGCAACGTT ACCAACGGAC TTTTGAACGC
TACATCGCTG CTCAGCAGTA TCAGGACGTC CAGACTCTAA CCAATGATCA TCAGGTGATC
ATTGGCGCTC ACAAACATCA GGCGAAATTC TCATCATCCC AGCTCTGCGC CCTGTATATA
GATAAAAAAT TTACCGGTAC CGATTGCGTT CAAACGCTAT CCCACCTGGG CGAGATCCAC
CCCGGCAAAG AGGCGCCCTT TATTCTCGAC TTTGCTAACC CCGCCGAGCA AGTACTGCAT
GCCTTCCGGC CCTATCATCC TGCCGCAGAG ATTTTTTCTG TCTCTGATCC TAAATTAATT
TACCAGCTTC AATCTTATCT GGATGAGGCA CGCATTTACA ATTGGCAAGA TGTGGAGTCC
TTTGCTGCCG CCTTTTTCGA TACCGAGCAA ACCACAGAGC GCCTAAACTA TCATTGCCAA
TCCGCGGTGG AACGTTTCCG GGAGCAGTAT CAGGAAAACA TAAAAATTAT CCAGACCGCC
CAACAAGCCA AACAGGAAGC AACAGCTGCC GGCGACAGGA TACGCAGAGA GAATGCTTGC
CACAGTTTTA AACAGGCCGA GGAAAGAAAA AATGCCTTGG ACCGATTCAA AGAAAATCTG
CTCAGCTTTG CTGATTGCTA TGAACATCTT TCCCAGATCC TAGACTATGG CAATCAAGAA
CTAGAAAAAC TCAATGTCTA TGCCCGCCAC CTATACTCTC TCCTGGGTGA GATGCAGCAG
AGCGAAGCTA TTGATTCACC CCCGCGAGAA TTCATCTCTT ACCGGCTGAA TAAAATTTGT
GAACGAACAT TCAAAGCAGC CCGACAAGAA TCTCTGGGAG AGCCCGCGTA CAACACCACC
GGAGAATCGC GGGCAAATCC CTGGAGAAAA GAGGCGTCCT GTTCTCTTCT TATCAATCGC
CTTAAGCCGC TCTTGGCAGG CGAATATCTC AGCGACAGAG ATCGATTAAA CTATTTACAC
GCTATCAAGG ATAAGCTGCT TGAAAATCCG GCACTAGCGG CTCAGCTGGA AGATCAGCGG
CCGAACCCCA TCCCATCGAG TGATTTCTCC CAGATAGTGC AGCATACCGT AATGGAAAAT
TTAAAAAATC ACCATGAAAT GGCGGCCCAA TTACTCCATG ATGAAGGGAT GGTTAAGGAT
TTTAGCCGCC TCCTGCTAGA TCTGCTTTCA AACCAGCCTG AGCAATAG
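
The record lists a GC content of 49% for this gene. The following is a minimal sketch of how such a percentage could be computed from a sequence printed in 10-base blocks as above. The function name `gc_percent` is hypothetical, and this is not necessarily how the database derived its figure.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and newlines used to print the sequence in blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Illustration on the first three blocks of the gene sequence only.
first_blocks = "ATGAGGGCTA CTGGAGAGTT AGAGCTAAAT"
print(round(gc_percent(first_blocks), 1))
```

Passing the full gene sequence (all lines joined) to the same function gives the whole-gene value to compare against the listed 49%.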
 
Protein sequence
MRATGELELN LMPTQNSPPN IAMDKVQKTA LQQVIIDHLL SHGWQKGKAE LYDPVLALYP 
EDLIGFIREC HSQPLPKLIQ DAPDQAVQIL LRRAAEEMDQ RGALEVLRHG FKEQENDISL
CQFHSYQEFH SGTLARYKKN RLRVVPNLSY SPYTHEDYIP YLDLVFFVNG VPVATLKATS
ESQPSLSAII RQYQRDCPPH DPRMGHEEPL LAFRKRALVH FAVSQEEVWV STRLQGLATH
FTPFNQGHKG GAGNPPNPAG YPTDYLWQKI FECDTWLDLL GQFIHFERKN AAWKEEKEFP
SETLVFPRFH QWEMVTQLAG AVRQEGPGQQ YLIQHGAGSG QRYSIAWTAH HVAALADRKH
QKIFTGVIVV TDCHEFHAQL QNTLYSPKYE RKQIYRLTQE IAQNCQAEWL AATLAAAGIH
IITVTPQAFP SVLALIQNQV IFKEHTFAVI AHQAPFPTSR SLFSRLGEVL LSESAQKEIT
TNTEDLLITS LTNSPPSPHI SYFLFTAAPQ EKTLALFGQP TSPALPFSDN NQPEPFHHYT
LQQAIEEGFM LNPLKRYMTY VTAARLAQQQ LSLNQEENLS CRIFPWMGQS PDNLNIVQKA
EIITEHFRHQ VVHLMNGQAK AILTTHSCQA AQRYQRTFER YIAAQQYQDV QTLTNDHQVI
IGAHKHQAKF SSSQLCALYI DKKFTGTDCV QTLSHLGEIH PGKEAPFILD FANPAEQVLH
AFRPYHPAAE IFSVSDPKLI YQLQSYLDEA RIYNWQDVES FAAAFFDTEQ TTERLNYHCQ
SAVERFREQY QENIKIIQTA QQAKQEATAA GDRIRRENAC HSFKQAEERK NALDRFKENL
LSFADCYEHL SQILDYGNQE LEKLNVYARH LYSLLGEMQQ SEAIDSPPRE FISYRLNKIC
ERTFKAARQE SLGEPAYNTT GESRANPWRK EASCSLLINR LKPLLAGEYL SDRDRLNYLH
AIKDKLLENP ALAAQLEDQR PNPIPSSDFS QIVQHTVMEN LKNHHEMAAQ LLHDEGMVKD
FSRLLLDLLS NQPEQ
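
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship on the first line of the gene sequence. It assumes the codon-to-amino-acid assignments of the standard genetic code, which table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base until the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGAGGGCTA CTGGAGAGTT AGAGCTAAAT CTCATGCCTA CCCAGAATAG CCCGCCGAAT"
print(translate(first_line))  # MRATGELELNLMPTQNSPPN
```

The output matches the first two blocks of the protein sequence above (MRATGELELN LMPTQNSPPN).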