Gene Noc_2909 details

Gene Information

Locus tag: Noc_2909
Symbol:
ID: 3707426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3287993
End bp: 3291121
Gene Length: 3129 bp
Protein Length: 1042 aa
Translation table: 11
GC content: 54%
IMG OID: 637739386
Product: Type I site-specific deoxyribonuclease HsdR
Protein accession: YP_344885
Protein GI: 77166360
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
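
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive, 1-based coordinates and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3287993
end_bp = 3291121
gene_length_bp = 3129
protein_length_aa = 1042

# Assumption: start and end are inclusive, 1-based positions.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```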


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGTCC TACCGGCAGG TTTTATATCC GTGCGTACAA CTACGGAAAA TTTCGGGAGA 
AGATTAGGCG TGTTTCAGGA CGAACTCGAT AAAGTAGAAA CCCCCGCCAT TGCCCAGTTG
CAACGGCTAG GCTGGCGCTA TGTCCGTGGC GTTGAACTGT CACCGGAGGC TGCGGGCGCG
GAACGGGCCT ACTACCGGGA CGTGGTGTTG GTTGGCCGTC TGGAAGGGGC TATCCGGCGC
ATCAACCCTT GGCTCAGTGA GGAAAACCTG CGTAAGGTAG CGCGGGAAAT CACCCACCCC
AACCATGTGG GTTTGATGGA ATACAACCAT GCCATTTACC AGATGTTGGT CAATTACCTG
TCTATCGAAC AGGACTTGGG CAAGGGACGC AAGGGGCAGA CGGTCAAAAT TATTGATTTT
GAAAATCCCG GTAACAATGA ATTCTTGTGC GTTAACCAGT TCAAGGTTGA AGGACTCAAT
CAGAATATCA TTCCCGATAT CGTCTGCTTT GTGAATGGTT TGCCGTTGGC GGTGATTGAA
TGCAAATCCC CTTACGTGGC GGATGCCATT AGCGAAGGTA TTAAGCAACT TCGCCGCTAT
GCCAACCTTC GCTATCCGGA AACCGATGAA GGGGCGCAAA AGCTATTCTG GTACAACCAG
CTAATGATCA CCACCTGCCG GGATCAAGCC AAGGTGGGCA CCATCAGTTC CAGTGCCCAG
CATTACGGGG AATGGAAAGA CGCCTACCCC TTTACCGATG GGGACATCCG TGCACATCCC
TTCAGCCCCG GTGGCAAATA CGAGGTACGG GAAATGGCGC CACCGCTCTG GTATGCCGGT
GAGTTTGAAC AGGCCAGCCC CGTCACGCCC CAACAGCGCC TGCTGGCGGG TATGTTAGAT
CCCGGCAACT TTCTCGACCT GCTCCAAAAC TTCACCATCT TCGAAGCCGT TGAGGGTCGC
CTGGTCAAGA AGGTGGCCCG CTATCAGCAA TACCGCGCCA TGAACAAGGT CATCAAACGC
CTTAAAAGCG GCACGGATCG TAAGGAAAAG TCCGGGGTGG TGTGGCATAC CCAGGGGTCG
GGCAAGTCCC TGACCATGGT GATGCTGGCG GTGAAGATGC GCCGTGATCC GGCGTTACAG
CAATACAAGC TGGTGTTCGT CACCGACCGC ACCCAACTGG ATACCCAGTT GTCCAATACC
TTTCGCGGCG CCCAGAACGA AACGGTCTAC AACGCGGGCT CCGTGGCGGA GTTGAAAACC
CTGTTGAGCC GCGATTCGTC CGATATCGTC ACCGCCACGG TGCAGAAATT CCAGGACGCG
GAAGCGGCAG GCGGCTTCAA AGACCTGAAC CCCAGCGGCA AGATCATTGT GCTGGCGGAC
GAGGCCCACC GGACCCAGTT TGGCGGCTTG GCCATGACTA TCAATGCCGC CTTGCCCAAG
GCCCCCAAGA TTGGTTTTAC CGGCACACCG CTATTGAAGA CCCAAAAAAT GGATCAAGCC
TTCGGGGGCT ATATTGACCA GTACAAGATC AACGAAGCCG TGGAAGACGG CGCCACAGTG
CGCATCATCT ACGAAGGTCG ACAGGTGCAA AGCGATGTGG TGGGCGATTC ATTGGATGCC
CTGTTTGAAG CATACTTCCA AGGGTGCAGT GATGAAGAAA AACGGGCAAT CAAACAAAAA
TATGGGGTAG AATGGGCGGT ACGGGAGGCC CCGGCAAGGA TTCGCTGGGT CTGTATTGAT
CTGCTGAAGC ACTACCGTGA ACACATCCAG CCCAACGGTT TCAAGGCCAT GATTGTGGTG
GGCAGTCGCC ATGCCGCCAC GGTATTCAAG CAAACCCTGG ATGAGCTGGA CGCGCCGCCG
TCGGAGGTGA TTATTTCCGG CAAACACAAT GACCCGGCAA CCCTTGCCCA GTACACAGAC
CGGGTCCACC AGAAGCAGGC GATTCAAAAC TTTACTAAGC CCCTGGGGGA AGACCCCACC
GCCTTTCTCA TCGTCAAGGA CATGCTGCTC ACCGGCTTCG ATGCGCCGAT AGCGCAGGTC
ATGTACATGG ATCGCAGCCT GAAAGATCAC GCGCTGATGC AAGCCATTGC CCGGGTGAAC
CGTACCTGCA AGGGGAAGCA GGCGGGGTTT ATCGTAGATT ACCATGGCTT GTCTGATGAC
CTGACCGAAG CCCTCAACCA GTTCAGCAGC GAAGATGTGC AAGGCACCTA CCATACGCTG
AAGGACGAAA TACCCAAGCT GAAAGCTGCT CATACCCGTG TGGCCGCCAT CTTTGCCGGG
GTGAAAGGCG CGGATGTGGA TGATTATGTG CTGCGCCTGA AGGATGAAGA CACCCGCCAG
CAATTTGAAC GGGGCTTCAA ACGCTTCGCC AAGCAAATGG ACGTGATACT GCCGGATGTG
GCTGCCAAGC CCTATGTGCC AGACCTGAAG TTCTGGGGCA AGGTACAGAA CGCTGCTCGT
AACCGCTACC GCGACCCTGG TTTGAATATT CTCGACGCCG GTGAAAAGGT GCGCAAGCTG
GTGGAAGAGC ACATCATCAG CACCGGCGTA GACCCCAAGA TACCACCGGT TGATCTGATG
GCGGCAAATT TCAGGGAATC GGTAGAGCAG ATCAAGTCGC CGGAATCCCG TGCCTCTGAA
ATTGAAAGCG CCATCAAGCA CCATCTTATC GTTAACCTTG AGGAAGACCC CGAGTTCTAT
AAGTCGCTGA GCCTGCGTCT ACGGGAGATC ATCGAGAAAA CCAATGGCAA ATGGGAGCAG
CAATTGGAAT TGCTCCTTCA GATGGTCGAT AACATCGAAA CCGAACATAA GCAGGCAGCG
GATGAGGTTG GTCTCACCAA AACGGAATTC GCCTTCTATA ATATTCTCAT GGCTGAGGTC
ACTCGGCATG GTGGTGATGG ATTAGTCGGC GATGAAGTTC ATGAGGATAT CAAAGCAACC
AGCCAATTCC TGGTGAAGAC CTTTGATGAG GCAACCCAGA TCGTTGATTT CTTCCACAAG
CCCGATGAAG TGAAGCGGAT GAAAAAGGAA ATCAAGCGGG CGATACTGGA TTGTTCCTAC
GCGGATAAAG CGCTTGTGAC CGTTGTGCAA GAGCGCTTTA TGGACTTGGC TAAGCGAAAG
TTCGGATAA
 
Protein sequence
MVVLPAGFIS VRTTTENFGR RLGVFQDELD KVETPAIAQL QRLGWRYVRG VELSPEAAGA 
ERAYYRDVVL VGRLEGAIRR INPWLSEENL RKVAREITHP NHVGLMEYNH AIYQMLVNYL
SIEQDLGKGR KGQTVKIIDF ENPGNNEFLC VNQFKVEGLN QNIIPDIVCF VNGLPLAVIE
CKSPYVADAI SEGIKQLRRY ANLRYPETDE GAQKLFWYNQ LMITTCRDQA KVGTISSSAQ
HYGEWKDAYP FTDGDIRAHP FSPGGKYEVR EMAPPLWYAG EFEQASPVTP QQRLLAGMLD
PGNFLDLLQN FTIFEAVEGR LVKKVARYQQ YRAMNKVIKR LKSGTDRKEK SGVVWHTQGS
GKSLTMVMLA VKMRRDPALQ QYKLVFVTDR TQLDTQLSNT FRGAQNETVY NAGSVAELKT
LLSRDSSDIV TATVQKFQDA EAAGGFKDLN PSGKIIVLAD EAHRTQFGGL AMTINAALPK
APKIGFTGTP LLKTQKMDQA FGGYIDQYKI NEAVEDGATV RIIYEGRQVQ SDVVGDSLDA
LFEAYFQGCS DEEKRAIKQK YGVEWAVREA PARIRWVCID LLKHYREHIQ PNGFKAMIVV
GSRHAATVFK QTLDELDAPP SEVIISGKHN DPATLAQYTD RVHQKQAIQN FTKPLGEDPT
AFLIVKDMLL TGFDAPIAQV MYMDRSLKDH ALMQAIARVN RTCKGKQAGF IVDYHGLSDD
LTEALNQFSS EDVQGTYHTL KDEIPKLKAA HTRVAAIFAG VKGADVDDYV LRLKDEDTRQ
QFERGFKRFA KQMDVILPDV AAKPYVPDLK FWGKVQNAAR NRYRDPGLNI LDAGEKVRKL
VEEHIISTGV DPKIPPVDLM AANFRESVEQ IKSPESRASE IESAIKHHLI VNLEEDPEFY
KSLSLRLREI IEKTNGKWEQ QLELLLQMVD NIETEHKQAA DEVGLTKTEF AFYNILMAEV
TRHGGDGLVG DEVHEDIKAT SQFLVKTFDE ATQIVDFFHK PDEVKRMKKE IKRAILDCSY
ADKALVTVVQ ERFMDLAKRK FG
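
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of the record or of any particular library.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, as displayed in the record.
first_line = "ATGGTGGTCC TACCGGCAGG TTTTATATCC GTGCGTACAA CTACGGAAAA TTTCGGGAGA"
dna = clean_sequence(first_line)
print(len(dna))  # 60
```

Applied to the full gene sequence, the cleaned length and GC percentage can be compared against the Gene Length and GC content fields reported in the Gene Information section.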