Gene Noc_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2689 
Symbol 
ID3704446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3046391 
End bp3049582 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content60% 
IMG OID637739171 
Producthypothetical protein 
Protein accessionYP_344672 
Protein GI77166147 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.11077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGATA GCAGGGAAGC TCAATTCCAA CAGGACATCA TCAACGCCCT CGCCGCTCAG 
GGCTGGCGGG TGGGCACGGC CAGGGGCTAT GACCGTCCCA GCGCCCTGTA TACCGAGGAT
TTCCTGGGCT ACTTCAAGGA TGCCTGGCCG GAGCGCTGGG ACAAGTTCGC CAAGGCCAAC
CCCAATGACC CGGAAAGCGT CCTAGTGCAG AAACTGGTGC GGGAGCTGGA GCAGCACGGC
ACCCTGGATG TGCTGCGCCA CGGCTTCAAG GTGCCGGCGG TGAAGGTGGA ATTGTGCAGC
TTCAAGCCCG ACCACGCCAT GAACCCGGAC ACGCTCAAGG GCTACCAGTG TAACCGTCTG
CGGGTGGTGC CGGAGGTGGC CTACTCGCCC CATGCCCGCG ACGCCACGGG GCAGGGCGGC
GGCTACAACC CGCGGCTGGA CCTGGTGCTG TTCGTCAACG GCCTGCCCAC CGCCACCCTG
GAGCTGAAAA GCCAGTTTAA GCAGTCGGTG GAAAACGCCA AGCGCCAGTA CCGCCATGAC
CGCCCGGTCA AAGACCCGCT GAGCCGCAAG CCCGAGCCCC TACTCACCTT CAAGCGCGGG
GCGCTGGTGC ATTTTGCCGT GAGCCAGGAC GAAGTGGCCA TGACCACCCG GCTGGCCGGC
AAGGACACCT GCTTCCTGCC CTTCAACCTT GGAAGTGAAG ACGGCGGCGC CGGTAATCCC
CCGCCGGCGG ACGACGGCCA GTACGCCACC GGTTACCTGT GGCAGCGGCT GTTCCAGCCC
GCTGCCTGGC TCAAGGTGCT GGGGCGCTTT CTGCACCTGG AGAAGAAAAC CGTTGAGGGC
TTTGACGGCC AGCTCAGCAC CAAAGAGACC ATGATCTTCC CCCGCTATCA CCAGTGGGAG
GTGGTCAATC AGCTGATTGA AACCACCCGC AGTGAAGGGC CAGGCAAGCG CTACCTCATT
CAGCACAGCG CCGGCTCGGG CAAGTCCAAC TCCATTGCCT GGACGGCGCA CCAGTTGGCC
GCGCTGTATG ACGACGCGGG GCAGAAGCTG TTCAACTCGG TGATCGTGGT AACCGACCGC
ACGGTGCTGG ACAGCCAGTT ACAGAACACT ATCTACCAGT TCGAGCACGC CCACGGCGTG
GTGCGGCCCA TCACCCGAGA TATCGGCAAC CAGAGCAAGT CCCAGCAACT GGCCGAGGCC
CTGACTGAGC AGACCCGCAT CATCATCGTC ACCATTCAGA CCTTTCCGGC CCTGTTCCAG
GTGCTGGATA AATACCCCAA CCTGGCCAGT GGCCGCTATG CGGTCATCGC CGACGAGGCC
CACTCTTCGC AAACCGGCTC CTCGGCCAGC AAGCTCAAGG CTATCTTGAG TTCCGAGCAA
GCGGCCGCTG ATCATCAAGA GCCGAAAGAG ATCAGCGCCG AAGACCTGCT CGATGCCGCT
GTACAGGCCC GCCAGCCCAA TGAACGCATC AGCTACTACG CTTTTACCGC CACCCCCAAG
GCCAAGACCC TGGAGCTATT TGGCCGCCCG CCGGAGCCGA GCGTGCCGCC CAGCGCCGAC
AACAAGCCCG AGGCTTTTCA TCTGTATTCC ATGCGCCAGG CCATCGAGGA GGGTTTTATT
CTTGATGTGC TGCAGAACTA CCTCAGCTAC AGCACCGCGT GGAAGATCGC CCACCCGGAA
GGCGAAGACG AGGAAGTTGA CTCAAAGAAA GCGCGCATCA AGCTGGCGCG CTGGGTGCGG
CTGCATCCGT ATAATATTAG CCAGAAGGTC GAGGTCATCG TCGAGCACTT CCGCGCCAAC
ATCCGCCATC TATTGAACGG CCAAGCCAAG GCCATGGTGG TGACCAGTGG CCGCCAGGAG
GCGGTGCGCT ACCAGTTGGC GGTAAAGAGC TATGTCAGGC GGATGGGCTA CAGGGATGTG
CATCCGCTGG TGGCGTTTTC CGGCAGCGTG TTGCCTGATG AGGTGATTCC GGAAGAAGTC
ACCGAGACCA GCAGCCTGCT CAATGCGGAC CTCCATGGCC GCGACCTGGC CGAGGCTTTT
GACACCCACG ATTTCAACGT ACTCATCGCC GCCAACAAGT ACCAGACCGG CTTCGATCAG
CCCAAGCTGT GCGCCATGTA CGTGGATAAG AAGCTGCGGG GGGTGGACTG CGTGCAGACC
TTGTCGCGCT TGAACAGGAA GTTCGGCGAG GGCAAACAGA CCTTTATCCT TGACTTCTTC
AACGAGCCAC AGGATATCCT CGATGCTTTT TTGCCCTACT ACACCCGGGC CGAGCTGACC
GATGTCACCC ATCCACAGGT TATCTACGAC CTGCAGAGGA CGCTGGATGA GGAAGGCATT
TATCACTGGA ACGAGGTTGA AGCCTTTGCG CTGGCCTTCT TCGATCCCAA GGCGGTGGCC
AGCAAACTCA GCTATCACTG CCAGCCGGCC CGCGAGCGCT TCGCCAGGCG CTATGCCTTC
AGCCTGGACT CCCGCCAGCA GGCGCTGGGT TTCAAACGCA CCGCCGAGGT CAATGGTGAT
AATACCGGCC TAAAGAAGGC CGAGCACGTG CTCAAGGAAG CCGGTGAGCA GATCGACCGA
CTGGACCTGT TCCGCAAGAA CCTGCAGAGC TTTGTGCGCC TCTATGAGTT CCTCTCGCAG
ATCGTGCCCT ATGAGGACCG TGAGCTGGAA CAGTTGTGTG TGTTCGCCAA GCACCTGCAC
CCGCTGCTGC GCGTGGATCG CCTCCAGGAG GAGGTAGATA TCGGTGAACT GCAGCTAACC
CATTACCGCC TGAGCAAGCG AGCCGAACAG CAGTTGCGGT TGAATGAGGA GGCCGCGGAA
TACACCCTCA AGCCCGGCAG CGATATCGGC AGCGGCCAGC CCCACGACCC GGAAAAGAAA
CGCCTGTCGG AAATCATCGA GGCACTGAAT GAGATTTTTG GCGCCGAGGT CAGTGATGAG
GACCAATTGC AATTTCTCAT CGGTATCGCC CAGCGTATCA GCCGCCAAGA GGATGTGATG
GCCCAGGTTA ATAGCCATTC AGTGGACCAG GTCATGCACG GTCTGTTTCC CAAGCGGGTG
CTGGATACCG TACTGGACGC CATGACCGAC CACGAAAAGC TGTCCCTGGA AGTGCTGGAC
AACAAAACCA AGAGCCGAGA CTTTGCGCTG GTCATCCTAA AAATGCTCAC TCAGCATACG
AGCTTTTCGT AA
 
Protein sequence
MADSREAQFQ QDIINALAAQ GWRVGTARGY DRPSALYTED FLGYFKDAWP ERWDKFAKAN 
PNDPESVLVQ KLVRELEQHG TLDVLRHGFK VPAVKVELCS FKPDHAMNPD TLKGYQCNRL
RVVPEVAYSP HARDATGQGG GYNPRLDLVL FVNGLPTATL ELKSQFKQSV ENAKRQYRHD
RPVKDPLSRK PEPLLTFKRG ALVHFAVSQD EVAMTTRLAG KDTCFLPFNL GSEDGGAGNP
PPADDGQYAT GYLWQRLFQP AAWLKVLGRF LHLEKKTVEG FDGQLSTKET MIFPRYHQWE
VVNQLIETTR SEGPGKRYLI QHSAGSGKSN SIAWTAHQLA ALYDDAGQKL FNSVIVVTDR
TVLDSQLQNT IYQFEHAHGV VRPITRDIGN QSKSQQLAEA LTEQTRIIIV TIQTFPALFQ
VLDKYPNLAS GRYAVIADEA HSSQTGSSAS KLKAILSSEQ AAADHQEPKE ISAEDLLDAA
VQARQPNERI SYYAFTATPK AKTLELFGRP PEPSVPPSAD NKPEAFHLYS MRQAIEEGFI
LDVLQNYLSY STAWKIAHPE GEDEEVDSKK ARIKLARWVR LHPYNISQKV EVIVEHFRAN
IRHLLNGQAK AMVVTSGRQE AVRYQLAVKS YVRRMGYRDV HPLVAFSGSV LPDEVIPEEV
TETSSLLNAD LHGRDLAEAF DTHDFNVLIA ANKYQTGFDQ PKLCAMYVDK KLRGVDCVQT
LSRLNRKFGE GKQTFILDFF NEPQDILDAF LPYYTRAELT DVTHPQVIYD LQRTLDEEGI
YHWNEVEAFA LAFFDPKAVA SKLSYHCQPA RERFARRYAF SLDSRQQALG FKRTAEVNGD
NTGLKKAEHV LKEAGEQIDR LDLFRKNLQS FVRLYEFLSQ IVPYEDRELE QLCVFAKHLH
PLLRVDRLQE EVDIGELQLT HYRLSKRAEQ QLRLNEEAAE YTLKPGSDIG SGQPHDPEKK
RLSEIIEALN EIFGAEVSDE DQLQFLIGIA QRISRQEDVM AQVNSHSVDQ VMHGLFPKRV
LDTVLDAMTD HEKLSLEVLD NKTKSRDFAL VILKMLTQHT SFS