Gene Information |
Locus tag | Noc_2689 |
Symbol | |
ID | 3704446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3046391 |
End bp | 3049582 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637739171 |
Product | hypothetical protein |
Protein accession | YP_344672 |
Protein GI | 77166147 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.11077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCGGATA GCAGGGAAGC TCAATTCCAA CAGGACATCA TCAACGCCCT CGCCGCTCAG GGCTGGCGGG TGGGCACGGC CAGGGGCTAT GACCGTCCCA GCGCCCTGTA TACCGAGGAT TTCCTGGGCT ACTTCAAGGA TGCCTGGCCG GAGCGCTGGG ACAAGTTCGC CAAGGCCAAC CCCAATGACC CGGAAAGCGT CCTAGTGCAG AAACTGGTGC GGGAGCTGGA GCAGCACGGC ACCCTGGATG TGCTGCGCCA CGGCTTCAAG GTGCCGGCGG TGAAGGTGGA ATTGTGCAGC TTCAAGCCCG ACCACGCCAT GAACCCGGAC ACGCTCAAGG GCTACCAGTG TAACCGTCTG CGGGTGGTGC CGGAGGTGGC CTACTCGCCC CATGCCCGCG ACGCCACGGG GCAGGGCGGC GGCTACAACC CGCGGCTGGA CCTGGTGCTG TTCGTCAACG GCCTGCCCAC CGCCACCCTG GAGCTGAAAA GCCAGTTTAA GCAGTCGGTG GAAAACGCCA AGCGCCAGTA CCGCCATGAC CGCCCGGTCA AAGACCCGCT GAGCCGCAAG CCCGAGCCCC TACTCACCTT CAAGCGCGGG GCGCTGGTGC ATTTTGCCGT GAGCCAGGAC GAAGTGGCCA TGACCACCCG GCTGGCCGGC AAGGACACCT GCTTCCTGCC CTTCAACCTT GGAAGTGAAG ACGGCGGCGC CGGTAATCCC CCGCCGGCGG ACGACGGCCA GTACGCCACC GGTTACCTGT GGCAGCGGCT GTTCCAGCCC GCTGCCTGGC TCAAGGTGCT GGGGCGCTTT CTGCACCTGG AGAAGAAAAC CGTTGAGGGC TTTGACGGCC AGCTCAGCAC CAAAGAGACC ATGATCTTCC CCCGCTATCA CCAGTGGGAG GTGGTCAATC AGCTGATTGA AACCACCCGC AGTGAAGGGC CAGGCAAGCG CTACCTCATT CAGCACAGCG CCGGCTCGGG CAAGTCCAAC TCCATTGCCT GGACGGCGCA CCAGTTGGCC GCGCTGTATG ACGACGCGGG GCAGAAGCTG TTCAACTCGG TGATCGTGGT AACCGACCGC ACGGTGCTGG ACAGCCAGTT ACAGAACACT ATCTACCAGT TCGAGCACGC CCACGGCGTG GTGCGGCCCA TCACCCGAGA TATCGGCAAC CAGAGCAAGT CCCAGCAACT GGCCGAGGCC CTGACTGAGC AGACCCGCAT CATCATCGTC ACCATTCAGA CCTTTCCGGC CCTGTTCCAG GTGCTGGATA AATACCCCAA CCTGGCCAGT GGCCGCTATG CGGTCATCGC CGACGAGGCC CACTCTTCGC AAACCGGCTC CTCGGCCAGC AAGCTCAAGG CTATCTTGAG TTCCGAGCAA GCGGCCGCTG ATCATCAAGA GCCGAAAGAG ATCAGCGCCG AAGACCTGCT CGATGCCGCT GTACAGGCCC GCCAGCCCAA TGAACGCATC AGCTACTACG CTTTTACCGC CACCCCCAAG GCCAAGACCC TGGAGCTATT TGGCCGCCCG CCGGAGCCGA GCGTGCCGCC CAGCGCCGAC AACAAGCCCG AGGCTTTTCA TCTGTATTCC ATGCGCCAGG CCATCGAGGA GGGTTTTATT CTTGATGTGC TGCAGAACTA CCTCAGCTAC AGCACCGCGT GGAAGATCGC CCACCCGGAA GGCGAAGACG AGGAAGTTGA CTCAAAGAAA GCGCGCATCA AGCTGGCGCG CTGGGTGCGG CTGCATCCGT ATAATATTAG CCAGAAGGTC GAGGTCATCG TCGAGCACTT CCGCGCCAAC 
ATCCGCCATC TATTGAACGG CCAAGCCAAG GCCATGGTGG TGACCAGTGG CCGCCAGGAG GCGGTGCGCT ACCAGTTGGC GGTAAAGAGC TATGTCAGGC GGATGGGCTA CAGGGATGTG CATCCGCTGG TGGCGTTTTC CGGCAGCGTG TTGCCTGATG AGGTGATTCC GGAAGAAGTC ACCGAGACCA GCAGCCTGCT CAATGCGGAC CTCCATGGCC GCGACCTGGC CGAGGCTTTT GACACCCACG ATTTCAACGT ACTCATCGCC GCCAACAAGT ACCAGACCGG CTTCGATCAG CCCAAGCTGT GCGCCATGTA CGTGGATAAG AAGCTGCGGG GGGTGGACTG CGTGCAGACC TTGTCGCGCT TGAACAGGAA GTTCGGCGAG GGCAAACAGA CCTTTATCCT TGACTTCTTC AACGAGCCAC AGGATATCCT CGATGCTTTT TTGCCCTACT ACACCCGGGC CGAGCTGACC GATGTCACCC ATCCACAGGT TATCTACGAC CTGCAGAGGA CGCTGGATGA GGAAGGCATT TATCACTGGA ACGAGGTTGA AGCCTTTGCG CTGGCCTTCT TCGATCCCAA GGCGGTGGCC AGCAAACTCA GCTATCACTG CCAGCCGGCC CGCGAGCGCT TCGCCAGGCG CTATGCCTTC AGCCTGGACT CCCGCCAGCA GGCGCTGGGT TTCAAACGCA CCGCCGAGGT CAATGGTGAT AATACCGGCC TAAAGAAGGC CGAGCACGTG CTCAAGGAAG CCGGTGAGCA GATCGACCGA CTGGACCTGT TCCGCAAGAA CCTGCAGAGC TTTGTGCGCC TCTATGAGTT CCTCTCGCAG ATCGTGCCCT ATGAGGACCG TGAGCTGGAA CAGTTGTGTG TGTTCGCCAA GCACCTGCAC CCGCTGCTGC GCGTGGATCG CCTCCAGGAG GAGGTAGATA TCGGTGAACT GCAGCTAACC CATTACCGCC TGAGCAAGCG AGCCGAACAG CAGTTGCGGT TGAATGAGGA GGCCGCGGAA TACACCCTCA AGCCCGGCAG CGATATCGGC AGCGGCCAGC CCCACGACCC GGAAAAGAAA CGCCTGTCGG AAATCATCGA GGCACTGAAT GAGATTTTTG GCGCCGAGGT CAGTGATGAG GACCAATTGC AATTTCTCAT CGGTATCGCC CAGCGTATCA GCCGCCAAGA GGATGTGATG GCCCAGGTTA ATAGCCATTC AGTGGACCAG GTCATGCACG GTCTGTTTCC CAAGCGGGTG CTGGATACCG TACTGGACGC CATGACCGAC CACGAAAAGC TGTCCCTGGA AGTGCTGGAC AACAAAACCA AGAGCCGAGA CTTTGCGCTG GTCATCCTAA AAATGCTCAC TCAGCATACG AGCTTTTCGT AA
|
Protein sequence | MADSREAQFQ QDIINALAAQ GWRVGTARGY DRPSALYTED FLGYFKDAWP ERWDKFAKAN PNDPESVLVQ KLVRELEQHG TLDVLRHGFK VPAVKVELCS FKPDHAMNPD TLKGYQCNRL RVVPEVAYSP HARDATGQGG GYNPRLDLVL FVNGLPTATL ELKSQFKQSV ENAKRQYRHD RPVKDPLSRK PEPLLTFKRG ALVHFAVSQD EVAMTTRLAG KDTCFLPFNL GSEDGGAGNP PPADDGQYAT GYLWQRLFQP AAWLKVLGRF LHLEKKTVEG FDGQLSTKET MIFPRYHQWE VVNQLIETTR SEGPGKRYLI QHSAGSGKSN SIAWTAHQLA ALYDDAGQKL FNSVIVVTDR TVLDSQLQNT IYQFEHAHGV VRPITRDIGN QSKSQQLAEA LTEQTRIIIV TIQTFPALFQ VLDKYPNLAS GRYAVIADEA HSSQTGSSAS KLKAILSSEQ AAADHQEPKE ISAEDLLDAA VQARQPNERI SYYAFTATPK AKTLELFGRP PEPSVPPSAD NKPEAFHLYS MRQAIEEGFI LDVLQNYLSY STAWKIAHPE GEDEEVDSKK ARIKLARWVR LHPYNISQKV EVIVEHFRAN IRHLLNGQAK AMVVTSGRQE AVRYQLAVKS YVRRMGYRDV HPLVAFSGSV LPDEVIPEEV TETSSLLNAD LHGRDLAEAF DTHDFNVLIA ANKYQTGFDQ PKLCAMYVDK KLRGVDCVQT LSRLNRKFGE GKQTFILDFF NEPQDILDAF LPYYTRAELT DVTHPQVIYD LQRTLDEEGI YHWNEVEAFA LAFFDPKAVA SKLSYHCQPA RERFARRYAF SLDSRQQALG FKRTAEVNGD NTGLKKAEHV LKEAGEQIDR LDLFRKNLQS FVRLYEFLSQ IVPYEDRELE QLCVFAKHLH PLLRVDRLQE EVDIGELQLT HYRLSKRAEQ QLRLNEEAAE YTLKPGSDIG SGQPHDPEKK RLSEIIEALN EIFGAEVSDE DQLQFLIGIA QRISRQEDVM AQVNSHSVDQ VMHGLFPKRV LDTVLDAMTD HEKLSLEVLD NKTKSRDFAL VILKMLTQHT SFS
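The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration only.

```python
# Values copied from the Gene Information table for locus tag Noc_2689.
start_bp = 3046391
end_bp = 3049582
gene_length_bp = 3192
protein_length_aa = 1063


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Report whether the derived values agree with the fields in the record.
print("gene length consistent:", computed_gene_length == gene_length_bp)
print("protein length consistent:", computed_protein_length == protein_length_aa)
```

Under these assumptions, both checks agree with the Gene Length and Protein Length fields listed in the record.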