Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_3028 |
Symbol | |
ID | 3705775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3422932 |
End bp | 3425322 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637739502 |
Product | hypothetical protein |
Protein accession | YP_345000 |
Protein GI | 77166475 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.194288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA AACAACTCAC GGAGCGAGAC ATCTGCACCA AGTTCATCAC ACCTGCGCTT GAACAGTCCG GATGGGATAT TGCTACTCAA ATACGTGAAG AATTCCCGCT GACCAAAGGG CGAATTATCG TCCGTGGCAA ACTGCATACC CGTGCTAAAC ACAAACGTGC CGATTACGTA CTTTTCTACA AGCCCAATAT TCCTATTGCT GTGATTGAAG CGAAAGACAA TAACCACAGC CTGGGTGACG GTATGCAACA GGGCTTGGGT TATGCGGAGA TGCTTCAGGT TCCGTTTGTT TTCAGTTCAA ACGGTGACGG CTTTCTGTTT CACAACAAAA TCGCCAAAGA CGGCATCATT GAGCGCGAGT TAGCACTACA CGAATTCCCT TCGGCAGAGA CACTCTGGCA ATGGTGGGCA GAGCACAAAG GGCTTGATCA ACAACAAAAC AAACTGGTGA CGCAAGATTA CTACAGTGAT GGCAGTAACA AAACGCCGCG CTACTACCAG CTCTTGGCCA TTAACAAAAC CATCGAAGCC ATTGCCAATG GCCAGAATCG CATACTACTG GTTATGGCGA CGGGCACAGG AAAAACTTTT ACCGCCTTCC AAATCATCTG GCGCTTGTGG CGATCCAAAG CCAAAAAGCG CATCCTGTTT TTGGCGGATC GCAATATTCT GGTCGATCAG ACCATGACCA ATGACTTCAA GCCCTTCGGT TCGGCCATGA CCAAAATCCA GAAGCGCCAG GCTAACAAGT CATACGAAAT CTATCTCTCG CTTTATCAGG CGGTTACCGG CAACGAAGAA AAGAAAAACA TTTACAAACA ATTCAGCCCG GATTTTTTCG ACCTAATCGT CATTGATGAA TGCCACAGAG GCAGTGCCGC AGCCGACTCC GCTTGGCGCG AAATCCTGGA GTATTTCTCC TCTGCCACCC AAATCGGCCT AACCGCCACA CCGAAAGAAA CCAAAGAAGT GTCGAATATC GACTATTTTG GCGACCCGAT TTACACCTAC AGCCTGCGTC AAGGCATTGA TGACGGTTTT TTGGCCCCCT ACAAGGTGGT GCGTATAGAT CTCGACCGAG ATTTAACCGG CTGGCGGCCT GACAAAGGGA TGACCGATAA ACACGGTAAC GAGATCGAAG ATCGTATCTA CAACCAGAAA GACTTTGATA AAACGCTGGT ACTAGAACAG CGCACTCAAT TAGTCGCCAA GAAAATTACG GAATTTCTCA AGCAAACCAA TCGCTTTGAT AAAGCCATCG TGTTTTGCGA AAACATCGAC CATGCCGAGC GTATGCGGCA GGCTTTGGTC AACGAAAATG CCGATTTGGT CGCGCAAAAC AGCAAGTACA TCATGCGCAT AACCGGCGAC AACGAAGAAG GCAAGGCCGA GCTAGACAAC TTTATTTTTC CTGAGAGCAA ATACCCGGTG ATTGCCACGA CGTCCAAGTT AATGACCACA GGCGTCGATG CGCAAACCTG CAAATTGATC GTGCTCGACC AACGCATCCA GTCCATGACG GAATTTAAGC AAATTATCGG GCGTGGCGCC CGTATCAATG AAGATTACGG CAAGTTCTAC TTCACCATCA TCGACTTCAA AAAAGCCACC GAACTCTTTG CTGATCCAGA CTTTGATGGC GACCCGGTAC AAGTGTATGA ACCCTCTGGA TCTGAATCCC CCGTGCCGCC AGATATACCT TTAGAGGAAG TTGCAGGCGA GGCGGATAAA GAAGGAATGA CATACCCCAA ACCCGGTGAT GATCGCGAGT GGAGTGGCAT TGCGGAGCCT GATGATGAAG GCGGCGGTGT GCGTCGCTAT GTCGTGGCCA ATGTACCGGT CACAGTTGCG GCTGAACGTG TACAGTATTT TGATGCTAAT GGCAAGCTCA TCACCGAATC ACTTAAAGAC TACACCCGCA AGGCGGTGGC GAAAGAATAC GCCACACTCG ACGATTTCCT GCGCCGCTGG AGCAGTGCCG AGAAGAAGCA GGCCATTATT AAAAAGCTGG CCGAGCACGG TGTGTTTTTT GAAGCGCTGG CCGATGAAGT GGGCGAAAAA TCAGGCAAAG CCTTTGATCC TTTTGATCTG GTGTGTCATA TCGCCTGGGA TATGCCGCCG CTAACCCGTA AAGAGCGAGC CGAGCAGGTT AAAAAGCGCA ACTACTTTAC CCAGTACGGT GAGCAGGCTC GAAAGGTGTT AGAAGCTTTA TTGGATAAAT ACGCCGATGA AGGCGTGGCG CAAATTGAAG AAACCCGAAT TCTTACTATC GCGCCCTTTA CCCAATTTGG CACACCGCTG GAAATCATCC GCGCCTTCGG TGGTCCGGAT CAGTACCAGC AAGCCGTAAA CGAACTCGAA CAGGCGCTTT ATAACGCCTG A
|
Protein sequence | MNKKQLTERD ICTKFITPAL EQSGWDIATQ IREEFPLTKG RIIVRGKLHT RAKHKRADYV LFYKPNIPIA VIEAKDNNHS LGDGMQQGLG YAEMLQVPFV FSSNGDGFLF HNKIAKDGII ERELALHEFP SAETLWQWWA EHKGLDQQQN KLVTQDYYSD GSNKTPRYYQ LLAINKTIEA IANGQNRILL VMATGTGKTF TAFQIIWRLW RSKAKKRILF LADRNILVDQ TMTNDFKPFG SAMTKIQKRQ ANKSYEIYLS LYQAVTGNEE KKNIYKQFSP DFFDLIVIDE CHRGSAAADS AWREILEYFS SATQIGLTAT PKETKEVSNI DYFGDPIYTY SLRQGIDDGF LAPYKVVRID LDRDLTGWRP DKGMTDKHGN EIEDRIYNQK DFDKTLVLEQ RTQLVAKKIT EFLKQTNRFD KAIVFCENID HAERMRQALV NENADLVAQN SKYIMRITGD NEEGKAELDN FIFPESKYPV IATTSKLMTT GVDAQTCKLI VLDQRIQSMT EFKQIIGRGA RINEDYGKFY FTIIDFKKAT ELFADPDFDG DPVQVYEPSG SESPVPPDIP LEEVAGEADK EGMTYPKPGD DREWSGIAEP DDEGGGVRRY VVANVPVTVA AERVQYFDAN GKLITESLKD YTRKAVAKEY ATLDDFLRRW SSAEKKQAII KKLAEHGVFF EALADEVGEK SGKAFDPFDL VCHIAWDMPP LTRKERAEQV KKRNYFTQYG EQARKVLEAL LDKYADEGVA QIEETRILTI APFTQFGTPL EIIRAFGGPD QYQQAVNELE QALYNA
|
| |