Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1903 |
Symbol | |
ID | 3705497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2174246 |
End bp | 2176135 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637738381 |
Product | peptidase M41, FtsH |
Protein accession | YP_343897 |
Protein GI | 77165372 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.89968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCAACC AAAAACCAGA TATCGAAATC AATAAACCTG CAAAGATACC TTTTTGGCGA AGACCTTGGT TTTCATTCCT GCTCTACGGG CTATTTGTCC TCGTGTCCTA TTCAACCTTC AAAACGGAGC AGGAAGAAAT CCCTTATAGT CAATTTCTAA GATATGTTCA AGCTGGTCAG GTTCAACAAG CCGTCATCAG GAAGGACATG ATCGAAGGAA CCTTAAAGCC CGAGAAAAAA CACAAAGGTA AAAAGACGGA AAAACCCCGG CAATTCGTCA CGGTACCTCT CTGGGATGAC CCACTTGCGG TAATGTTGGA AAAACATGGA GTGAACTATG TTGTCAGGAC CGGTGGCGAT TGGCTCTCAA ATCTGCTATT TAACTGGATT CTTCCCTTTG CGGTCATCTC TTTTTTGTGG ATATGGATCG GGCGTAAGGT AAACAGGGGC GGAAGCTTTT TAAGCTTAGG GGGCAACCGC GTCCGTATCC ACCCGAATAC CCTGCCCAAG GTCACCTTCG ACGATGTCGC CGGGGTTGAA GAAGCTAAAG AGGAACTCCG AGAGATCATT GTCTTTCTGA GAGATCCCAG CCTTATTCAG GATTTAGGAG GGCGCATGCC CAAAGGAGTT CTTTTGGCCG GCCCTCCGGG AACTGGCAAG ACTTTGCTGG CCCGGGCTGT TGCTGGGGAA GCCCGGGTTC CTTTCTTTAA TATCAGTGGC TCCGAGTTTA TCGAACTATT TGTGGGTGTT GGCGCAGCAC GCGCACGGGA TTTGTTTGAG CAGGCGCGGA AAAAAGCACC CTGTATTATT TTCATTGATG AGTTGGATGC AATTGGACGT ACTCGCGCCG GAGCGGTATC CATGGGGGGG CATGATGAAC GGGAGCAAAC CTTGAACCAG TTGTTGGTGG AAATGGACGG CTTTGACCCT TCCGTGGGGG TAGTGGTAAT GGCGGCGACC AACCGGGCGG AGATTCTTGA CAAAGCTTTA TTACGCGCCG GGCGCTTCGA CCGCCGGGTG TTGGTAGACA AACCCGACTT GGAAGGACGC ATCGAAATTC TCAAGATCCA TGTGCGGGCG CTTAAGCTTG GGCAAGACGT GGATCTCAAG GTGGTTGCCC AGCGTACGCC TGGATTTGTC GGCGCTGATT TAGCCAATAT CGCCAATGAA GCCGCCCTGC ATGGAGTCCG AATGGGTCAT GAAGCCATTA CTCTGGGTGA CTTTGAAGCG GCTGTTGACC GGGTAATCGC GGGACCGGAG AAAAAGCACC AAATTCTGAG CCCCGAGGAA AAACGCCGGG TAGCCTACCA TGAATCCGGC CACACTTTGG TAGCGGAGAC GGTTCCCACA GGGGAACCCG TACATAAGGT ATCCATTATT CCCCGGGGTG GCGGCGCTTT GGGATACACT TTGCAACTTC CGGTAAAAGA GAAATTTCTG GCCAACGCTT CCGAACTCAA GGACCAACTG GCCATCCTGT TGGGGGGGCG CTCGGCGGAA GAAATTGTCT ATGGCGATGT CTCCAGCGGC GCACAAAATG ATCTGGAAAA AGCTACTGAA ATTGCCCACG GCATGGTTTG TCAGTTAGGA ATGAACGAAA AACTTGGTCC CTTAACTTAC GGCAAACGCC ACCAATCCCT TTATTTGGGC GTAGATTATG GCGAGGAGAA AAATTACAGT GAAGCGACCG CCCAGGTTAT CGATGCGGAA GTTAAGAAGC TCATAGAAGA AGCCCATCAA CGGGCACGGG AAATACTCAC CGAGCAGCGG CAAATCCTGG AAATCCTGGC AGAACTTTTA GAAGAAAAAG AAATTATCAG TGGAAATGAA GTTAAACAGG TCATCGACAA CGCCCGGGGG GCTCTGGACA AAACGCCATT CCGGAAATAG
|
Protein sequence | MANQKPDIEI NKPAKIPFWR RPWFSFLLYG LFVLVSYSTF KTEQEEIPYS QFLRYVQAGQ VQQAVIRKDM IEGTLKPEKK HKGKKTEKPR QFVTVPLWDD PLAVMLEKHG VNYVVRTGGD WLSNLLFNWI LPFAVISFLW IWIGRKVNRG GSFLSLGGNR VRIHPNTLPK VTFDDVAGVE EAKEELREII VFLRDPSLIQ DLGGRMPKGV LLAGPPGTGK TLLARAVAGE ARVPFFNISG SEFIELFVGV GAARARDLFE QARKKAPCII FIDELDAIGR TRAGAVSMGG HDEREQTLNQ LLVEMDGFDP SVGVVVMAAT NRAEILDKAL LRAGRFDRRV LVDKPDLEGR IEILKIHVRA LKLGQDVDLK VVAQRTPGFV GADLANIANE AALHGVRMGH EAITLGDFEA AVDRVIAGPE KKHQILSPEE KRRVAYHESG HTLVAETVPT GEPVHKVSII PRGGGALGYT LQLPVKEKFL ANASELKDQL AILLGGRSAE EIVYGDVSSG AQNDLEKATE IAHGMVCQLG MNEKLGPLTY GKRHQSLYLG VDYGEEKNYS EATAQVIDAE VKKLIEEAHQ RAREILTEQR QILEILAELL EEKEIISGNE VKQVIDNARG ALDKTPFRK
|
| |