Gene Noc_1903 details


Gene Information

Locus tag: Noc_1903
Symbol:
ID: 3705497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2174246
End bp: 2176135
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 51%
IMG OID: 637738381
Product: peptidase M41, FtsH
Protein accession: YP_343897
Protein GI: 77165372
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
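
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive, 1-based coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates copied from the record; inclusive 1-based positions are assumed.
start_bp = 2174246
end_bp = 2176135

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed to be the stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1890 0 629
```

Under these assumptions the result agrees with the listed gene length (1890 bp) and protein length (629 aa).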


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.89968
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCCAACC AAAAACCAGA TATCGAAATC AATAAACCTG CAAAGATACC TTTTTGGCGA 
AGACCTTGGT TTTCATTCCT GCTCTACGGG CTATTTGTCC TCGTGTCCTA TTCAACCTTC
AAAACGGAGC AGGAAGAAAT CCCTTATAGT CAATTTCTAA GATATGTTCA AGCTGGTCAG
GTTCAACAAG CCGTCATCAG GAAGGACATG ATCGAAGGAA CCTTAAAGCC CGAGAAAAAA
CACAAAGGTA AAAAGACGGA AAAACCCCGG CAATTCGTCA CGGTACCTCT CTGGGATGAC
CCACTTGCGG TAATGTTGGA AAAACATGGA GTGAACTATG TTGTCAGGAC CGGTGGCGAT
TGGCTCTCAA ATCTGCTATT TAACTGGATT CTTCCCTTTG CGGTCATCTC TTTTTTGTGG
ATATGGATCG GGCGTAAGGT AAACAGGGGC GGAAGCTTTT TAAGCTTAGG GGGCAACCGC
GTCCGTATCC ACCCGAATAC CCTGCCCAAG GTCACCTTCG ACGATGTCGC CGGGGTTGAA
GAAGCTAAAG AGGAACTCCG AGAGATCATT GTCTTTCTGA GAGATCCCAG CCTTATTCAG
GATTTAGGAG GGCGCATGCC CAAAGGAGTT CTTTTGGCCG GCCCTCCGGG AACTGGCAAG
ACTTTGCTGG CCCGGGCTGT TGCTGGGGAA GCCCGGGTTC CTTTCTTTAA TATCAGTGGC
TCCGAGTTTA TCGAACTATT TGTGGGTGTT GGCGCAGCAC GCGCACGGGA TTTGTTTGAG
CAGGCGCGGA AAAAAGCACC CTGTATTATT TTCATTGATG AGTTGGATGC AATTGGACGT
ACTCGCGCCG GAGCGGTATC CATGGGGGGG CATGATGAAC GGGAGCAAAC CTTGAACCAG
TTGTTGGTGG AAATGGACGG CTTTGACCCT TCCGTGGGGG TAGTGGTAAT GGCGGCGACC
AACCGGGCGG AGATTCTTGA CAAAGCTTTA TTACGCGCCG GGCGCTTCGA CCGCCGGGTG
TTGGTAGACA AACCCGACTT GGAAGGACGC ATCGAAATTC TCAAGATCCA TGTGCGGGCG
CTTAAGCTTG GGCAAGACGT GGATCTCAAG GTGGTTGCCC AGCGTACGCC TGGATTTGTC
GGCGCTGATT TAGCCAATAT CGCCAATGAA GCCGCCCTGC ATGGAGTCCG AATGGGTCAT
GAAGCCATTA CTCTGGGTGA CTTTGAAGCG GCTGTTGACC GGGTAATCGC GGGACCGGAG
AAAAAGCACC AAATTCTGAG CCCCGAGGAA AAACGCCGGG TAGCCTACCA TGAATCCGGC
CACACTTTGG TAGCGGAGAC GGTTCCCACA GGGGAACCCG TACATAAGGT ATCCATTATT
CCCCGGGGTG GCGGCGCTTT GGGATACACT TTGCAACTTC CGGTAAAAGA GAAATTTCTG
GCCAACGCTT CCGAACTCAA GGACCAACTG GCCATCCTGT TGGGGGGGCG CTCGGCGGAA
GAAATTGTCT ATGGCGATGT CTCCAGCGGC GCACAAAATG ATCTGGAAAA AGCTACTGAA
ATTGCCCACG GCATGGTTTG TCAGTTAGGA ATGAACGAAA AACTTGGTCC CTTAACTTAC
GGCAAACGCC ACCAATCCCT TTATTTGGGC GTAGATTATG GCGAGGAGAA AAATTACAGT
GAAGCGACCG CCCAGGTTAT CGATGCGGAA GTTAAGAAGC TCATAGAAGA AGCCCATCAA
CGGGCACGGG AAATACTCAC CGAGCAGCGG CAAATCCTGG AAATCCTGGC AGAACTTTTA
GAAGAAAAAG AAATTATCAG TGGAAATGAA GTTAAACAGG TCATCGACAA CGCCCGGGGG
GCTCTGGACA AAACGCCATT CCGGAAATAG
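
The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the helper name `gc_content` is hypothetical, and the function ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten-base block of the gene sequence above: 3 G + 4 C out of 10 bases.
print(gc_content("GTGGCCAACC"))  # prints: 70.0
```

Passing the full 1890 bp sequence to the same function gives the gene-wide value to compare with the rounded figure in the record.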
 
Protein sequence
MANQKPDIEI NKPAKIPFWR RPWFSFLLYG LFVLVSYSTF KTEQEEIPYS QFLRYVQAGQ 
VQQAVIRKDM IEGTLKPEKK HKGKKTEKPR QFVTVPLWDD PLAVMLEKHG VNYVVRTGGD
WLSNLLFNWI LPFAVISFLW IWIGRKVNRG GSFLSLGGNR VRIHPNTLPK VTFDDVAGVE
EAKEELREII VFLRDPSLIQ DLGGRMPKGV LLAGPPGTGK TLLARAVAGE ARVPFFNISG
SEFIELFVGV GAARARDLFE QARKKAPCII FIDELDAIGR TRAGAVSMGG HDEREQTLNQ
LLVEMDGFDP SVGVVVMAAT NRAEILDKAL LRAGRFDRRV LVDKPDLEGR IEILKIHVRA
LKLGQDVDLK VVAQRTPGFV GADLANIANE AALHGVRMGH EAITLGDFEA AVDRVIAGPE
KKHQILSPEE KRRVAYHESG HTLVAETVPT GEPVHKVSII PRGGGALGYT LQLPVKEKFL
ANASELKDQL AILLGGRSAE EIVYGDVSSG AQNDLEKATE IAHGMVCQLG MNEKLGPLTY
GKRHQSLYLG VDYGEEKNYS EATAQVIDAE VKKLIEEAHQ RAREILTEQR QILEILAELL
EEKEIISGNE VKQVIDNARG ALDKTPFRK
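
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below is a simplified illustration, not the database's own pipeline: it builds the standard codon assignments (which translation table 11 shares), and treats a first codon of ATG, GTG or TTG as methionine, since these are among the initiation codons of table 11. The names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only a subset of the table 11 initiation codons is handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        # An initiation codon at position 0 is read as methionine.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence; GTG is read as the start codon.
print(translate("GTGGCCAACC AAAAACCAGA TATCGAAATC"))  # prints: MANQKPDIEI
```

The output matches the first ten residues of the protein sequence above, and the last 30 bases of the gene translate to ALDKTPFRK followed by the TAG stop codon.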