Gene Information |
Locus tag | Noc_0272 |
Symbol | |
ID | 3706443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 300556 |
End bp | 302475 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637736788 |
Product | peptidase M41, FtsH |
Protein accession | YP_342332 |
Protein GI | 77163807 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
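The coordinate and length fields above can be cross-checked against one another. The sketch below is an illustration, not part of the record. It assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon while the protein length does not. The function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that counts toward
    the gene length but not toward the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Start bp and End bp as listed for Noc_0272
gene_len, protein_len = feature_lengths(300556, 302475)
print(gene_len, protein_len)  # 1920 639
```

Under these assumptions the result agrees with the listed Gene Length (1920 bp) and Protein Length (639 aa).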
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.520053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAACG AAAAGCAAGC CTCTCCACCT CCAGCAGCGC CCCCCCTTAA TTGGCGCTAT CTGCTCTGGA TAATCCTCTT GGGTATTTTT CTAATATCCT GGCTTGGCAA TGCAGGCCGT CAGGCAGGAG ATGAAATTAC TTATACGGAG TTTAAGCAGG CATTACATCA AGGGAAGATT GCAAAAGTTA CCCTTGAGGG TCAGCATATT TCAGGCACCT ATCACGAAGC TGGCGGAAAT ATACAGCCAG AAGGCAAGGA CTCTAAAGGA TTTTCCACGA CGAGACCTCC CTTTGATGAT CCAGAGTTGA TGAAACTGCT GGAGCAAAAA GGAGTAGTTG TGCAAGCGAA GAGCGAAGAA CCCTCCTTGT GGATGCAGGC CATCATTGGA ATACTGCCTT GGTTCTTGAT ATTGGGCCTG ATTTTTTATG TCTCCTATAG GATGCAGCAA CGCATGATGG GAGGAGGCAG GGGTGGTCCC TTTGGTTTCG GCAAGGCGCC GGTAAAACGC TTTCGCGAGG GGAGTATAGG AGTCACTTTT GAAGACGTTG CCGGGGTGGA AAACGCAAAG CGTGATCTGC GAGAGATTGT CGATTATCTG AAGGAACCAG GGCAGTTCAA AGCGGTAGGC GCCAAGATTC CCAAAGGTAT CCTTTTGGTA GGCCGCCCTG GGACGGGAAA AACACTTTTG GCCAGGGCGG TGGCCGGCGA AGCGGGTGTC CCTTTCTATA GCATTAGCGG TTCGGATTTT ATAGAGATGT TCGTTGGCGT GGGAGCAGCC CGGGTGCGGG ATATGTTTAA GGCCGCCAAG GAAGAGGCGC CTTCGATTTT GTTTATTGAT GAAATCGATT CCGTGGGACG CGCCCGGGGG ACGGGACTTG GGGGTGGCCA CGATGAACGG GAGCAGACTT TAAATCAAAT CTTGGGCGAA ATGGACGGTT TTGCAGCCCA TGAAAATGTT GTGGTTTTGG CGGCGACTAA CCGTCCTGAC GTGCTTGATC CTGCTTTACT TAGACCGGGA AGGTTTGACC GTAAAGTCGT TCTCGATCTT CCCGATAAAA AAGCCCGTCA ACGGGTGTTA GAAGTCCATA CCAAGAATGT TCCCCTCGCT GCTGATGTGG ATTTGGAAAG AGTTGCTAGG CGTACGGTGG GCTTTTCTGG GGCTGATCTT GCCAATCTGG TGAATGAGGC GGCCTTGCTG ACCGGACGGG AACGCAAGAA GGAGGTGGAC ATGGACATGT TTAACCTTGC CAGGGATAAA ATCGTATTGG GCGCCAAGCG GGAAACGATT TTAGGCGAGG AGGAGAAAAA GCTTGTGGCT TACCATGAAT CGGGCCATGC TTTGACGGCA TGGTTATTAC CTGAGGCTGA TCCCCTGCAC CAAGTCAGCA TTATTCCTCG TGGTATGGCT TTAGGAGTGA CGGAACAAGC TCCAGAAGAA GAACGGCATA GTTTGTCGCG GGCTTATTTG CTTGATCGGC TGGGGGTGAT GCTTGGGGGA CGCATCTCGG AAAAAATTAC TTTTGGTGAT GTCACCTCAG GGGCTGAATC CGATCTTAAA CAGGCGACTC AATTGGCCCG CCGTATGGTT TGCCAATGGG GAATGAGTGA TAAGATTGGG GCAGCGGCCT TTTCACGAAG TGAAGAGCAT GTTTTTCTGG GCCGAGAATT GTCTCAACCG CGGGATTTTA GCGAGCAGAC GGCTCAAATT ATTGATGATG AAATCCGCCG TATCCTTAGT GAAGTGGAAA GGAAGACGGA GAATTTGCTT CAAGAAAACC GCGCGAAGCT AGACGCATTA 
GCAAAAGCGC TTATCGAAGC CGAAACTCTT AATTTGGTGG AAGTAGAAAA AATCTTTAAG AATGTTAAAG AACTCCCGCA GGAAGGTCAT AATGAAGCTG TTGCTACGGG TGCTGGATGA
|
Protein sequence | MDNEKQASPP PAAPPLNWRY LLWIILLGIF LISWLGNAGR QAGDEITYTE FKQALHQGKI AKVTLEGQHI SGTYHEAGGN IQPEGKDSKG FSTTRPPFDD PELMKLLEQK GVVVQAKSEE PSLWMQAIIG ILPWFLILGL IFYVSYRMQQ RMMGGGRGGP FGFGKAPVKR FREGSIGVTF EDVAGVENAK RDLREIVDYL KEPGQFKAVG AKIPKGILLV GRPGTGKTLL ARAVAGEAGV PFYSISGSDF IEMFVGVGAA RVRDMFKAAK EEAPSILFID EIDSVGRARG TGLGGGHDER EQTLNQILGE MDGFAAHENV VVLAATNRPD VLDPALLRPG RFDRKVVLDL PDKKARQRVL EVHTKNVPLA ADVDLERVAR RTVGFSGADL ANLVNEAALL TGRERKKEVD MDMFNLARDK IVLGAKRETI LGEEEKKLVA YHESGHALTA WLLPEADPLH QVSIIPRGMA LGVTEQAPEE ERHSLSRAYL LDRLGVMLGG RISEKITFGD VTSGAESDLK QATQLARRMV CQWGMSDKIG AAAFSRSEEH VFLGRELSQP RDFSEQTAQI IDDEIRRILS EVERKTENLL QENRAKLDAL AKALIEAETL NLVEVEKIFK NVKELPQEGH NEAVATGAG
|
| |
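The gene sequence above is printed in space-separated blocks of 10 bases. The sketch below shows one way to strip that spacing, compute GC content, and translate the coding sequence. It is an illustration only. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in which codons may initiate translation, and that does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above
print(translate("ATGGATAACG AAAAGCAAGC CTCTCCACCT"))  # MDNEKQASPP
```

The first 30 bases translate to MDNEKQASPP, which matches the first block of the listed protein sequence. The same functions can be applied to the full gene sequence for comparison with the listed protein and GC content.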