Gene Noc_0272 details


Gene Information

Locus tag: Noc_0272
Symbol:
ID: 3706443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 300556
End bp: 302475
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 50%
IMG OID: 637736788
Product: peptidase M41, FtsH
Protein accession: YP_342332
Protein GI: 77163807
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
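
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
START_BP = 300556
END_BP = 302475


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1920 639
```

Both values match the Gene Length and Protein Length fields of the record.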


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.520053
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAACG AAAAGCAAGC CTCTCCACCT CCAGCAGCGC CCCCCCTTAA TTGGCGCTAT 
CTGCTCTGGA TAATCCTCTT GGGTATTTTT CTAATATCCT GGCTTGGCAA TGCAGGCCGT
CAGGCAGGAG ATGAAATTAC TTATACGGAG TTTAAGCAGG CATTACATCA AGGGAAGATT
GCAAAAGTTA CCCTTGAGGG TCAGCATATT TCAGGCACCT ATCACGAAGC TGGCGGAAAT
ATACAGCCAG AAGGCAAGGA CTCTAAAGGA TTTTCCACGA CGAGACCTCC CTTTGATGAT
CCAGAGTTGA TGAAACTGCT GGAGCAAAAA GGAGTAGTTG TGCAAGCGAA GAGCGAAGAA
CCCTCCTTGT GGATGCAGGC CATCATTGGA ATACTGCCTT GGTTCTTGAT ATTGGGCCTG
ATTTTTTATG TCTCCTATAG GATGCAGCAA CGCATGATGG GAGGAGGCAG GGGTGGTCCC
TTTGGTTTCG GCAAGGCGCC GGTAAAACGC TTTCGCGAGG GGAGTATAGG AGTCACTTTT
GAAGACGTTG CCGGGGTGGA AAACGCAAAG CGTGATCTGC GAGAGATTGT CGATTATCTG
AAGGAACCAG GGCAGTTCAA AGCGGTAGGC GCCAAGATTC CCAAAGGTAT CCTTTTGGTA
GGCCGCCCTG GGACGGGAAA AACACTTTTG GCCAGGGCGG TGGCCGGCGA AGCGGGTGTC
CCTTTCTATA GCATTAGCGG TTCGGATTTT ATAGAGATGT TCGTTGGCGT GGGAGCAGCC
CGGGTGCGGG ATATGTTTAA GGCCGCCAAG GAAGAGGCGC CTTCGATTTT GTTTATTGAT
GAAATCGATT CCGTGGGACG CGCCCGGGGG ACGGGACTTG GGGGTGGCCA CGATGAACGG
GAGCAGACTT TAAATCAAAT CTTGGGCGAA ATGGACGGTT TTGCAGCCCA TGAAAATGTT
GTGGTTTTGG CGGCGACTAA CCGTCCTGAC GTGCTTGATC CTGCTTTACT TAGACCGGGA
AGGTTTGACC GTAAAGTCGT TCTCGATCTT CCCGATAAAA AAGCCCGTCA ACGGGTGTTA
GAAGTCCATA CCAAGAATGT TCCCCTCGCT GCTGATGTGG ATTTGGAAAG AGTTGCTAGG
CGTACGGTGG GCTTTTCTGG GGCTGATCTT GCCAATCTGG TGAATGAGGC GGCCTTGCTG
ACCGGACGGG AACGCAAGAA GGAGGTGGAC ATGGACATGT TTAACCTTGC CAGGGATAAA
ATCGTATTGG GCGCCAAGCG GGAAACGATT TTAGGCGAGG AGGAGAAAAA GCTTGTGGCT
TACCATGAAT CGGGCCATGC TTTGACGGCA TGGTTATTAC CTGAGGCTGA TCCCCTGCAC
CAAGTCAGCA TTATTCCTCG TGGTATGGCT TTAGGAGTGA CGGAACAAGC TCCAGAAGAA
GAACGGCATA GTTTGTCGCG GGCTTATTTG CTTGATCGGC TGGGGGTGAT GCTTGGGGGA
CGCATCTCGG AAAAAATTAC TTTTGGTGAT GTCACCTCAG GGGCTGAATC CGATCTTAAA
CAGGCGACTC AATTGGCCCG CCGTATGGTT TGCCAATGGG GAATGAGTGA TAAGATTGGG
GCAGCGGCCT TTTCACGAAG TGAAGAGCAT GTTTTTCTGG GCCGAGAATT GTCTCAACCG
CGGGATTTTA GCGAGCAGAC GGCTCAAATT ATTGATGATG AAATCCGCCG TATCCTTAGT
GAAGTGGAAA GGAAGACGGA GAATTTGCTT CAAGAAAACC GCGCGAAGCT AGACGCATTA
GCAAAAGCGC TTATCGAAGC CGAAACTCTT AATTTGGTGG AAGTAGAAAA AATCTTTAAG
AATGTTAAAG AACTCCCGCA GGAAGGTCAT AATGAAGCTG TTGCTACGGG TGCTGGATGA
 
Protein sequence
MDNEKQASPP PAAPPLNWRY LLWIILLGIF LISWLGNAGR QAGDEITYTE FKQALHQGKI 
AKVTLEGQHI SGTYHEAGGN IQPEGKDSKG FSTTRPPFDD PELMKLLEQK GVVVQAKSEE
PSLWMQAIIG ILPWFLILGL IFYVSYRMQQ RMMGGGRGGP FGFGKAPVKR FREGSIGVTF
EDVAGVENAK RDLREIVDYL KEPGQFKAVG AKIPKGILLV GRPGTGKTLL ARAVAGEAGV
PFYSISGSDF IEMFVGVGAA RVRDMFKAAK EEAPSILFID EIDSVGRARG TGLGGGHDER
EQTLNQILGE MDGFAAHENV VVLAATNRPD VLDPALLRPG RFDRKVVLDL PDKKARQRVL
EVHTKNVPLA ADVDLERVAR RTVGFSGADL ANLVNEAALL TGRERKKEVD MDMFNLARDK
IVLGAKRETI LGEEEKKLVA YHESGHALTA WLLPEADPLH QVSIIPRGMA LGVTEQAPEE
ERHSLSRAYL LDRLGVMLGG RISEKITFGD VTSGAESDLK QATQLARRMV CQWGMSDKIG
AAAFSRSEEH VFLGRELSQP RDFSEQTAQI IDDEIRRILS EVERKTENLL QENRAKLDAL
AKALIEAETL NLVEVEKIFK NVKELPQEGH NEAVATGAG
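
The sequences above are printed in space-separated blocks of 10 characters, which must be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and the example input is only the first line of the gene sequence, so its GC fraction need not equal the whole-gene value reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as printed in the record.
first_line = "ATGGATAACG AAAAGCAAGC CTCTCCACCT CCAGCAGCGC CCCCCCTTAA TTGGCGCTAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` in the same way gives a string whose length can be compared with the Gene Length field.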