Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2569 |
Symbol | |
ID | 3704573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2920559 |
End bp | 2922484 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637739049 |
Product | peptidase M41, FtsH |
Protein accession | YP_344552 |
Protein GI | 77166027 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000845189 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCGACA TGGCAAAAAA TATTATTTTG TGGGTAGTGA TTGCCCTAGT ACTGATGTCG GTTTTTAACA GCTTCGACAC GCGTCAGGTA AGCGGCCATC ATATCGATTA CTCTCGATTC ATCGCCGATG TTAAAAGTGG GCAAGTGAAT AAAGTGGTGA TCGACGGGCG TCACATCAGT GGTGAAACAA GCGAAGGGAA ACACTTTACA ACCTATAGTC CAGGTAATGA TCCAGGTTTG ATCGGCGATC TCTTGGGTAA TGGTGTTGTT ATTGAAGCCA AGCCAGAAGA GGGAACAGGG CTTTTGATGC AGGTTTTTAT CTCTTGGTTC CCTATGTTAT TGCTCATTGC AGTATGGATT TTTTTCATGC GGCAGATGCA GGGCGGAGCT GGAGGCCGTG GAGCAATGTC GTTTGGTAAA AGCCGTGCTC GCATGCTGAG CGAAGAGCAG GTGAAAGTTA CCTTTAGTGA TGTTGCTGGA TGTGACGAAG CAAAGGAAGA AGTTCAGGAA TTGGTAGAGT TCTTGCGCGA GCCCGGTCGT TTTCAAAAAT TAGGAGGTAA AATTCCTCGG GGCGTGCTTA TGGTAGGACC GCCAGGAACA GGAAAAACCT TATTGGCAAG GGCGATTGCT GGCGAAGCCA AAGTCCCCTT CTTCACCATT TCCGGCTCTG ATTTTGTGGA AATGTTTGTG GGCGTGGGCG CCTCGCGAGT GCGAGATATG TTTGAGAACG CCAAAAAGCA CGCGCCTTGC ATTATTTTCA TCGACGAAAT TGATGCCGTT GGGCGTCAGC GCGGTGCCGG CCTTGGAGGC GGCCATGATG AGCGGGAGCA AACCCTCAAT CAAATGCTGG TGGAGATGGA TGGGTTTGAA GGCAACGAAG GCGTTATTGT TATTGCTGCG ACCAACCGCC CTGATGTGCT GGATCCGGCT CTGTTGCGGC CAGGACGCTT TGATCGGCAG GTTGTGGTTT CTCTCCCAGA TATCCGAGGG CGAGCGCAAA TCCTTAAAGT TCACCTTCGT AAGGTCCCAG TCGCGGAAGA TGTGGAGCCG GCCCTCATTG CACGAGGTAC CCCGGGTTTC TCTGGCGCCG ATCTTGCTAA CTTGGTCAAT GAGGCCGCTC TGTTTGCCGC TCGTGGCAGC AAGCGCTTGG TCGATATGCA AGACTTGGAG CAAGCCAAGG ATAAAATCCT GATGGGTGTC GAACGGCGTT CAGCAGTGAT GAGTGAGGAC GACAAGAGGC TCACTGCTTA TCATGAAGCT GGGCACGCTA TCATTGGCCG TTTGGTACCC TCGCACGATC CAGTTTACAA GGTAAGCATT ATCCCTCGGG GCCGAGCGCT AGGCGTTACT ATGTTCTTGC CGGAAGAAGA TCGCTACAGC CTGAGCAAGC TACAGATAGA GAGTCAGATT TCCAGCCTCT TTGGCGGGCG CTTAGCTGAA GAATTGATCT TTGGCGTGGA GTACGTAACT ACGGGAGCTT CTAATGATAT CCAGCGTGCT ACCGAGTTAG CCCGTAATAT GGTGACTAAA TGGGGACTTT CGGAAAAGCT GGGCCCACTG GCCTATGGCG AAGAGGAAGG CGAAGTGTTT CTGGGACATT CTGTGACCCA GCATAAGGGT ATTGCGGATA CGACGGCTTC AGAAATTGAT ACCGAAATAC GGGCTATTAT TGACCGCAAT TACCTGCGGG CAAAACAGCT CTTAGAGGAG AATATGGACA AATTGCACGT TATGTCCGAT GCTCTAATGA AATATGAAAC CATTGATAAG GAACAAATTG ATGACATTAT GGCTGGTAAA GAACCACGAC CACCTAAAGT AAGCGGGTCT GATGTGGAAC CGCCCAGTGG GAGTGATGCA GTGCCACCTA AAGGGAAGGA AGAACAGCCT GTAGGGGGGG GATCTATCCC TGCTAGCCAG CATTAA
|
Protein sequence | MSDMAKNIIL WVVIALVLMS VFNSFDTRQV SGHHIDYSRF IADVKSGQVN KVVIDGRHIS GETSEGKHFT TYSPGNDPGL IGDLLGNGVV IEAKPEEGTG LLMQVFISWF PMLLLIAVWI FFMRQMQGGA GGRGAMSFGK SRARMLSEEQ VKVTFSDVAG CDEAKEEVQE LVEFLREPGR FQKLGGKIPR GVLMVGPPGT GKTLLARAIA GEAKVPFFTI SGSDFVEMFV GVGASRVRDM FENAKKHAPC IIFIDEIDAV GRQRGAGLGG GHDEREQTLN QMLVEMDGFE GNEGVIVIAA TNRPDVLDPA LLRPGRFDRQ VVVSLPDIRG RAQILKVHLR KVPVAEDVEP ALIARGTPGF SGADLANLVN EAALFAARGS KRLVDMQDLE QAKDKILMGV ERRSAVMSED DKRLTAYHEA GHAIIGRLVP SHDPVYKVSI IPRGRALGVT MFLPEEDRYS LSKLQIESQI SSLFGGRLAE ELIFGVEYVT TGASNDIQRA TELARNMVTK WGLSEKLGPL AYGEEEGEVF LGHSVTQHKG IADTTASEID TEIRAIIDRN YLRAKQLLEE NMDKLHVMSD ALMKYETIDK EQIDDIMAGK EPRPPKVSGS DVEPPSGSDA VPPKGKEEQP VGGGSIPASQ H
|
| |