Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1798 |
Symbol | |
ID | 3786349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2054989 |
End bp | 2056896 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637811884 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_412487 |
Protein GI | 82702921 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAA AGGATAAAAA GACCCAACTT CCCATGGACA AAAAAACCCA GATCAACTTC TGGTACGTCA TTATCGCGGT ACTCGGCATT CTGCTCATAC AAAGCATGTA TGCCCGCTAT ACCAAGGTCG AGCCCATTCC TTATAGCCGC TTTCATACCC TGCTGGACGA GGACAAGATT GCGGAGATAG CCATCACCGA AAATCATATT TATGGCACCC TGAAGGGGGA AGGTGCCGAC GGGTTGAAGG ATTTCGTGAC CACGCGGGTG GAACCCGAGC TGGCTGATAA GCTCGATCAA CATCACGTGA CGTATACGGG CGTGGTCCAG AGCACATGGA TGCGCGATCT GCTGTCGTGG CTATTGCCGA TGGCGATTTT TTTCGGGATA TGGCTGTTTA TCATCCGCCG CATGAATCCG GGCGGCATGA CGGGCGGGCT GATGTCGATC GGCAAGAGCC GCGCCAAAGT TTTTGTGGAG AAAGAAACCA AGGTAACTTT TGCAGATGTG GCCGGAGTGG ATGAAGCCAA GGAAGAGTTG GAAGAAGTCA TCAATTTTTT GAAGGATCCT GCCGGATACA GCCGCCTGGG CGGGCGGGTA CCCAAGGGTA TCCTGCTGGT GGGGCCACCG GGTACGGGCA AGACGCTGCT GGCCCGTGCC GTGGCAGGCG AAGCGAATGT TCCGTTCTTC TCGATCTCCG GTTCCGAGTT CGTAGAGATG TTCGTCGGGG TCGGGGCCGC GCGCGTGCGC GACCTGTTTG AACAAGCGCG CCAGATGGCT CCCGCCATCA TCTTTATCGA TGAGTTGGAT TCGCTTGGAC GAGCCCGGGG CGCCTATGGG CTTGGGGGTC ATGACGAGAA GGAGCAGACG CTGAATCAAC TGCTGGCCGA GCTTGATGGT TTCGACCCTA AAAGTGGCGT GGTGCTGCTC GCGGCGACCA ACCGGCCCGA AATCCTCGAC CCCGCTCTGC TGCGGGCGGG GCGATTTGAC CGGCAGGTAC TGGTGGATCG GCCGGACAAA GTGGGGCGAG AACAGATTCT TGCCGTACAT CTGAAGAAGG TGAAGCTCGA TCCTGACGTA AAAAAAGAGC AGATTGCAGC TTTGACGCCG GGGTTTACGG GCGCCGATCT CGCCAACCTG GTCAACGAAG CTGCCCTGCT AGCCACCCGC AGGAATGGCG CGGCCGTGAC GATGGGGGAT TTCAATAACG CCATTTTGCG GGTGGTAGCC GGCCTTGAGA AGCGCAACCG CCTGCTCAAT CCGGCAGAGC GCCGGGTGGT CGCGTTTCAT GAACTGGGAC ACGCGATGGT GGCACTCGCC TTGCCTGGAA CCGACGCGGT ACACAAGGTT TCTATTATCC CGCGTGGAAT AGGCGCGCTC GGCTATACAG TGCAGCGGCC GACCGAAGAC CGGTTTCTGA TGACCCGGGC AGAGCTGGAA AACAAAATGG CGGTGATGAT GGGCGGACGA GCAGCCGAAC GTGTGGTATT CAACGAAATA TCGACTGGCG CATCGGATGA CATCGTCCGC GCGACCGACC TTGCCCGTGC GATGGTGCTC CGTTATGGAA TGACCGAGGC CCTCGGGAAT GTTGCGTATG ACCGCGAACG TTCGCAATTT CTGCAACCCG GCATTCCCAT GCCGCAAAGC CGGGACTATA GCGAAGAGAC GGCAAACACG GTTGACAGCA CTGTGCGTGC GCTCGTCGAT GGTGCATTGA AGCGGGCGAT AGAGATATTG GAAAACAACC GCGCGCTGCT CGACCGGACG GCGGAGGAGT TGCTTCGGGT CGAAACGTTG AATGAACCCG AGATAGAGAA CCTGAAACGG CAGATCACCG CCAGGCCCGT GCTTCCTCGC ACCGACACGC CCCTCCCGGA AAAGGACAAA GCCCTGGAGA AAGCGTAG
|
Protein sequence | MDKKDKKTQL PMDKKTQINF WYVIIAVLGI LLIQSMYARY TKVEPIPYSR FHTLLDEDKI AEIAITENHI YGTLKGEGAD GLKDFVTTRV EPELADKLDQ HHVTYTGVVQ STWMRDLLSW LLPMAIFFGI WLFIIRRMNP GGMTGGLMSI GKSRAKVFVE KETKVTFADV AGVDEAKEEL EEVINFLKDP AGYSRLGGRV PKGILLVGPP GTGKTLLARA VAGEANVPFF SISGSEFVEM FVGVGAARVR DLFEQARQMA PAIIFIDELD SLGRARGAYG LGGHDEKEQT LNQLLAELDG FDPKSGVVLL AATNRPEILD PALLRAGRFD RQVLVDRPDK VGREQILAVH LKKVKLDPDV KKEQIAALTP GFTGADLANL VNEAALLATR RNGAAVTMGD FNNAILRVVA GLEKRNRLLN PAERRVVAFH ELGHAMVALA LPGTDAVHKV SIIPRGIGAL GYTVQRPTED RFLMTRAELE NKMAVMMGGR AAERVVFNEI STGASDDIVR ATDLARAMVL RYGMTEALGN VAYDRERSQF LQPGIPMPQS RDYSEETANT VDSTVRALVD GALKRAIEIL ENNRALLDRT AEELLRVETL NEPEIENLKR QITARPVLPR TDTPLPEKDK ALEKA
|
| |