Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1891 |
Symbol | |
ID | 3784263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2177846 |
End bp | 2179726 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637811977 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_412578 |
Protein GI | 82703012 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCG ACAAGAGGGA AGAGCAGGCC GGTGTAGCGG CCCGGCTTGC CGAGAGGTTC ATATCAAACA TACGCGCAAA GGTTGCCTCA TCCAGCTGCC GTTTTGCAGG GGCGATACTG GCAGTTTTGC TGGGATTGGG TCCGGTCATT TCCAGTGCAG AGCCTCCGGA GGGCAGGGGG CCTGGAATGA ACAAGCCGGT TGGCTGGGCG AAGGGTCGCA TCCTGGTCAT GCCGCGTGCC GGCTTGCCGG AAAAGGAACT GGCCAAGGTT CTGGGCGAAC ATGGCGGCAA GGGCAGGAAG ATCGGGCAGA GCGATCTGTA TATCGTCGAC TTGCCGGGTA ATGCCTCCGA GAAGGCGGTG GCGGCAAGGC TTGCGCATCA TCCGGCGCTC AAGTTTGCCG AGATCGACCA GGAAGTCGAA CCCGCGCTCA TTCCGAACGA TCCCTACTAT GGCAGCGCCT GGCATTTGCC CAAGATTGGC GCTCCGTCCG CCTGGGACAG TTCCCTGGGC AGCGGCGTGA CCATCGCCAT CCTGGATTCA GGAGTCGATA GCACTCACCC TGATCTGGCT ACGGAACTGG TGCCAGGCTG GAATTTCTAC GAGAACAATT CCAATACATC GGATGTTTAC GGGCATGGCA CCAAGGTTGC GGGTGCGGCG GGAGCGGCAG GCAATAACGC GCTGGGGGTG GCCTCGGTGG CGGCACGGGC GAAGATCATG CCGATCCGGG TAAGTGGCTC TGATGGGTAT GCCACCTGGA GTGCGATTTC CCAAGGGCTT ATCTATGCGG CAGACCGCGG GGTTCGCGTC GCCAATGCCA GCTTTCTCGG ACTGACCGAC AGTGCGAGTA CTCGCAGTGC GGCGCAATAC CTGAAGAACA AAGGCGGCCT TGTCATTGTC AGCGGGGGCA ACACTGGCGT ACAGCAAAAT TATGCCGCAA CCACCAGCAT GATTCCTGTC TCTGCCACTG ACGGAAATGA CGTACGGACG AGCTGGTCTA GTTATGGTAA TTACATTGCT CTCGCGGCTC CCGGTGCAGG AATCTGGAGC ACTACCAAGG GTGGCGGCTA TGGCGCGGTT TCCGGCACGT CGTTCTCAAG CCCGGTAACG GCTGGGGTGG TGGCGCTCAT GATGGCGGCA AAGCCGACAT TATCCAATAC CCAGATCGAG AGTCTGCTGT ATTCCACGGC GGTGGATCTT GGAACGCCAG GGCGCGATCC TTACTACGGC TATGGGCGAG TGAATGCGGC GCGTGCCGTA CAGGCCGCCG CCGGCACCAC GCTGACAGCA GATACGCAGG CTCCGGCGGT TTCCATCACT TCCCCTGCTG GCGGATCAAG CGTAACAGGC CTGGTCGGCG TAAACGTGGC TGCGAGCGAT AACGTCGGCG TGACTCGTGT CGAGTTGCGG GTCAACAATA CCACAGTCGC AGTCGACACC ACGGCGCCGT TCGCCTTCAC CTGGGATTCG GCAGGCGTAG CCAACGGTAT GGCGAACCTG ACCGCTTATG CATTCGATGC GGCCGGCAAC TCCAAGGCTT CGACCACCGT CTCGGTGAAT GTGGCAAACG GAACGACAAC GGTGGCCAGG GATACTACCG CGCCACAGGT GAAGATCGTA AATCCGGTCA CAGGCAATGT TTCGGGCAGC AATGTGGCCA TCAGCGTGAA TGCAAGCGAT GATAGCGGCG CTTCCGGAAT CACCTGTACG CTCTACATCG ACGGCGTGCT CAAGGCTACC GGAAAAGGAA GTACGTTAGG ATATAGCTGG AATACCCGCC CAAGCAATGT GCGCGCGGGA GCACACACTA TCTGGACGGT CGCCAGAGAT GCGGCGGGCA ATACAGCATC TGCTTCGGTG AATGTGACAG TGATCAAGTA A
|
Protein sequence | MNADKREEQA GVAARLAERF ISNIRAKVAS SSCRFAGAIL AVLLGLGPVI SSAEPPEGRG PGMNKPVGWA KGRILVMPRA GLPEKELAKV LGEHGGKGRK IGQSDLYIVD LPGNASEKAV AARLAHHPAL KFAEIDQEVE PALIPNDPYY GSAWHLPKIG APSAWDSSLG SGVTIAILDS GVDSTHPDLA TELVPGWNFY ENNSNTSDVY GHGTKVAGAA GAAGNNALGV ASVAARAKIM PIRVSGSDGY ATWSAISQGL IYAADRGVRV ANASFLGLTD SASTRSAAQY LKNKGGLVIV SGGNTGVQQN YAATTSMIPV SATDGNDVRT SWSSYGNYIA LAAPGAGIWS TTKGGGYGAV SGTSFSSPVT AGVVALMMAA KPTLSNTQIE SLLYSTAVDL GTPGRDPYYG YGRVNAARAV QAAAGTTLTA DTQAPAVSIT SPAGGSSVTG LVGVNVAASD NVGVTRVELR VNNTTVAVDT TAPFAFTWDS AGVANGMANL TAYAFDAAGN SKASTTVSVN VANGTTTVAR DTTAPQVKIV NPVTGNVSGS NVAISVNASD DSGASGITCT LYIDGVLKAT GKGSTLGYSW NTRPSNVRAG AHTIWTVARD AAGNTASASV NVTVIK
|
| |