Gene Information |
Locus tag | Nmul_A1888 |
Symbol | |
ID | 3784260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2173441 |
End bp | 2175321 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637811974 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_412575 |
Protein GI | 82703009 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.67203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATATCA ATCATAAAGC GCCTGCAACT GGCTCAAGAG AACCCGCTTC AGGCATTATT CAATCAAGCA CTCGCAACTC GCAGCACATC TCCAAACCTT GGTATCCCCT TTGGAGCGGC ATACTGGCAG TTTTGCTGGG ATTGGGTCCG GTCATTTCCA GTGCAGAGCC TCCGGAGGGC AGGGGGCCTG GAATGAACAA GCCGCTTGGC TGGGCGAAGG GTCGCATCCT GGTCATGCCG CGTGCCGGCT TGCCGGAAAA GGAACTGGCC AAGGTTCTGG GCGAACATGG CGGCAAGGGC AGGAAGATCG GGCAGAGCGA TCTGTACATC GTCGACTTGC CGGGTAATGC CTCTGAGAAG GCGGTGGCGG CAAGGCTTGC GCATCATCCG GCGCTCAAGT TTGCCGAGAT CGACCAGGAA GTCGAACCCG CGCTCATACC GAACGATCCC TACTATGGCA GCGCCTGGCA TTTGCCCAAG ATTGGCGCTC CGTCCGCCTG GGACAGTTCC CTGGGCAGCG GCGTGACCAT CGCCATCCTG GATTCAGGAG TTGACCCGAA TCACCCTGAT CTCTCACCCC AGCTTGTTCC AGGCTGGAAT TTCTACGAAA ACAATTCCAA CACGTCGGAT GTTTACGGGC ATGGAACCAA GGTTGCAGGG TCAGCGGCAG CCATGGCAAA CAATGCCCAA GGCGTTGCCG GCGTGGCGGC GTTGGCTAAA ATAATGCCTG TCCGCGTGAG CGGGAGTACT GGATATGCCA CCTGGAGTGC GATTTCCCAA GGGCTTATCT ATGCGGCAGA CCGCGGCGTT CGCGTCGCCA ATGCCAGCTT CCTCGGACTG ACCGACAGTG CGAGCACCCG GAGCGCGGCG CAATACATGA AGGATAAAGG TGGCCTGGTG ATCGTGAGTG GCGGAAATAC GGGAGTGCTC CAGAATTATG CAGTAACGAC ATCGATGGTA GCTGTAGCCG CTACAGATGC GAACGACTTC AGAGCCAGTT GGTCGAGCTA CGGAAACTAC ATCTCCCTTG CCGCGCCTGG AGCCGGAATT TGGACGACGA CTATGGGGGG CGGCTATGGC GCGGCTTCCG GCACATCATT CTCAAGTCCG GTAACGGCCG GCGTGGTGGC GCTCATGATG GCGGCAAAGC CGACATTATC CAATACCCAG ATCGAGAGTC TGCTGTTTTC AACTGCCGTG GATCTGGGCA CAGCGGGGCG CGATCCCTAT TATGGCTATG GACGGGTTGA TGCTGCACGC GCGGTGCAGG CGGCGGCCTC GGCAGTGGTC GCCGGAGACA CGCAGGCCCC GGCGGTTTCC ATCACTTCCC CTGCTGGCGG ATCAAGCGTA ACAGGTCTGG TCGGCGTAAA CGTGGCTGCG AGCGATAACG TCGGCGTGAC CCGTGTCGAG TTGCGGGTCA ACAATACCAC AGTCGCAGTC GACACCACGG CGCCGTTCGC CTTCACCTGG GATTCGGCAG GCGTAGCCAA CGGTATGGCG AACCTGACCG CTTATGCATT CGATGCGGCC GGCAACTCCA AGGCTTCGAC CACCGTCTCG GTGAATGTGG CAAACGGAAC GACAACGGTG GCCAGGGATA CTACCGCGCC ACAGGTGAAG ATCGTAAATC CGGTCACAGG CAATGTTTCG GGAAGCAACG TGCCCATCAG CGTAAATGCA AGCGATGATA GCGGCGCTTC CGGAATTACC TATGCGCTCT ACATCGACGG CGTGCTCAAG GCTACCGGAA AGGGAAGTAC GCTGGGTTAC AACTGGAATA TACGTAATGT TGCCGCGGGT 
GCACATACTG TTCAGGTAGT CGCCAAGGAT GCGGCGGGAA ATAGCGCATC CTCATCCGTC AGCGTGACGG TAGTCAAGTA G
|
Protein sequence | MNINHKAPAT GSREPASGII QSSTRNSQHI SKPWYPLWSG ILAVLLGLGP VISSAEPPEG RGPGMNKPLG WAKGRILVMP RAGLPEKELA KVLGEHGGKG RKIGQSDLYI VDLPGNASEK AVAARLAHHP ALKFAEIDQE VEPALIPNDP YYGSAWHLPK IGAPSAWDSS LGSGVTIAIL DSGVDPNHPD LSPQLVPGWN FYENNSNTSD VYGHGTKVAG SAAAMANNAQ GVAGVAALAK IMPVRVSGST GYATWSAISQ GLIYAADRGV RVANASFLGL TDSASTRSAA QYMKDKGGLV IVSGGNTGVL QNYAVTTSMV AVAATDANDF RASWSSYGNY ISLAAPGAGI WTTTMGGGYG AASGTSFSSP VTAGVVALMM AAKPTLSNTQ IESLLFSTAV DLGTAGRDPY YGYGRVDAAR AVQAAASAVV AGDTQAPAVS ITSPAGGSSV TGLVGVNVAA SDNVGVTRVE LRVNNTTVAV DTTAPFAFTW DSAGVANGMA NLTAYAFDAA GNSKASTTVS VNVANGTTTV ARDTTAPQVK IVNPVTGNVS GSNVPISVNA SDDSGASGIT YALYIDGVLK ATGKGSTLGY NWNIRNVAAG AHTVQVVAKD AAGNSASSSV SVTVVK
|
| |
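The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon (TAG in the gene sequence above). The function and variable names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table for locus tag Nmul_A1888.
start_bp = 2173441
end_bp = 2175321
gene_length_bp = 1881
protein_length_aa = 626


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the ones listed in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold: the span covers 1881 bp, which corresponds to 627 codons, or 626 amino acids once the stop codon is excluded.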