Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1749 |
Symbol | |
ID | 3786051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2001686 |
End bp | 2003107 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637811835 |
Product | peptidase S1C, Do |
Protein accession | YP_412438 |
Protein GI | 82702872 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCTA AATTTTTACT GGTGGTGCTG CTTGGATTAT CCACGAGCGT GTTTGCCCAG GCGCCGGCGC GTGAATTGCC GGATTTTACC GGGTTGGTGG AAAGAGAAGG ACCCTCGGTC GTAAACATCA GCACCGTTCA GTCAGGCAAC TCAACCACGG AACGGGCTTT TCCGGGAATA CCGAACATAC CCGAGGATGA TCCGTTTTTC GAGTTTTTCC GGAGACACAT GCAGCCGCAT GGAGGAATGC CACGAGATTT CGAATCCAGA TCGGTAGGCT CCGGCTTTAT TATCAGTTCG GATGGCTATA TTCTGACAAA TACCCATCTG GTGGATGGAG CCGATGAAAT CAATGTGAAG CTTACGGACA AACGGGAGTT CCGGGCAAAG CTCATCGGCG CGGATCGCAA GACGGATATT GCTCTCCTCA AGATCGATGC TACCGGACTG CCGAAAGTCA CCCAGGGGGA TCCCAATAAC ATGAAGGTAG GAGAATGGGT GGTAGCGATC GGATCACCGT TCGGCTTTGA AAACAGCGTC ACTGCCGGAA TTGTCAGCGC CAAAGGACGC TCTCTGGCAC AGGAGAATTT CGTGCCCTTC ATCCAGACGG ATGTCGCGAT CAACCCCGGG AATTCCGGCG GACCGCTATT CAATATGAAC GGTGAGGTGG TCGGTGTCAA TTCCCAGATC TACAGTCGCA CGGGTGGATT CATGGGTTTG TCATTCGCCA TTCCCATCGA TGTTGCCCGG GATATATCGA ATCAGTTGAT TGCGAGCGGC AAGGTAAGCC GAGGCAGAAT CGGCGTGCTG ATTCAGGAAA TCACAAAGGA ACTGGCGGAA TCGTTCGGTT TACCCAAGCC AGCCGGCGCG CTGGTAGCTT CTGTCCAGAA AGGCGGCCCG GCAGACAAGG CAGGCATTCA AGCACGGGAT GTCATTCTCA AGTTCGACGG CAAGACCGTG AATTCCTCGG GCGATCTGCC ACGCATAGTA GGCTCGACCA AGCCAGGCAC GAAAGTGCAG ATGCAGGTGT GGCGAAATGG GTCGACCAAG GAGTTCACGA TTACAGTGGA TGAGCTTCCG GAGGATGAAA AACCGGCAGC GCGTTCTGGA AAGCGGGGCA AGACGCCGGA TACAGCGAAT CGCATCGGCT TGAGCCTGAT CGAGTTGACT CCCGATCAGA AGAAGGAACT GGAAACTGAA AGCGGGCTTC TGGTCGAGGA CATGGTGCCT GGTATCGCGA GTCGCGCGGG GGTCAGACCC GGTGATGTCA TTCTGAGCAT CAATAATCAG GATGTCAAAA CGGTTGATCA ATTCAACCAG TTGCTCAACA AGGTCGAGAA AGGACGCAAT ATCGCCCTGC TGGTCAAACG CGGTGATACA GCCACCTTCA TTACCATGAA GATGAATGGT GAACACAGAT AA
|
Protein sequence | MIAKFLLVVL LGLSTSVFAQ APARELPDFT GLVEREGPSV VNISTVQSGN STTERAFPGI PNIPEDDPFF EFFRRHMQPH GGMPRDFESR SVGSGFIISS DGYILTNTHL VDGADEINVK LTDKREFRAK LIGADRKTDI ALLKIDATGL PKVTQGDPNN MKVGEWVVAI GSPFGFENSV TAGIVSAKGR SLAQENFVPF IQTDVAINPG NSGGPLFNMN GEVVGVNSQI YSRTGGFMGL SFAIPIDVAR DISNQLIASG KVSRGRIGVL IQEITKELAE SFGLPKPAGA LVASVQKGGP ADKAGIQARD VILKFDGKTV NSSGDLPRIV GSTKPGTKVQ MQVWRNGSTK EFTITVDELP EDEKPAARSG KRGKTPDTAN RIGLSLIELT PDQKKELETE SGLLVEDMVP GIASRAGVRP GDVILSINNQ DVKTVDQFNQ LLNKVEKGRN IALLVKRGDT ATFITMKMNG EHR
|
| |