Gene Information |
Locus tag | Nmul_A0510 |
Symbol | |
ID | 3785639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 576365 |
End bp | 577774 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637810592 |
Product | peptidase S1C, Do |
Protein accession | YP_411210 |
Protein GI | 82701644 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
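
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 576365
end_bp = 577774
gene_length_bp = 1410
protein_length_aa = 469

# Assumption: coordinates are 1-based and inclusive on the replicon.
span = end_bp - start_bp + 1
assert span == gene_length_bp

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon that does not contribute an amino acid.
assert span % 3 == 0
codons = span // 3
assert codons - 1 == protein_length_aa

print(span, codons - 1)  # prints: 1410 469
```

Under these assumptions the table is internally consistent: 1410 bp is 470 codons, which gives 469 residues plus one stop codon.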
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.727418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAC CGTCCGGATC AGTTTCGCAG CCAGCTCCAC TGTCAGTTTC ACCGCTGTCA GTTTCACTCT GGCTGGTTTT TATCGGCTTG CTGGTCATTG CGCTGCCGGG CTATCCGGCT TTGCCTTTCA GTGACACCAA AAATGGTGTG CCGACCCTGG CGCCGGTGCT TCAGGACGTG ACCCCGGCGG TCGTCAATAT TTCCGTAGTG ACGCGCTCGG CCATAGAAGA GAATCCCTTG TTTCGGGACC CTTTCTTCCG CCGCTTTTTC AATCTTCCGG AGCAGGAGGC CCGGCCGGAG CGCAGTGTGG GTTCCGGCGT GATCATGGAT GCGGCCAAGG GTTATGTGGT GACCAACTAC CACGTGATCA AGGATGCGCA GCAGGTGGTT GTGACACTCA AAGACAGGCG CCAATTCCAG GCGAAGCTGG TGGGGACGGA CCCGGGTACC GATATTGCCC TGCTGAAAAT CGATGCCAAG AACCTTAAGG CGCTGCGCCT CGGTGATTCC GATCTGCTCA ACGTCGGGGA TTTTGTCATC GCGATCGGGA ATCCGTTCGG TCTGGGGCAA ACGGTAACAT CCGGCATCGT CAGCGCATTG GGAAGAAGCG GACTGGATAT CGAAGGCTAT GAAGACTTCA TCCAGACCGA CGCCTCCATC AATCCGGGCA ACTCCGGCGG AGCCCTCATC AACCTCAAGG GGGAACTGAT CGGCATCAAC ACCGCCATCA TTGGTCCCGC AGGCGGCAAC GTCGGCATTG GTTTTGCGGT TCCCAGTGTG ATGGTGAAGG CAGTCCTCGA TCAAATCCTG CGCTTCGGCG AGGTACGCCG CGGCAGACTG GGAGCGAGCA GCGAAGATAT TACGCACGAC CTGGCGGCCT CACTGGGGTT GCCTTCCACG GAAGGCGCCA TCATCAGTGC GGTCGAGCCG GGATCGCCTG CGGAAAAAGC CGGGGTGAAA CCGCGTGATG TGATAACGAC TGTCAACGGA AAACCTGTGC GGAACTCGAT TGATTTGCGC AACAAGGTTG GACTGATGCC CATCGGGGAG ACGCTGGATC TCGGGCTGAT CAGGAATGGG AAGCGGCTGA CTGCGAAAGT AAAGATAGCC AAGCTGGCGG AGGCTTCCAG TGCAGGTGCG GAGGCCGTGC CGGAGGTCTC AGGCGCAACC GTGGCGGACT TGAAACCCGG CGCCCGCCGG GGAATTGAGG GCGTGATGGT AACAGACGTG GAAGTCAATA GTCCCGCCTG GCTGCGGGGC ATCCGGCCGG GGGACCTCAT CATTGGCGTC AATCGCCGCC AGGTCCGTTC GGTGCAGGAG TTGCTTGCTG CTCTCAAGGC CAGCAGCCAG GGACGGCTCA CCCTAAGCCT GATACGGGGC GACTTCAGGG TGACTATCAG GATCAGGTAA
|
Protein sequence | MRKPSGSVSQ PAPLSVSPLS VSLWLVFIGL LVIALPGYPA LPFSDTKNGV PTLAPVLQDV TPAVVNISVV TRSAIEENPL FRDPFFRRFF NLPEQEARPE RSVGSGVIMD AAKGYVVTNY HVIKDAQQVV VTLKDRRQFQ AKLVGTDPGT DIALLKIDAK NLKALRLGDS DLLNVGDFVI AIGNPFGLGQ TVTSGIVSAL GRSGLDIEGY EDFIQTDASI NPGNSGGALI NLKGELIGIN TAIIGPAGGN VGIGFAVPSV MVKAVLDQIL RFGEVRRGRL GASSEDITHD LAASLGLPST EGAIISAVEP GSPAEKAGVK PRDVITTVNG KPVRNSIDLR NKVGLMPIGE TLDLGLIRNG KRLTAKVKIA KLAEASSAGA EAVPEVSGAT VADLKPGARR GIEGVMVTDV EVNSPAWLRG IRPGDLIIGV NRRQVRSVQE LLAALKASSQ GRLTLSLIRG DFRVTIRIR
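
The relationship between the two sequences can be spot-checked by translating the ends of the gene sequence. The sketch below is illustrative and not part of the original record. It builds the standard codon table by hand, on the understanding that translation table 11 uses the same internal codon assignments as the standard code and differs only in which start codons it permits. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI "TCAG" order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons only; trailing 1-2 bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First two 10-base blocks of the gene sequence listed above.
head = clean("ATGAGAAAAC CGTCCGGATC")
print(translate(head))  # MRKPSG, the first six residues of the protein

# Last nine bases of the gene sequence (the final three codons).
tail = "ATCAGGTAA"
print(translate(tail))  # IR*, the final two residues followed by the stop
```

To check the whole gene, paste the full gene sequence into `clean` and compare `translate(...)` without the trailing `*` to the protein sequence with its spaces removed.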