Gene Information |
Locus tag | Namu_2158 |
Symbol | |
ID | 8447769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2378419 |
End bp | 2379942 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645041281 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003201525 |
Protein GI | 258652369 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0489221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0113989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGCACT CCCTGACGGT GCCCGAGAAG GACAATCGGT CCGTGCCGAA CGGATCACCC ACCAACGGAT CGCCCAGCAA CGGCGTGCGC CGCCCGCCCG CCCCGGTCCC GTCGCCCGCT CCCGACCCCG TGCCGGCCGG CGCCTACGCC CGGCCGGGTG GGGTGAACGG TTCCTTCGGT CCCCGGCCCC AGCGCGGGCC GGTCGTCCTC GGTGCGCCGC CGGTGCCCGA ATACCTGGCC GATGCCTTCG GACGCGCACC CGGCCAGAGC GGCATCGGGG CCCCACCACC CACCGCCGAA CCACCGGCCG AGCCGCCGGC CCAGGACCCC TGGCGGGACC CGGCATCCGG CGCCCAGCTG GGCGCCCCGG CCCTGACCCG CGAGTCGGCA CCCCCGCCGC CGGCCCCGCC GGCGCAGCGA TTCACCCTGC GCCAGGCCCT GTTCGAGCGC CGGCTGCGCC CCTCGGCGAT CATCGGCATC CTGGTCAGCG CCCTGGTCAT CGGCGCGCTC GGCGCCGGCA TCGGGGTCTT CGCCGCCGGC CGGCTGCCCG CGCCGACCAC CGATCCCTCC TTCGAACTGG CTCCGGTGTC ACCGGGCATC ACCCGGGAGC CGGGATCGGT GGCCGACATC GCGGCCAAGG TCCTGCCGTC CGTGGTGTCG TTGGAGATCC GCACCGGTGA CGTGGGGGAG ACCGGCTCCG GCGTGGTCAT CGACGGCAGC GGCTACATCC TGACCAACAA CCACGTCGTG TCCAGCGCGG CGACCGACCC GTCGGCGACG CTCACCGTGA TCTTCGACGA CGCCGCGCAG AGCCGGGTGC CCGCGGTCAT CGTCGGTCGC GACCCGCTGA CCGACCTGGC CGTGATCAAG GTCGACGTCA CCGACGCCAC GGTCGCGCAG ATTGGCGACT CCAATGCGCT TGCCGTGGGC GATCCGGTCA TCGCCATCGG CTCGCCGCTC GGCCTGGCCG GCACCGTCAC CACCGGCATC GTCTCGGCCA AGAACCGCCC GGTGCGGCTG CAGGGCGGCG GGTCGGACAC CGACGCGGTG ATCGACGCCA TCCAGACCGA CGCCGCGGTC AACCCGGGCA ACTCCGGTGG CCCGCTGGTC GACGCCTCCG GCGCCGTGGT CGGCATCAAC TCGGCCATCC GCACCCTCGG CGGGGACTCC AGCGGCTCCA TCGGGCTGGG CTTTGCCATC CCGATCGCCA CGGCCAAGGA CGTCGCCGAG CAGATCATCC GCTCGGGCAG CGTGCAGCAC TCGACCATCG GGGTGAACGC CCGGTCGGCC ACCGACGGGA TCACCGACGG GGCCCAGGTG CAGAACGTCC AGGGCGGCGG GCCGGCCGCG GCGGCCGGCA TCGCCGAGGG GGACGTGATC ACCAAGGTGG GGGACCGGCA GGTGGGCAAC GCGGACGAGT TGATCGTCGC GGTCCGGCAG AACCCGGTCG GGGCCACGGT CCCGGTGGTG CTGCTGCGCG ACGGGCGGGC GATGACCGTC TCGGTCACGC TCGGCTCGGA GTAA
|
Protein sequence | MGHSLTVPEK DNRSVPNGSP TNGSPSNGVR RPPAPVPSPA PDPVPAGAYA RPGGVNGSFG PRPQRGPVVL GAPPVPEYLA DAFGRAPGQS GIGAPPPTAE PPAEPPAQDP WRDPASGAQL GAPALTRESA PPPPAPPAQR FTLRQALFER RLRPSAIIGI LVSALVIGAL GAGIGVFAAG RLPAPTTDPS FELAPVSPGI TREPGSVADI AAKVLPSVVS LEIRTGDVGE TGSGVVIDGS GYILTNNHVV SSAATDPSAT LTVIFDDAAQ SRVPAVIVGR DPLTDLAVIK VDVTDATVAQ IGDSNALAVG DPVIAIGSPL GLAGTVTTGI VSAKNRPVRL QGGGSDTDAV IDAIQTDAAV NPGNSGGPLV DASGAVVGIN SAIRTLGGDS SGSIGLGFAI PIATAKDVAE QIIRSGSVQH STIGVNARSA TDGITDGAQV QNVQGGGPAA AAGIAEGDVI TKVGDRQVGN ADELIVAVRQ NPVGATVPVV LLRDGRAMTV SVTLGSE
|
| |
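The coordinate and length fields in this record can be cross-checked against each other with a few lines of arithmetic. The following is a minimal sketch, not part of the database itself: the helper names (`ungroup`, `gene_length`, `protein_length`, `gc_fraction`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(seq.split())


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = ungroup(seq).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    # Values copied from the Start bp and End bp fields of this record.
    length = gene_length(2378419, 2379942)
    print(length)                  # 1524, matching the Gene Length field
    print(protein_length(length))  # 507, matching the Protein Length field
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the listed GC content; the grouping spaces are stripped by `ungroup` first.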