Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1788 |
Symbol | |
ID | 4597700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 1899776 |
End bp | 1901683 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639776387 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_922987 |
Protein GI | 119716022 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCCT CGCGCACCCG GCTGCTCGCG CTCACCAGCG CCCTCGCGGC CGTCCTCTCC CTCACCGGCG TCGCCTCCGC GGCGGCGCCG GCGGCACCTG CGGCCGCCAC GGTCGGCTCC GACCTCCAGG GCGCGCACCG CGGCCTGGAC CGCCTGCTCG CCGACGGTAC GCCGAACCAC CGGGCGATCG TCACGTTCGC CGCCGTGCCG ACCAGCACCC AGATCGGCGC GCTGCAGGCC CTCGGCCTCG TCGTGCAGCC GATGTCCCAC CTGCCGCTCG CCCTCGTCGA GGGCCCGGTG CCCGCGATGG TGCAGGCGGT CACGGCCGGC ATCGGCCTCG ACGTCTACCC CGACGAGCGG CTCCAGCTCC TCGACACCCC GTCGACCAAC GCGATGTCGT CGAGCCCGGC GGCGGCCCAG GCCCTGCGCA CCCGCGGCTT CACCGGCAAG GGCGTGACCG TCGGCGTCGT CGACTCCGGC TGCGACGCGA CGCACTCCGA CCTCGCCGAC CACGTCGTGC ACAACGTGAG CCTGCTCAGC CCCGAGTACG CCAACGCCGG CACCGACCCG GCGATCGTCG TACCCGTCGA CCAGGGCCCG GTGAGCAACA CCGACCTCGG CAGCGGCCAC GGGACGCACG TCGCGGGCAT CATCGCCGCC GACTCCTCCT CGGCCGAGGA CGGCAGTCGG TACGGCGTCG CCCCGGACGC CGACCTCGCC TGCTTCGCGA TCGGCGCAGT GCTGTTCACG ACCGCGGTCG TCACCGCCTA CGACTACATG CTCGACCAGC CGGACCTGCT CGGCATCGAC GTCGTCAACA ACTCCTGGGG CAACAGCTAC CGCCAGTTCG ACCCCGCCGA CCCGGTCGCC GTCGCCACCA AGGCCGTGGC CGACCGTGGC GTGACCGTGG TGTTCGCCGC CGGCAACTCC GGCAGCGGCG ACGTCCCGAT GAGCCTGAAC CCGTTCTCCC AGTCGCCCTG GGTGATCTCC GTGGCGGCCG GCACCCTGGA CCGGCACCGC GGCGACTTCT CCTCCAACGG CCTGGTCCAC GACAACTCGC AACCCACGGC CATCGGCACC GAGGGACACA CCACCTACAC CGGCGACCGG ATCGGGCTGG TGCACCCGGA CCTCACCGCC CCCGGCGTCG ACATCGGCTC GACCTGCGAC AGCGCCGGCA CCCTGATCGG GCCGTGCGGA CCGGACGAGA ACGCCTCGGC CTCGGGCACC TCGATGGCCT CGCCGCACAT CGCCGGCGCG GCCGCGGTGC TGCTGCAGGC CCAGCCCCGG CTCAGCCCGG AGCAGGTGCG GCTCGCCCTG CAGGCGACGG CGACGCCGGT CCAGGCGACG GGCGGCCCGG CCGCGCTGCC GTTCTGGGAG GTCGGGTACG GCTACGCCAA CCTCGACCGC GCGGTCCAGC TGGTGCGCTC CGACGGCTGG CAGGGGCGGC TGCGTGCCGC CGCCCACCGG GCGGACCGCC GGGTGCTCGC CGCGGACGGC ACCGCGGTCG TCCGATCCGA CTTCTTCGTC CACGAGGCGC CCCCGGCGAC CGCGGGCGGC AGTGACAGCG CGTCGTACGA CGTGCCCGTG TCCGCGCGCA CCCGCGGGCT CGCCGTGAGC CTGGCGTTCC CGTCCGGCGG CAGCGTCGGC GCCAGCCTGT TCAGCTACAC CGTGCAGGTC CTCGACCCCA GCGGGAAGGT GATCGCGACG ACCACTTCGG ACCCGGTCGC GGGCTCGGGC ACCGCGCTGG CGACGGTCCG GCTGCCCCAG GGCGCGGAGG CCGGGACGTA CACGTTCGAG GTCACCGGCG ACTACGCCGC CTCCGACCCG GACACCGTCG ACAGCGACTC GCTGCTGGGC CGGTTCGTCA CCCTGCACGT GGCCCAGCTG CGGAGCAGCC GGCGCTAG
|
Protein sequence | MPSSRTRLLA LTSALAAVLS LTGVASAAAP AAPAAATVGS DLQGAHRGLD RLLADGTPNH RAIVTFAAVP TSTQIGALQA LGLVVQPMSH LPLALVEGPV PAMVQAVTAG IGLDVYPDER LQLLDTPSTN AMSSSPAAAQ ALRTRGFTGK GVTVGVVDSG CDATHSDLAD HVVHNVSLLS PEYANAGTDP AIVVPVDQGP VSNTDLGSGH GTHVAGIIAA DSSSAEDGSR YGVAPDADLA CFAIGAVLFT TAVVTAYDYM LDQPDLLGID VVNNSWGNSY RQFDPADPVA VATKAVADRG VTVVFAAGNS GSGDVPMSLN PFSQSPWVIS VAAGTLDRHR GDFSSNGLVH DNSQPTAIGT EGHTTYTGDR IGLVHPDLTA PGVDIGSTCD SAGTLIGPCG PDENASASGT SMASPHIAGA AAVLLQAQPR LSPEQVRLAL QATATPVQAT GGPAALPFWE VGYGYANLDR AVQLVRSDGW QGRLRAAAHR ADRRVLAADG TAVVRSDFFV HEAPPATAGG SDSASYDVPV SARTRGLAVS LAFPSGGSVG ASLFSYTVQV LDPSGKVIAT TTSDPVAGSG TALATVRLPQ GAEAGTYTFE VTGDYAASDP DTVDSDSLLG RFVTLHVAQL RSSRR
|
| |