Gene Information |
Locus tag | Ndas_3888 |
Symbol | |
ID | 9247759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4659989 |
End bp | 4661509 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003681791 |
Protein GI | 297562817 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTTG ACCCCGGACC GGGCGAGCAG CCCGGCGCCC CCCGATCCGA CCAGGCCTCC GCGCACCCCG TGGGCGGAGG GTTCCCCGGT TCCGCCAACC CCTCCGGTCC CGCCGTCTAC GGCGCTTCCG CCGCCAACCC CGGCTCCGGT GATCCCTCGG GCTCCACCGC CCACACCAGC TTCGCCAACC CCTCCGGTCC CGGCCACCCG GGTATCGGCG GTCCCTCCGG TCCTGCCAAT CCCATGAACC CCGGCGCTCC TTCGGCTCCC GGGCAGCACA CCGGCCACAC CGGTCAGTCT GCCGCCCACG GCGGCTACGC GACGGCCTTC CAGGCGGGGC ACGCCGCCCC CGGCCCGGGC GCTCCGCCCC CCGGTGTCCC CGGACCCCAC ACGACCGGCC CCCACAGCGC CCCGCAGCCC CCCGCGCCCA AGCGGCCCCG GCGCGGCGTC CCGGTGTGGA TGGCGCTGTC GGGCATGCTC GTCGTCGCGC TCATCGCCGG AGGCGCCGGC GGTGTCGCGG GCAACCTCCT CGACGGCTCC TCCACTGACG AGGCCGCGCA GGAGGAGGGT CCGGTCATGA ACGAGCCGCC GCCCGAGGCC CCCCGGCGCG ACCCCGACAC CATCGCCGGT GTGGCCCAGC GGGTCAGCCC GAGCGTGGTG TTCATCCACA GCGCCGATCC CACCATCCCG AGCAGCGGCT CCGGGTTCGT CATCGACGGG AACTACGTGG TGACCAACGA CCACGTCTCC GCCGGTCTGG AGGCGGACGG CATCGTCGTG GAGTACAGCG ACGGCAGCCT CTCCAGCGCC TCCGTGGTCG GCTCCGACCC CAGCTCCGAC CTCGCGGTGC TGTCGCTGGA CGACCCGATC GACGTCGAGC CGCTCCAGTT CGGCGACTCC GAGCAGGTCA TCGTCGGTGA CGAGGTGATC GCCATCGGCG CGCCCCTGGG CCTGTCCGGA ACCGTCACGC AGGGCATCAT CAGCGCCGTC AACCGCCCGG TCAGCTCCGG CGAGGGCGAG AACGCCAGCC GCTTCTACGC GCTCCAGACC GACGCCGCCA TCAACCCGGG CAACTCCGGC GGCCCGCTGG TGGACCTGGA GGGCCGGGTC ATCGGGGTCA ACTCGATGAT CGTCACCATG AGCTCCATGG GGGAGCCCAC GGGCAACATC GGCCTGGGCT TCGCGATCCC GTCGGTGGAG GCCGAACGCG TCGTGAACCG CCTCGTCGAG TACGGCGAGA CCAGCTACGC CGACATCGGG GCCGAGATCG ACCTGGACAG TCCGGTCGCG GGCGCGGTCA TCGCCGACGG CGGGGGCGCG GTGGAGAGCG GCGGCCCGGC CGACGAGGCG GGGCTGGAGC CGGGCGACGT CATCCTCTCC CTGGACGGGC GCCCGGTGAA CTCGGGCCAG GAGCTGCTCG CCATGCTGCG CAGCCGCAGC CCGGGCGAGG AGGTCGAGGT CGAGTTCGAC CGCGACGGCC GACGCGACAC CGTCACGGTC ACGCTGGGCT CGTCGGACTG A
|
Protein sequence | MNLDPGPGEQ PGAPRSDQAS AHPVGGGFPG SANPSGPAVY GASAANPGSG DPSGSTAHTS FANPSGPGHP GIGGPSGPAN PMNPGAPSAP GQHTGHTGQS AAHGGYATAF QAGHAAPGPG APPPGVPGPH TTGPHSAPQP PAPKRPRRGV PVWMALSGML VVALIAGGAG GVAGNLLDGS STDEAAQEEG PVMNEPPPEA PRRDPDTIAG VAQRVSPSVV FIHSADPTIP SSGSGFVIDG NYVVTNDHVS AGLEADGIVV EYSDGSLSSA SVVGSDPSSD LAVLSLDDPI DVEPLQFGDS EQVIVGDEVI AIGAPLGLSG TVTQGIISAV NRPVSSGEGE NASRFYALQT DAAINPGNSG GPLVDLEGRV IGVNSMIVTM SSMGEPTGNI GLGFAIPSVE AERVVNRLVE YGETSYADIG AEIDLDSPVA GAVIADGGGA VESGGPADEA GLEPGDVILS LDGRPVNSGQ ELLAMLRSRS PGEEVEVEFD RDGRRDTVTV TLGSSD
|
| |
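The coordinate, length, and sequence fields in this record can be cross-checked against each other. The sketch below is one way to do that in Python. It assumes the Start bp and End bp values are 1-based inclusive coordinates (which matches the stated 1521 bp gene length) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical helpers introduced for illustration and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(grouped: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Values copied from the Gene Information table above.
length = gene_length(4659989, 4661509)
print(length)                           # 1521
print(expected_protein_length(length))  # 506
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the reported GC content of 74%. Because the record lists the gene on the minus strand and the sequence already begins with ATG and ends with TGA, it appears to be given in coding orientation, so no reverse complement is needed for these checks.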