Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0402 |
Symbol | |
ID | 9244240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 492540 |
End bp | 494228 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | signal peptide peptidase SppA, 36K type |
Protein accession | YP_003678356 |
Protein GI | 297559382 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGACC TCGCGAAGAT CATGAAGACC CGACAGCGCC CCCAAGGGCC GCTGGTGCTG GAGCTGGACC TGACGGAGGG GGTCGCCGAC CAGGCCCCCG GGGATCCGTT CAGCCAGATC ATGAACCGGC GCAGGCAGCA GTACCTGGAC GTCCTGGAGG GGATCCGCCG CGGCGCGCGC GACCCGCGGG TGGCGGCCCT CCTGGTCCGG GTGGACGCCC GGTCCCTGGG ATTCGCCAAG GTCCAGGAGC TGCGCGACAC CGTCGCCGAC TTCCGCGCGG CGGGCAAACC CGCGGTGGCC TGGGCCGACT CCTTCGGCGA GACCGGCGAG GGCAACCTGC CCTACTACCT GGCCTGCGCG TTCTCGCGCG TGGTCATGGC GCCCACCGGC GTGCTCGGCC TGACCGGCCT GATGATGCGC ACGACCTTCG TCAAGGGCGC CCTGGACAAG CTGGACGTGT CCTACGAGGT GGGCGCGCGC CACGAGTACA AGAACGCGAT GAACAGCGTC ACCGAGACCG GTTACACCGC CGCCCAGCGC GAGGCCAGCG ACCGGATCGT CACCTCGCTG GGCGACCAGA TCGTCGAGGC GGTCTCCCTG GCCCGGGGGC TGCCCCGGGA GGAGGTGCGC GCGCTGGTCT CCAAGGGCCC CTTCCTGGCC CGCGAGGCGG TCGAGCACAA GCTGGTGGAC GGGCTCGCCC ACCGGGACGA GGTGTACGCG CAGCTGTTCG GGGAGCTGAG CGGTGAGCCT CAGCTGCAGT TCGTCACCCG CTACCACCGC AAGCACACCG CGCCCCAGCA GCTGTCCCGC AACACCGGGG GCCACATCGC GCTGATCTCG GCCACCGGAA CGATCAGCCT GGGCCGGACC CGGCGCTCGC CCCTGGGCGG CGGCACCGTC ATGGGCTCGG ACACCGTGGC GGCGGCCTTC CGCGCCGCGC GCAAGGACCC CCAGGTCAAG GCGGTCGTGT TCCGGGTGGA CAGCCGGGGA GGCTCCCCGA CCGCCTCCGA CGCGATCCGC CGCGAGACCG AGCTGACCAG CAAGGCGGGC ATCCCCGTGG TGGCCGTGAT GGGCGACGTC GCCGCCTCCG GCGGCTACTA CGTGACCCTG GGCTCGGACG CGGTCGTCGC TCAGCCGGGC ACCCTGACCG GCTCCATCGG CGTGATCACC GGCAAACCGG TCCTGGGCGC GCTGAAGGAG CAGTACGGCG TGACCAGCGA CTCCGTGCGC ACCGGCGAGC ACGCGGGCAT GTTCGACACC GACCGGCCCT TCACCGAGTC CGAGTGGGAG CGGGTCAACG CGCTCCTGGA CGAGATCTAC GAGGACTTCA CCGGCAAGGT CGCCGCCGCG CGCGGGATGA CCCGCGAGCA GGTGCACGAG GTGGCCCGGG GCCGGGTGTG GACCGGCCGC GACGCCCACG AGCGCGGTCT GGTGGACGAG CTGGGCGGCC TGGAGACCGC CGTCCGGCTG GCCCGTGAGA AGGCCGGCGC GGGGCCGCTC CCGGTGCGGC CCTTCCCCCG CCCGAACCCG CTCGACCGGA TCCGCCAACA CGAGTCCAGC GAGGACGTGG GTGCCTCGGG CCCGCAGACC GTGGTCAGCG CCTGGGGTCC CCTGGAGCAC GTGGCCGTGG CGATGGGGCT GCCGGTCGGC GGCCCGCTGA TGATGCCGGG GCTGTGGGAG ATCCGTTGA
|
Protein sequence | MVDLAKIMKT RQRPQGPLVL ELDLTEGVAD QAPGDPFSQI MNRRRQQYLD VLEGIRRGAR DPRVAALLVR VDARSLGFAK VQELRDTVAD FRAAGKPAVA WADSFGETGE GNLPYYLACA FSRVVMAPTG VLGLTGLMMR TTFVKGALDK LDVSYEVGAR HEYKNAMNSV TETGYTAAQR EASDRIVTSL GDQIVEAVSL ARGLPREEVR ALVSKGPFLA REAVEHKLVD GLAHRDEVYA QLFGELSGEP QLQFVTRYHR KHTAPQQLSR NTGGHIALIS ATGTISLGRT RRSPLGGGTV MGSDTVAAAF RAARKDPQVK AVVFRVDSRG GSPTASDAIR RETELTSKAG IPVVAVMGDV AASGGYYVTL GSDAVVAQPG TLTGSIGVIT GKPVLGALKE QYGVTSDSVR TGEHAGMFDT DRPFTESEWE RVNALLDEIY EDFTGKVAAA RGMTREQVHE VARGRVWTGR DAHERGLVDE LGGLETAVRL AREKAGAGPL PVRPFPRPNP LDRIRQHESS EDVGASGPQT VVSAWGPLEH VAVAMGLPVG GPLMMPGLWE IR
|
| |