Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4209 |
Symbol | |
ID | 9248083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5026408 |
End bp | 5027577 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | peptidase S8 and S53 subtilisin kexin sedolisin |
Protein accession | YP_003682107 |
Protein GI | 297563133 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCGGGG GCAACAGCCG CAACAGGATA CACACACGCC GCGCCCGTGG CGGGGCCAGC GCGGTCTTCG CGGCCGCCAC CGCCGCCGTC CTCGGTCTGA CCTCGGCCCC GGCCGCGGCG GACCTGATCC CCGACTACCG GCCCGAGCAG TGGGGACTCC AGGCCGTGGG GGCGCCGCAG CTGTGGGAGG AGAACCAGGG TCAGGGCGCC ACGGTCGCGC TTCCGGGCGT CTCCGTCGAC GAGGGGCACC CGGACCTGGT CGACAACGTC CAGCTGGACA CGCGGTTCGG TGAGAACGAC GGCGACATCG AGCAGGGCAA CGCCGCGGCC GGACTGGTCG CGGCCCACGG GTACGGCAGG GACGCCGACG GCGGCGTCCT GGGCGTGGCG CCCGAGGCCA CCGTGCTCGT GCTGCCCACG GGGGACCAGC TCGCCGAGGC CGTGCGGTTC GCCTCCCAGG AGGGCGCCCA GGTCATCCTG CTGCCCGAGC CCGCCGGGCC CGGCCTCGCC GAGGCCACCC AGGAGGCCTC CTCCAACGGC GCGCTGGTCG TCGGCCCGGC GGGGGAGGAC GAGGACCCCA ACGTGCTCAC CGTCGCGGGG ACCGACCAGG ACGGAGCCCT CATCCAGGGC GCGCCCGGGG CCGGGATGAT CGCGCTGACC GCCCCCGGGG CCGACCTGGT CACGGCGGGA CCGGAACCGG GCCAGGCCGA GGTGACCGGG GCTCCCTACG CCGCCGCGAT GGTCGCCGGG GCCGCCGCCC TGATGCGCGC GGAGCACCCG CAGCTGCGGC CCGACCAGAT CCGCGACGCC CTGGTGGACG GCTCCCAGCC CGGCCCCGAC GGCCTGCCCG CGCTGCACCT GCCCAGCGCG GAGCAGCAGG CGTCCGGCGT CGCCCAGGAC ATCCCGCTCA TCGACGAGGA CCTGGCCGGG CGGGACGACC AGTCGGGGCT GGTGCCCGCG TGGGCGTGGT TCGTCACCGT CGGCGCCGTG GTGGTCCTGG GAGTGCTCAT CCTGGTCGTG TGGGTGCGCC GCTCCACCGC CGACCCCTAC GGCGTGAAGG CCGAGCGCCG CGAGCAGGAC GAGGAGATCG CCGCCGAGCG CGCCGCCGAG GCCGCGCCCG CCAACCGCCG CCGCAAGGGC GGACGCCGCC GCAAGACGCG CGGTAACTGA
|
Protein sequence | MLGGNSRNRI HTRRARGGAS AVFAAATAAV LGLTSAPAAA DLIPDYRPEQ WGLQAVGAPQ LWEENQGQGA TVALPGVSVD EGHPDLVDNV QLDTRFGEND GDIEQGNAAA GLVAAHGYGR DADGGVLGVA PEATVLVLPT GDQLAEAVRF ASQEGAQVIL LPEPAGPGLA EATQEASSNG ALVVGPAGED EDPNVLTVAG TDQDGALIQG APGAGMIALT APGADLVTAG PEPGQAEVTG APYAAAMVAG AAALMRAEHP QLRPDQIRDA LVDGSQPGPD GLPALHLPSA EQQASGVAQD IPLIDEDLAG RDDQSGLVPA WAWFVTVGAV VVLGVLILVV WVRRSTADPY GVKAERREQD EEIAAERAAE AAPANRRRKG GRRRKTRGN
|
| |