Gene Information |
Locus tag | Ndas_3977 |
Symbol | |
ID | 9247848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4756751 |
End bp | 4757884 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | peptidase S8 and S53 subtilisin kexin sedolisin |
Protein accession | YP_003681880 |
Protein GI | 297562906 |
COG category | |
COG ID | |
TIGRFAM ID | |
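The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4756751
end_bp = 4757884
gene_length_bp = 1134
protein_length_aa = 377

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and is not counted in the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three checks agree with the listed values: 1134 bp corresponds to 378 codons, that is 377 amino acids plus one stop codon.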
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.745092 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCTCGGCC TCGCCGCGGC CGGCGCCCTC TCCGTCCCCC TGGCCCTGTC GGGCGCCGTC TCGGCCGGCG CCGACGAGCT CGCCCCCCTC CACACCGTCG GCGCCTCGGC CACCGGCGAC TACTTCGTCG TCCTGGAGGA CACGCTCAGC GCCGCCTCCG TCACCCCCTC GTCCCTCGGC ATCTCCTCCG ACCAGGTCAA CCACACCTAC GACGAGGTCG TCAACGGCTA CTCGGCCAGC CTGAGCGCGG CGGAGGTGCG CGAACTGCGC GGGCAGTCGG GCGTCGCCTA CGTCGAGGAG GTCGGCGTCG CGCACACCAC CGTCACGTGG GGCCAGGACC GCATCGACCA GGAGGACCTC CCCCTGGACG GCTCCTACGG CACCACTGGC GACGGGGCCG GCGTCTCCGC CTACATCATC GACAGCGGCA TCGACGCCTC GCACCCCGAC TTCGGCGGTC GCGCCTCCTC CGCCTTCGAC GCCTACGGCG GCGACGGCTC CGACGGCAAC GGCCACGGGA CCCACGTCGC GGGCACCATC GGCTCCGCCA CGTACGGCGT CGCCCCCGCC GCCGACCTCT TCGGCGTCAA GGTGCTCGAC GACAACGGCT CGGGCTCCTA CGACGACGTC ATCGCCGGTA TCGACTGGGT GGCCGCCAAC GCCGGATCCA ACGCCGTGGC CAACCTGTCC CTGGGCGGCC CGTCCTCCCC GACCCTGGAC GAGGCCGTCA ACGGCCTGGC CGAGTCCGGC GTGTTCGTCG CGGTCGCGGC GGGCAACGAG GGCCAGGACG CGGGCAACAC CTCTCCGGGC GGCGCCGAGG GCGTGACCAC GGTCGGCGCC TCCGACGCGA CCGACGCCGC GGCCGTCTTC TCCAACCACG GCCCGTCCGT CGACATCTAC GCCCCCGGCG TGGACGTGGA GTCGACCGTC CCCGGCGGCG GAACCGACTC CTATGACGGC ACCTCGATGG CCAGCCCGCA CGTGGCCGGG GCCGCCGCCC TGTACAAGAG CGTGAACGGC GACGACGACC AGGCCACCAT CCAGGACTGG CTCGTCTCCA ACGCCGGCGT GGACAAGCTC AGCGGCGTTC CCGCGGGCAC GGTCAACCTG CTGCTCAACG TCCAGGGCCT CTGA
|
Protein sequence | MLGLAAAGAL SVPLALSGAV SAGADELAPL HTVGASATGD YFVVLEDTLS AASVTPSSLG ISSDQVNHTY DEVVNGYSAS LSAAEVRELR GQSGVAYVEE VGVAHTTVTW GQDRIDQEDL PLDGSYGTTG DGAGVSAYII DSGIDASHPD FGGRASSAFD AYGGDGSDGN GHGTHVAGTI GSATYGVAPA ADLFGVKVLD DNGSGSYDDV IAGIDWVAAN AGSNAVANLS LGGPSSPTLD EAVNGLAESG VFVAVAAGNE GQDAGNTSPG GAEGVTTVGA SDATDAAAVF SNHGPSVDIY APGVDVESTV PGGGTDSYDG TSMASPHVAG AAALYKSVNG DDDQATIQDW LVSNAGVDKL SGVPAGTVNL LLNVQGL
|
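The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own method: it uses the standard codon assignments (which translation table 11 shares), treats GTG and TTG as alternative start codons that are read as methionine in the first position (a simplification of the full table 11 start-codon list), and does not handle internal stop codons or ambiguous bases. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same
# assignments and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are considered here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence given 5'->3' on the coding strand."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")

    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]

    # An alternative start codon is translated as methionine.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"

    # Drop a trailing stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()

    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCTCGGCC TCGCCGCGGC CGGCGCCCTC"))  # MLGLAAAGAL
```

The first ten residues produced this way match the start of the listed protein sequence, which is consistent with the gene beginning at a GTG start codon.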