Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4041 |
Symbol | |
ID | 9247913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4834450 |
End bp | 4835484 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_003681944 |
Protein GI | 297562970 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACC TGCCGCTGGT CATGGGGATC GAGACGTCCT GCGACGAGAC CGGCGTGGCC CTGGTGCGCG GCTGCGAGCT GCTCGGCGAC GCCGTGGCCT CCAGCGTGGA CCAGCACGTC CGGTTCGGCG GCGTGGTGCC CGAGGTGGCA AGCCGCGCGC ACCTGGAGGC GATGACCCCG ACCGTGCACC GGGCGCTGGA AAAGGCCGGG GCCAAGCTGT CGGACGTGGA CGCCATCGCC GTCACCGCCG GTCCGGGCCT GGCGGGCGCC CTGCTCGTGG GCGTGTCCGC GGCCAAGGCG TACGCGATGG CCCTGGGCAA GCCGCTCTAC GGCGTGAACC ACCTCGTGGG CCACGTGGCC GTGGACCAGC TGGAGCACGG ACCGCTGCCC AAGCCCTCGA TCGCCCTGCT GGTGTCGGGC GGCCACACCT CGCTGCTGCT GGTCAACGAC CTGGCCACCG AGGTGGTCTC GCTCGGCGAC ACCGTGGACG ACGCCGCCGG TGAGGCCTAC GACAAGGTGG CGCGCCTGCT CGACCTGCCC TACCCGGGCG GCCCGCCCAT CGACAAGGCG GCGCAGCGGG GCGACCCCAA GGCGATCCGC TTCCCGCGCG GCAAGTGGGG CGACGGCACC TACGACTTCT CGTTCTCGGG CCTGAAGACC GCCGTGGCCC GCCACGTGGA GGACACCGAC CGCCGGGGCG AGCCCCTGGT GGTCGCCGAC ATCGCCGCCG CCTTCCAGGA GTCGGTGGTG GACGTGCTCA CCCGCAAGGC CGTGGACGCC TGCGTGGAGC ACGGCGTGAG CACGCTGGTC ATCAGCGGGG GCGTGGCGGC GAACTCGGCG CTGCGCGCGC TGGCCGAGGA GCGCTGCCGG GAGGCCGGCG TCGAACTGCG CGTCCCGCGC CCGCGCCTGT GCACCGACAA CGGCGCGATG ATCGCCGCCT TGGGCGCCGA GGTCGTGGCG GCGGGCCTGC CCGCGTCCAC GCTGGACCTG GCCACGGACA CCTCCCTGCC GGTGAGCTCC CCGCTGGCGG TGTAG
|
Protein sequence | MSDLPLVMGI ETSCDETGVA LVRGCELLGD AVASSVDQHV RFGGVVPEVA SRAHLEAMTP TVHRALEKAG AKLSDVDAIA VTAGPGLAGA LLVGVSAAKA YAMALGKPLY GVNHLVGHVA VDQLEHGPLP KPSIALLVSG GHTSLLLVND LATEVVSLGD TVDDAAGEAY DKVARLLDLP YPGGPPIDKA AQRGDPKAIR FPRGKWGDGT YDFSFSGLKT AVARHVEDTD RRGEPLVVAD IAAAFQESVV DVLTRKAVDA CVEHGVSTLV ISGGVAANSA LRALAEERCR EAGVELRVPR PRLCTDNGAM IAALGAEVVA AGLPASTLDL ATDTSLPVSS PLAV
|
| |