Gene Information |
Locus tag | Ndas_0404 |
Symbol | |
ID | 9244242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 495162 |
End bp | 496490 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | peptidase M16 domain protein |
Protein accession | YP_003678358 |
Protein GI | 297559384 |
COG category | |
COG ID | |
TIGRFAM ID | |
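The length fields above follow from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the record lists "Is gene spliced | No"). The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(495162, 496490)
print(length, protein_length_aa(length))  # 1329 442
```

Under these assumptions the computed values agree with the "Gene Length" and "Protein Length" rows.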
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACCACCA TCGGAGCGGC GGGACGTGTC GTTCAGTACA CGCTCGACAA CGGATTGCGT CTGGTGACCG CTGCGGCCTC CACGGGCCAG GTCGCCTCCG TGAACCTGTG GTACGGCGTG GGGTCCCGGC ACGAGGTCCC CGGGCGCACG GGGTTCGCGC ACCTCTTCGA GCACCTGATG TTCCAGGGCA GCGGCGGCGT CGCCAAGGGC GAGCACTTCG AGGAGGTCGA GAGGCTCGGC GGCGACATCA ACGCCTCCAC CTCGACCGAC CGCACCAACT ACTACGAGAC GGTCCCCGAG CACGCCCTGG ACCGGATCCT GTGGCTGGAG GCCGACCGGC TGGCGACCCT GCGCGAGGGC ATGACCCAGG AGGTCCTGGA CAACCAGCGC GACGTCGTCA AGAACGAGCG CCGCCAGCGC TACGACAACC AGCCCTACGG CACCGCCCTG GAGCGCATCC TGCGGCTGGC CTACCCCGAG GGCCACCCCT ACCACCACCC GACCATCGGC TCCATGGCCG ACCTGGACGC CGCCGACCTC GACTACGTCA AGTCGTTCCA CAGGGCCCAC TACGGTCCGG ACAACTGCGT GCTCACCGTG GTCAGCGACC TCGACCCCGA GGACGTCCGG GGCCGCGTGG AGAAGTTCTT CGGTCCCATC CCGGCCCGCG AGAGCGTCCC CGAGGCGCCC GACGCCGCCC TGGAGGCCCC GCTGGGCGGC CCGGTCCGCG ACGCGGTCAC CGAGACCGTG CCCGCCGCCG GGGTGTTCCT CGGGTTCCGC GTCGCCCCCT ACGGCGAGCG CGGGTTCGAC GTCATGCACC TGGCCTCGGC CGTCCTGGGC CAGGGACAGG GCAGCCGCCT GTACCGCTCC CTGGTGGTGG ACCGCCCCAT CGCCGCCGAC GACGGCGGCG GCGCCGCCGA CATCCTGCCG TTCCGCTACA CCGACAGCCT GATGCTGGTG AACATGCTCG CCCGCGAGGG CGTCAGCGGC GACGTTCTGG AGGAGGCCAT GCGCGAGGAG ATCGCCAAGC TGGCCGCCGG GATCACCGAG GAGGAGCTGG ACCGGGCCCG CGCCGTCCTG GAGCGCGACC ACCTCCAGTC CATCTCCAGC CCCTCCGGGC TCGCCGACAG CATCAGCAGC TGCACCCAGC TGTTCGACGA CCCCGAGCTG GCCTACACCT GGCCCCGGCG CTGGGACGAC ATCACCGCCG AGGAGGTGCG GGCCGCCGCC GAGCGCGTGC TGGTCGACGA CAACCTGCTC GTCGTCCGCT TCGACCCCGA GGAGTCCGGG GCCGCCGAGG CGGCCGGGGC CGACACCGTC GACGCCTGA
Protein sequence | MTTIGAAGRV VQYTLDNGLR LVTAAASTGQ VASVNLWYGV GSRHEVPGRT GFAHLFEHLM FQGSGGVAKG EHFEEVERLG GDINASTSTD RTNYYETVPE HALDRILWLE ADRLATLREG MTQEVLDNQR DVVKNERRQR YDNQPYGTAL ERILRLAYPE GHPYHHPTIG SMADLDAADL DYVKSFHRAH YGPDNCVLTV VSDLDPEDVR GRVEKFFGPI PARESVPEAP DAALEAPLGG PVRDAVTETV PAAGVFLGFR VAPYGERGFD VMHLASAVLG QGQGSRLYRS LVVDRPIAAD DGGGAADILP FRYTDSLMLV NMLAREGVSG DVLEEAMREE IAKLAAGITE EELDRARAVL ERDHLQSISS PSGLADSISS CTQLFDDPEL AYTWPRRWDD ITAEEVRAAA ERVLVDDNLL VVRFDPEESG AAEAAGADTV DA
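The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be cleaned and checked; the helper names are hypothetical, and `START_CODONS` lists only a common subset of the initiators allowed by translation table 11. At an initiation site an alternative start codon such as GTG is read as methionine, which is consistent with the gene sequence starting with GTG while the protein sequence starts with M.

```python
START_CODONS = {"ATG", "GTG", "TTG"}  # subset of table 11 initiators (assumption)
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(text: str) -> list[str]:
    """Split the sequence into complete triplets, ignoring a trailing remainder."""
    seq = clean_sequence(text)
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def has_valid_ends(text: str) -> bool:
    """True if the length is a multiple of three with a start and a stop codon."""
    seq = clean_sequence(text)
    triplets = codons(seq)
    return (
        len(seq) % 3 == 0
        and len(triplets) >= 2
        and triplets[0] in START_CODONS
        and triplets[-1] in STOP_CODONS
    )


# First two blocks and last nine bases of the gene sequence in this record.
first_blocks = "GTGACCACCA TCGGAGCGGC"
last_bases = "GACGCCTGA"
print(codons(first_blocks)[:3])   # ['GTG', 'ACC', 'ACC']
print(codons(last_bases)[-1])     # TGA
```

Passing the full "Gene sequence" field to `gc_fraction` gives a value that can be compared against the "GC content" row.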