Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0578 |
Symbol | |
ID | 9244420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 719080 |
End bp | 720660 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_003678531 |
Protein GI | 297559557 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.385541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGACA CTTCCCCCTC CCTCGTCCCC CGTCCCCACA CCTCGGCTCC GGGAGACGCC GGGGGCCTCA CGCTGACCGC CTCGACCAGG GTGTCCGCCG ACCCCGACGC CCGGGGCACG CTCGCGTGGC TCCAGCGTGA ACTCGGGGCG GCCACCGGCC TCCCCCTGGC CACCGGCGAC GAGGCCAGCG CCCAGATCCG GCTGAGCGTG GACCCCGAGG CGGGTCTGGG CCGCGAGGGC TACCGGCTGA TCGTGGACGC CGAGGGAGCC ATCATCGTGG GCAACGACCC CGCCGGGGTG TTCTACGGTG CGCAGACCCT GCGCCAGCTG CTGCCCGCCG ACGTCTACCG GGACGCCCCG CTGGGCGGTG CCGAGTGGGC CCTGCCCGCC GTGAGCGTCA CCGACGCGCC CCGTTTTCGC TGGCGGGGCG TGATGCTGGA CGTGGCCCGT CACTTCGTGC CCAAGCGCGA GGTGCTGCGG TTCATCGACC TGCTGGCCAT GCACAAGCTC AACGTCCTGC ACCTGCACCT GACCGACGAC CAGGGGTGGC GCGTGGAGAT CCGCCGCTAC CCGAAGCTGA CCGAGGTGGG CTCCTGGCGG ACCCGGAGCC AGGTGGGCGC CGCGAAGCCG CCGGTGTTCG ACGAGCGCCC GCACGGCGGC TTCTTCACGC AGGACGACAT CCGCGAGATC GTCGCCTACG CCGACGCCCG GCACGTGGCC GTCGTCCCCG AGATCGACGT GCCCGGCCAC TCGCAGGCCG CCATCCACGC CTACCCCGAG CTGGGCGAGT GCGGACGGAT CCCGGTCGGC GACCAGTGGG GGATCTTCGA GGAGGTGCTC GCGGTCACCG ACAACGTCCT GGAGTTCTAC CGCAACGTCC TGGACGAGCT GATCGAACTG TTCCCGAGCA CGTACGTGCA CGTGGGCGGC GACGAGTGCC CCAAGACCCA GTGGCGGGCG AGCGCGTCCG CGCAGCGGCG GATCAAGGAG GAGGGGCTGG CCGACGAGGA CGAGCTGCAG AGCTGGTTCA TCCGCCAGCT GGACGAGCAC CTGACCTCGC GCGGCCGCCG CCTGGTCGGC TGGGACGAGA TCCTGGAGGG AGGGCTCGCG CCGGGGGCGA CCGTGATGTC GTGGCGCGGC GAGGAGGGGG GTGTCGCGGC CGCGCGGGCC GGTCACGACG TGGTCATGAG CCCCACCCGC ACCTCCTACC TGGACTACCG GCAGTCGGAG TCCGGGGACG AGCCCGTCCC GGTGGGCACG CTGCTGCGGA CCGAGGACGT GTACCTGGCC GAGCCGGTCC CCCCGGGGCT GACCGAGCAG GAGGCCCGGC ACGTGCTGGG CGCGCAGGTG AACGTGTGGA CCGAGCACAT CGACTCGCCG CGCAGGCTGG ACTACATGGT CTTTCCCAGG CTGTCGGCCT TCGCCGAGCA GGTGTGGTCG TCCGGTGAGC GCGACTACGC CGAGTTCGAG CCCCGGCTGA GGCGGCACCT GGAGCGGCTC GACGCGGCCG GGGTGGAGTA CCGGCCGCTG GAGGGGCCGC GCCCGTGGCA CACCCGCCCG GGGGTGGTGG GCTGGGGGTG A
|
Protein sequence | MPDTSPSLVP RPHTSAPGDA GGLTLTASTR VSADPDARGT LAWLQRELGA ATGLPLATGD EASAQIRLSV DPEAGLGREG YRLIVDAEGA IIVGNDPAGV FYGAQTLRQL LPADVYRDAP LGGAEWALPA VSVTDAPRFR WRGVMLDVAR HFVPKREVLR FIDLLAMHKL NVLHLHLTDD QGWRVEIRRY PKLTEVGSWR TRSQVGAAKP PVFDERPHGG FFTQDDIREI VAYADARHVA VVPEIDVPGH SQAAIHAYPE LGECGRIPVG DQWGIFEEVL AVTDNVLEFY RNVLDELIEL FPSTYVHVGG DECPKTQWRA SASAQRRIKE EGLADEDELQ SWFIRQLDEH LTSRGRRLVG WDEILEGGLA PGATVMSWRG EEGGVAAARA GHDVVMSPTR TSYLDYRQSE SGDEPVPVGT LLRTEDVYLA EPVPPGLTEQ EARHVLGAQV NVWTEHIDSP RRLDYMVFPR LSAFAEQVWS SGERDYAEFE PRLRRHLERL DAAGVEYRPL EGPRPWHTRP GVVGWG
|
| |