Gene Ndas_0578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0578 
Symbol 
ID9244420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp719080 
End bp720660 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content73% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003678531 
Protein GI297559557 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.385541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGACA CTTCCCCCTC CCTCGTCCCC CGTCCCCACA CCTCGGCTCC GGGAGACGCC 
GGGGGCCTCA CGCTGACCGC CTCGACCAGG GTGTCCGCCG ACCCCGACGC CCGGGGCACG
CTCGCGTGGC TCCAGCGTGA ACTCGGGGCG GCCACCGGCC TCCCCCTGGC CACCGGCGAC
GAGGCCAGCG CCCAGATCCG GCTGAGCGTG GACCCCGAGG CGGGTCTGGG CCGCGAGGGC
TACCGGCTGA TCGTGGACGC CGAGGGAGCC ATCATCGTGG GCAACGACCC CGCCGGGGTG
TTCTACGGTG CGCAGACCCT GCGCCAGCTG CTGCCCGCCG ACGTCTACCG GGACGCCCCG
CTGGGCGGTG CCGAGTGGGC CCTGCCCGCC GTGAGCGTCA CCGACGCGCC CCGTTTTCGC
TGGCGGGGCG TGATGCTGGA CGTGGCCCGT CACTTCGTGC CCAAGCGCGA GGTGCTGCGG
TTCATCGACC TGCTGGCCAT GCACAAGCTC AACGTCCTGC ACCTGCACCT GACCGACGAC
CAGGGGTGGC GCGTGGAGAT CCGCCGCTAC CCGAAGCTGA CCGAGGTGGG CTCCTGGCGG
ACCCGGAGCC AGGTGGGCGC CGCGAAGCCG CCGGTGTTCG ACGAGCGCCC GCACGGCGGC
TTCTTCACGC AGGACGACAT CCGCGAGATC GTCGCCTACG CCGACGCCCG GCACGTGGCC
GTCGTCCCCG AGATCGACGT GCCCGGCCAC TCGCAGGCCG CCATCCACGC CTACCCCGAG
CTGGGCGAGT GCGGACGGAT CCCGGTCGGC GACCAGTGGG GGATCTTCGA GGAGGTGCTC
GCGGTCACCG ACAACGTCCT GGAGTTCTAC CGCAACGTCC TGGACGAGCT GATCGAACTG
TTCCCGAGCA CGTACGTGCA CGTGGGCGGC GACGAGTGCC CCAAGACCCA GTGGCGGGCG
AGCGCGTCCG CGCAGCGGCG GATCAAGGAG GAGGGGCTGG CCGACGAGGA CGAGCTGCAG
AGCTGGTTCA TCCGCCAGCT GGACGAGCAC CTGACCTCGC GCGGCCGCCG CCTGGTCGGC
TGGGACGAGA TCCTGGAGGG AGGGCTCGCG CCGGGGGCGA CCGTGATGTC GTGGCGCGGC
GAGGAGGGGG GTGTCGCGGC CGCGCGGGCC GGTCACGACG TGGTCATGAG CCCCACCCGC
ACCTCCTACC TGGACTACCG GCAGTCGGAG TCCGGGGACG AGCCCGTCCC GGTGGGCACG
CTGCTGCGGA CCGAGGACGT GTACCTGGCC GAGCCGGTCC CCCCGGGGCT GACCGAGCAG
GAGGCCCGGC ACGTGCTGGG CGCGCAGGTG AACGTGTGGA CCGAGCACAT CGACTCGCCG
CGCAGGCTGG ACTACATGGT CTTTCCCAGG CTGTCGGCCT TCGCCGAGCA GGTGTGGTCG
TCCGGTGAGC GCGACTACGC CGAGTTCGAG CCCCGGCTGA GGCGGCACCT GGAGCGGCTC
GACGCGGCCG GGGTGGAGTA CCGGCCGCTG GAGGGGCCGC GCCCGTGGCA CACCCGCCCG
GGGGTGGTGG GCTGGGGGTG A
 
Protein sequence
MPDTSPSLVP RPHTSAPGDA GGLTLTASTR VSADPDARGT LAWLQRELGA ATGLPLATGD 
EASAQIRLSV DPEAGLGREG YRLIVDAEGA IIVGNDPAGV FYGAQTLRQL LPADVYRDAP
LGGAEWALPA VSVTDAPRFR WRGVMLDVAR HFVPKREVLR FIDLLAMHKL NVLHLHLTDD
QGWRVEIRRY PKLTEVGSWR TRSQVGAAKP PVFDERPHGG FFTQDDIREI VAYADARHVA
VVPEIDVPGH SQAAIHAYPE LGECGRIPVG DQWGIFEEVL AVTDNVLEFY RNVLDELIEL
FPSTYVHVGG DECPKTQWRA SASAQRRIKE EGLADEDELQ SWFIRQLDEH LTSRGRRLVG
WDEILEGGLA PGATVMSWRG EEGGVAAARA GHDVVMSPTR TSYLDYRQSE SGDEPVPVGT
LLRTEDVYLA EPVPPGLTEQ EARHVLGAQV NVWTEHIDSP RRLDYMVFPR LSAFAEQVWS
SGERDYAEFE PRLRRHLERL DAAGVEYRPL EGPRPWHTRP GVVGWG