Gene Ndas_0801 details


Gene Information

Locus tag: Ndas_0801
Symbol:
ID: 9244646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 986049
End bp: 987662
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: polysaccharide deacetylase
Protein accession: YP_003678751
Protein GI: 297559777
COG category:
COG ID:
TIGRFAM ID:
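
The length fields above are consistent with the listed coordinates. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 986049
end_bp = 987662

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# One codon per residue; the final codon is the stop and codes for nothing.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1614 537
```

Both values match the Gene Length and Protein Length entries in the record.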


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.147243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.613804
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTAAGC AGTCCCGCCT CGCCATGGCA CCGGACCACC CCCTGATCAC GGTCGTGGTA 
GCCCTGGCGG TCCTGGCGGG CATGGTGTTG TTGGCCACCC GCTCCTCCGG GGAGCCCGAG
CCCGGGCCGC GGACGCATGG AGCGCCGGTC TCCGAGAGCG AGCCGTCGCC CCCGGCCTCC
GGTGAGCCCG ACGGGCTCAC CGAGGTGGAC GCGGCCGCGG TGGCGGGCCT GGAGGAGGCC
GAACTGCCCT ACGACGGGGA GATCTCCGTC GCGGTGACCT ATCCGGTGAT CCCCAACGCC
GAGCCGCTGG CCGACTTCCT GGAGCGTGAG CTGACCGACG AGGTGCACGC CTTCGAGGCC
GCCAACCCCG GCGCGGTCTC CTTCGAGGCG GGGTGGAACC TGACCGCGGC GCGCGACGGC
CTGCTGGGGG TCCGCGTGAC CCGGGTGGAG ACCGACTCCG AGGGGTCCCG GGAGGGGTAC
ACCACCTACT GGTACGACAC CGAGACCGCG GCGCACCACC CCTCCGCCGC ACTGGTCGGC
GGCCAGGAGC AGCTGGAGGA GCTCAACGGC CTGGTGCGCG GCGCCGTCGG AAGCGGGGAG
GGCGGGGAGG GCCCGGCCGA CCCCGGCGTC GTCCACCCGA TCAGCTCCCT GTACGACTCG
GTGGGCTTCA ACCCCGACGG CGACCTGGTC GTGGAGTTCG ACGCCGGGCA GGTCGCCCCG
GCGGAGGAGG GCCGCACGCG CGCGGTGGTC GCCGCCGGTG AGGCCGAGGG CCTGCTGTCC
GAGCTGGGCC TGCGCGTGCG CGACGCCGCG ACCGTGGGCG TGGAGGACTT CTCCATCGCC
GCCCCGCCCC AGGCCGACAA GGACGGCCAG GAGGAGGGCG CCGTGCCCGG GCAGGTGCCC
GCGGTGGACC CCGAGGTCGA CTGCTCCGAC CCCGAAACCA AGTGCGTGGC GCTGACCTAC
GACGACGGGC CCGGCGGGCG CACCCCCGAA CTGCTGGACG CCCTCGCCGA GTACGACGCG
CGGGCCACGT TCTTCGTCAC CGGCAACCCC GTCATGGAGC ACCCGCACAC GGTGCGGCGC
GCCTACGCGG AGGGGCACGA GATCGCCAAC CACACCCTGA ACCACCCCGA CCTGGCCGGT
CTGGGCGCGG GAGGGGTGCG CGCGGACCTG GACGTCGTGC AGGCGCTGGT GTACCGCGAG
ACCGGCTACA CCATGAACCT CATGCGCCCG CCCTACGGCT CGACCGACGA GGGCGTGGCC
TCGGTGACCG CGGACATGGG CCTGGCCCAG ATCCTGTGGA GCGTGGACAC CCTCGACTGG
AAGGACCGCA AGGCCTCGGT GATCCACGAC CGGGTGCTGG AGGGCGCTTC GGACGGGGCG
ATCATCCTCA TGCACGACAT CCACGGCACC ACCGTCGACG CCTCCCGCAC GGCCATCCGG
GAACTGGACG AGCAGGGGTA CACCATGGTC ACGGTGTCCC AGCTGCTGGG GACCACGACC
CCCGGCCAGA GCTACATGGA CGGGGTCCCC GACGCCCCGG AGGAGGACGC CGACCCCTCC
GAGGAGGCCG GTGAGCCCGC TGAGGGGGCC GAGGAGGCTT CCGAGGAGGC CTGA
 
Protein sequence
MSKQSRLAMA PDHPLITVVV ALAVLAGMVL LATRSSGEPE PGPRTHGAPV SESEPSPPAS 
GEPDGLTEVD AAAVAGLEEA ELPYDGEISV AVTYPVIPNA EPLADFLERE LTDEVHAFEA
ANPGAVSFEA GWNLTAARDG LLGVRVTRVE TDSEGSREGY TTYWYDTETA AHHPSAALVG
GQEQLEELNG LVRGAVGSGE GGEGPADPGV VHPISSLYDS VGFNPDGDLV VEFDAGQVAP
AEEGRTRAVV AAGEAEGLLS ELGLRVRDAA TVGVEDFSIA APPQADKDGQ EEGAVPGQVP
AVDPEVDCSD PETKCVALTY DDGPGGRTPE LLDALAEYDA RATFFVTGNP VMEHPHTVRR
AYAEGHEIAN HTLNHPDLAG LGAGGVRADL DVVQALVYRE TGYTMNLMRP PYGSTDEGVA
SVTADMGLAQ ILWSVDTLDW KDRKASVIHD RVLEGASDGA IILMHDIHGT TVDASRTAIR
ELDEQGYTMV TVSQLLGTTT PGQSYMDGVP DAPEEDADPS EEAGEPAEGA EEASEEA
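
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 gives when GTG is used as a start codon. The sketch below shows one way to check the translation and the GC content against the sequences above. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon assignments are the standard ones that table 11 shares, and only three start codons (ATG, GTG, TTG) are treated as initiators, which is a simplification of the full table 11 start set.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only these initiators are read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon in the first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "GTGAGTAAGC AGTCCCGCCT CGCCATGGCA"
print(translate(first_blocks))  # prints: MSKQSRLAMA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.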