Gene Ndas_5005 details


Gene Information

Locus tag: Ndas_5005
Symbol:
ID: 9248894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 147760
End bp: 148950
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Histone deacetylase
Protein accession: YP_003682892
Protein GI: 297563919
COG category:
COG ID:
TIGRFAM ID:
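
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 147760
END_BP = 148950
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(START_BP, END_BP))   # compare with the Gene Length field
print(protein_length(GENE_LENGTH_BP))  # compare with the Protein Length field
```

Under these assumptions, 148950 - 147760 + 1 gives 1191 bp, and 1191 / 3 - 1 gives 396 aa, which agrees with the listed values.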


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.983537
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAGGGC GCACGTCCTG CTCGCTTCGG GTGGCATGGG ACGACGGACT CACGGCCTAC 
GACTTCGGTC CGCAGCACCC GATGGCGCCG ATACGCGTCG AGCTGACCAT GGCGCTCAGC
CGCGAACTCG GCGTGCTGGA CGCGCCGGGG GTCGGACTCC TGGACGTCGA ACCCGCCTCG
GACGAACTCC TGTCCCTCGT GCACGACCCC GCCTACATCG AGGCGGTCAA GCGCGCGGGC
CGCACGCTGG AGCCCGACGA CGCCCACATG CTGGGCACCT CCGACAACCC CGTCTTCAAG
GACATGCACG ACGCCGCCGC GCTCATCTCC GGCGCGTCCG TGGCCGCCGC GCGGGCGGTC
TGGAGCGGGG AGACCGCGCA CGCGGCCAAC ATCGCGGGCG GCCTGCACCA CGCCATGCGC
GGCAACGCCT GGGGCTTCTG CGTCTACAAC GACCCCGCCC TGGCCATCGC CTGGCTGCTG
GAGCAGGGGG CCAAGCGCGT CGCCTACGTG GACGTGGACG TCCACCACGG CGACGGCGTC
CAGAACGCCT TCTACAACGA CCCGCGCGTG CTCACCATCA GCCTCCACGA GTCCCCGGCG
ACCCTGTTCC CCGGCACCGG CCAGGCCTCC GAGACCGGCG GCCCGGACGC CGAGGGGTAC
GCGGTCAACG TCGCCCTGCC CGCGGGCACC GGCGACAACG GCTGGCACCG CGCCTTCGAC
GCCGTCGTGC CGCCGCTGCT GCACGAGTTC CAGCCCGAGA TCCTGGTGAC CCAGCAGGGC
TGCGACACCC ACGCCCTGGA CCCGCTCGCC AACCTCACCC TGAGCGTGGA CGGCCAGCGC
CGGGCCTACG CCGAGCTGCA CGAGCTGGCC CGCAAGACGG CGGGCGGCCG CTGGCTGCTG
TTCGGCGGCG GCGGGTACGG GCTGGTCCAC GTCGTCCCCC GCGCCTGGAC CCACCTGCTG
GGCGAGGCCG CGGGCCGTCC CATCGACCCC GACACCGAGA CCCCGCAGGG CTGGCGCGAC
TTCGTGCGCC AGCGCACGGG GGAGCTGGCG CCGCTGTACA TGACCGACGG GCGCGAGGTC
GTCTTCGACC ACTTCGTGGA CGGCTACGAC CCGGGCGACC CGGTGGACCG GGCCATCCAC
GCGACCCGGA CCGCGGTCTT CCCCAGCCAC GGGATCGACC CGAGCCTGTA G
 
Protein sequence
MGGRTSCSLR VAWDDGLTAY DFGPQHPMAP IRVELTMALS RELGVLDAPG VGLLDVEPAS 
DELLSLVHDP AYIEAVKRAG RTLEPDDAHM LGTSDNPVFK DMHDAAALIS GASVAAARAV
WSGETAHAAN IAGGLHHAMR GNAWGFCVYN DPALAIAWLL EQGAKRVAYV DVDVHHGDGV
QNAFYNDPRV LTISLHESPA TLFPGTGQAS ETGGPDAEGY AVNVALPAGT GDNGWHRAFD
AVVPPLLHEF QPEILVTQQG CDTHALDPLA NLTLSVDGQR RAYAELHELA RKTAGGRWLL
FGGGGYGLVH VVPRAWTHLL GEAAGRPIDP DTETPQGWRD FVRQRTGELA PLYMTDGREV
VFDHFVDGYD PGDPVDRAIH ATRTAVFPSH GIDPSL