Gene Ndas_3535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3535 
Symbol 
ID9247404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4245466 
End bp4246476 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content73% 
IMG OID 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_003681442 
Protein GI297562468 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.582496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.719008 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTCA CCAGCAGGGA GGTCCACCTG GTGGCCCGTC CCGTCGGCGA GCCCGAGCCC 
ACCGACTTCT CCCTCGTGGA GACCACCGTC GCCGACCCCG GTCCCGGGCA GGTCCTGGTG
CGCAACGACT GGATGTCCGT GGACCCGTAC ATGCGCGGCC GCATGAACGA CGCCAAGTCC
TACGTCCCCC CGTTCCGGCT CGGCGAGCCG ATGGACGGCG GCGCCGTGGG CGTGGTCACC
GCCTCCGGCA GCGACGACGT CCCCGTGGGC ACCACCGTCC TGCACTCGGC CGGATGGCGC
GAGTACGCGC TGCTGCCCGC GGATTCCGTG CGCGCGGTGG ACGCCTCCCT GGCACCCGCC
GAGGCCTACC TCGGCGTGCT GGGCATGATC GGCCTCACCG CCTACGCGGG CCTGACCGAG
ATCGCCCCGG TGCGCGAGGG CGACGTGGTG TTCGTCTCCG GCGCCGCGGG CGCGGTCGGC
TCCGCCGCCG GCCAGATCGC CCGCCAGCTG GGCGCGTCCC GGGTGGTCGG GTCCGCGGGC
GGCCCGGAGA AGAAGCGCCG CCTCCTGGAG GACTTCGGCT TCGACGCCGC CATCGACTAC
CGCGAGGGCC GCCTGGAGGA GCAGCTCGCC GAGGCCGCGC CCGAGGGGAT CGACGTCTAC
TTCGACAACG TCGGCGGCGA CCACCTGAGG GCCGCCATCG CCGCGATGCG CAACCACGGC
CGGATCGCCC TGTGCGGCGC GATCTCCCAG TACAACGCCA CCAAGCCCGA GCCCGGCCCC
GACAACCTCT TCCTGGCCGT CGGCAAGCGC CTCACCCTGC GCGGGTTCAT CGCCGGAGAC
CACGGCCACC TGATGAAGGA GTACGCCGAG CGCGCCTCCG GGTGGATCGT CGACGGCAGG
CTGCGCAGCG AGCAGACCGT CGTCGACGGC ATCGACAACG CCGTGCGGGC CTTCCTCGGC
ATGATGCGGG GCGCCAACAC GGGCAAGATG CTGGTCCACC TCACACCCTG A
 
Protein sequence
MSVTSREVHL VARPVGEPEP TDFSLVETTV ADPGPGQVLV RNDWMSVDPY MRGRMNDAKS 
YVPPFRLGEP MDGGAVGVVT ASGSDDVPVG TTVLHSAGWR EYALLPADSV RAVDASLAPA
EAYLGVLGMI GLTAYAGLTE IAPVREGDVV FVSGAAGAVG SAAGQIARQL GASRVVGSAG
GPEKKRRLLE DFGFDAAIDY REGRLEEQLA EAAPEGIDVY FDNVGGDHLR AAIAAMRNHG
RIALCGAISQ YNATKPEPGP DNLFLAVGKR LTLRGFIAGD HGHLMKEYAE RASGWIVDGR
LRSEQTVVDG IDNAVRAFLG MMRGANTGKM LVHLTP