Gene Ndas_1589 details

Gene Information

Locus tag: Ndas_1589
Symbol:
ID: 9245439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1944721
End bp: 1945905
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Alcohol dehydrogenase GroES domain protein
Protein accession: YP_003679524
Protein GI: 297560550
COG category:
COG ID:
TIGRFAM ID:
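
The coordinates and lengths in this record are mutually consistent, which a short check can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and do not come from the record.

```python
# Values copied from the gene record.
start_bp = 1944721
end_bp = 1945905

# Assumption: coordinates are 1-based and inclusive, so the span
# contains (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final
# codon is a stop codon that does not encode an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1185
print(protein_length_aa)   # 394
```

Both results match the Gene Length and Protein Length fields of the record.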


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0618655
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCAG TGGTGTGGAA CGGGACCAGG AACGTCGACA CCGTGACCGT TCCCGATCCG 
CGGATCGAGG AGCCCGGTGA CGCCCTCGTC CGGATCACCA GTTCGGGCCT GTGCGGATCC
GACCTGCACC TGTACGAGGT GCTCGGGCCG TTCATGACCC CGGGGGACAT CCTCGGACAC
GAACCCATGG GCGTGGTCGA GGAGGTCGGT TCCGGCGTCA CCTCCCTCAC CCCCGGCCAG
CGGGTCGTCA TCCCCTTCCA GATCTCCTGC GGGCACTGCC TCATGTGCGA CACCGGCCTC
CAGAGCCAGT GCGAGAACAC CCAGGTCGAG GAGCAGGGCA TGGGCGCCGC GCTCTTCGGC
TACAGCAAGC TCTACGGCTC GGTGCCCGGA GCGCAGGCCG AGTACCTGCG GGTTCCGCGC
GCCGAGACCA CCGCGGTCCC GGTGCCCGAC CAGGGCCCCG ACGACCGCTA CCTGTTCCTG
TCCGACGTGC TGCCGACCGC CTGGCAGGCC GTCCGCTACG CCGACGTCCC CGAGGGCGGG
TCGGTGGCCG TCCTGGGGCT GGGGCCGATC GGCGACATGT GCTGCCGGGT CGCCCGCCAC
CTGGGCGCGG GCCGGGTGTT CGGCGTGGAC CCGGTGCCGG AGCGGCGCGC CCGCGCCGCC
GCCCGGGACG TGGAGGTGTT CGACTCCTCC AAGGGCACCG ACGACGTGGT CCAGGAGATC
CGCGACCGTA CGGACGGGCG CGGCCCGGAC GCGGTCATCG ACGCGGTCGG CATGGAGGCG
GCCGGGCACG GCTCGGCCAA GTTCGCGCAG CGCGTGGCCA ACCTCATGCC CCGGGGCGTG
GCGGCCAAGA TGATGGAGAC GGCCGGGGTG GACCGGCTGA CCGCCCTGCA CACCGCCATC
GACCTGGTGC GGCGCGGCGG GACCGTCTCC CTGATCGGGG TGTACGGCGG CATGGCCGAC
CCGATGCCGA TGCTCACGCT CTTCGACAAG CAGATCCAGC TGCGGATGGG GCAGGCCAAC
GTGCGCCGGT GGGTGCCGGA GATCCTGCCG CTGCTGGAGG GGTCCGACCC GCTGGGGGTG
GACGACTTCG CCACCCACCA CGTGGGCCTG GACGCGGCCT CGCTGGCCTA CGAGAAGTTC
CAGAAGAAGC AGGACGGCGT GTTCAAGGTC GTCTTCCGGC CCTGA
 
Protein sequence
MKAVVWNGTR NVDTVTVPDP RIEEPGDALV RITSSGLCGS DLHLYEVLGP FMTPGDILGH 
EPMGVVEEVG SGVTSLTPGQ RVVIPFQISC GHCLMCDTGL QSQCENTQVE EQGMGAALFG
YSKLYGSVPG AQAEYLRVPR AETTAVPVPD QGPDDRYLFL SDVLPTAWQA VRYADVPEGG
SVAVLGLGPI GDMCCRVARH LGAGRVFGVD PVPERRARAA ARDVEVFDSS KGTDDVVQEI
RDRTDGRGPD AVIDAVGMEA AGHGSAKFAQ RVANLMPRGV AAKMMETAGV DRLTALHTAI
DLVRRGGTVS LIGVYGGMAD PMPMLTLFDK QIQLRMGQAN VRRWVPEILP LLEGSDPLGV
DDFATHHVGL DAASLAYEKF QKKQDGVFKV VFRP
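
As a sanity check, the opening of the gene sequence can be translated and compared with the start of the protein sequence. The following is a minimal sketch, assuming the amino acid assignments of NCBI translation table 11, which for this purpose match the standard genetic code. The helper name `translate` and the choice to translate only the first 60 bases are assumptions made for illustration.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
# Assumption: amino acid assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Remove the spaces that separate the 10-base groups in the listing.
    seq = "".join(dna.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from the record.
first_line = "ATGAAGGCAG TGGTGTGGAA CGGGACCAGG AACGTCGACA CCGTGACCGT TCCCGATCCG"
peptide = translate(first_line)
print(peptide)  # MKAVVWNGTRNVDTVTVPDP
```

The output agrees with the first 20 residues of the protein sequence (MKAVVWNGTR NVDTVTVPDP). Passing the complete gene sequence to the same function would yield the full protein followed by `*` for the terminal TGA stop codon.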