Gene Dshi_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2036 
Symbol 
ID5713031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2155779 
End bp2156762 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content69% 
IMG OID641267960 
Productzinc-containing alcohol dehydrogenase 
Protein accessionYP_001533376 
Protein GI159044582 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.194095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0749622 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGC TCGTTTATAC CGGGGTGGCG CAGCTGGCCT TCCGCGATGT GCCGGAGCCG 
GTTCCGGCTG CGGGCGATCA CCTGATCCGC ATCGACAGTG TCGGGATCTG CGGCTCGGAC
ATGCATGCCT ATCTCGGACA TGACGATCGC CGCCCTGCCC CGCTGATCCT CGGGCACGAG
GGCGCGGGCG TGATAATCGG CGGCCCCCGC GACGGGGAGC GTGTGACGAT CAATCCGCTC
GTGACCTGCG GCACCTGCCC GGCCTGCGTG TCGGGACGCG ACAACCTGTG TGCCACAAGG
CAGATCATCT CGATGCCCCC GCGCGATGGG GCGTTCGCGC AATACGTCGC CATGCCGGCA
CGCAACCTGG TGACCGTACC CGACGACGTC CCGCTGGAGA AAGCCGCCCT GGCCGAGCCC
GTGGCGGTCA GCTGGCACGC GGTGCGTCTG GGGCTGGCAT CCATGGCCGA CGCGCGCCGC
GACAGCGCCC TGGTGATCGG CGGCGGGGCC ATCGGCGTGG CCGCCGCGAT CAGCCTGCAA
GCGCAGGGTG TGGCGGACGT GACCCTCGTG GAGCCGAACG CCATGCGCCG CGAGTACCTC
GCTCGCGATG CAAACTACAC CATCGCGACG CCCGAGCAGG TCGCAGGCCG GGTTTTCGAC
ATCACCGTGG ACGGGGTTGG CTATGATGCC ACCCGGGCGG CGGCTTCGGC GGCGACCCGT
CCCGGCGGTC TGCTCTTGCA TATCGGGCTG GGGGGTGGGT CCGCGGGCCT CGACATTAGG
CGGATCACCC TGCAGGAGAT CACCGTGATC GGCACCTATA CCTACACCGC GCAGGATTTT
CGCGACACCT GTGCCGCGAT GTTTGACGGC CGCCTCGGCG GGCTCGACTG GACCGAAAGC
CGTCCCCTTT CCGCGGGGGC AGACGCCTTC GCCGATATCC GCGCGGGCCG CGTGCCCGCA
CCCAAGATCA TACTCAAGCC GTAA
 
Protein sequence
MKALVYTGVA QLAFRDVPEP VPAAGDHLIR IDSVGICGSD MHAYLGHDDR RPAPLILGHE 
GAGVIIGGPR DGERVTINPL VTCGTCPACV SGRDNLCATR QIISMPPRDG AFAQYVAMPA
RNLVTVPDDV PLEKAALAEP VAVSWHAVRL GLASMADARR DSALVIGGGA IGVAAAISLQ
AQGVADVTLV EPNAMRREYL ARDANYTIAT PEQVAGRVFD ITVDGVGYDA TRAAASAATR
PGGLLLHIGL GGGSAGLDIR RITLQEITVI GTYTYTAQDF RDTCAAMFDG RLGGLDWTES
RPLSAGADAF ADIRAGRVPA PKIILKP