Gene Dhaf_4212 details

Gene Information

Locus tag: Dhaf_4212
Symbol:
ID: 7261232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 4455353
End bp: 4456483
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 49%
IMG OID: 643564127
Product: Alcohol dehydrogenase GroES domain protein
Protein accession: YP_002460655
Protein GI: 219670220
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
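
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are inclusive 1-based coordinates and that the last codon of the CDS is a stop codon, which is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4455353, 4456483)
print(length, protein_length(length))  # prints: 1131 376
```

Both values agree with the Gene Length and Protein Length fields of this record.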


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0000750849
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCAA TAGTTTATGA GGGAATTCGG GATGTTAAAG TTAAGAATGT CGGAGATCCG 
GGAATACAAA AGCCTGACGA CATCATTGTT AAGGTCACAT CCACAGCCAT ATGCGGTTCA
GATCTTCATC TTATTCACGG TATGGTCCCC GGTATGCCCG AGGGGTTTGT TCTTGGTCAT
GAGACCATGG GCATCGTAGA AGAGGTAGGC GGGGATGTGT ACAACATTAA AAAAGGAGAT
CGGGTTATTG TGCCTTTTCC TATTGCCTGC GGGCATTGCT GGTATTGTGA ACATGACCTG
TGGAGTCAGT GTGATAACGC GAATCCTGAA GCCGAAGTGG GAGCGTATTT TGGCTACAGC
AATACTTTTG GCGGTTATGA TGGGGGACAG GCGGAGTACC TGCGAGTTCC TTACGCCAAT
GTGGGGCCCA AAGTGGTTCC GGAGGAATTA ACCGACGAAC AGGTCCTCTT CTTAACAGAT
ATCCTGCCTA CCTCATACTG GGGAGTGGAA ATCGGTGGGG TAAAAAAGGA CGATACAGTG
GTGGTCCTGG GCTGTGGGCC GGTAGGCCTG CTGACCATCA AATGGGCCAT TTTCCAGGGG
GCCAAACGAG TCATTGCCGT GGATCATATT AGCTACCGGC TGGATCATGC CTATAGATAC
TATGGGGTGG AGGTCATTAA CTTTGAAGAT CACGACAACA CCGGCGAGTA TATTAAGGAG
ATAACTCACG GAGGTGCGGA CGTGGTGATC GACTGTGTAG GTATGGATGG CAAAGCATCC
ACCCTTGAGA AGATCGAGAC CTTGCTTAAG CTCCAAGGGG GCTCCAAATC AGCCATTGAG
ATTGCCACTC AGGCAGTGCG AAAAGGTGGA ACCGTAGCTT TGGTAGGTGT CTATGGGTCA
AAGTATAATC TGTTTCCTTT GGGGGATTTT TTCTCCCGAA ACATTACCTT GAAGATGGGG
CAATGCCCGG CCCATTCCTA TGTGGAGCCG ATCATGGAAT TGATCAAAAC AGGCCGGTTT
GATGCTACGG ATATCATTAC TCACCGCCTT TCCTTAGATA AAGGGGAGCA TGCCTATGAG
GTTTTTGACG AGAAAAAGGA TAACTGCATT AAAGTTGTCT TGAAGCCATA G
 
Protein sequence
MKAIVYEGIR DVKVKNVGDP GIQKPDDIIV KVTSTAICGS DLHLIHGMVP GMPEGFVLGH 
ETMGIVEEVG GDVYNIKKGD RVIVPFPIAC GHCWYCEHDL WSQCDNANPE AEVGAYFGYS
NTFGGYDGGQ AEYLRVPYAN VGPKVVPEEL TDEQVLFLTD ILPTSYWGVE IGGVKKDDTV
VVLGCGPVGL LTIKWAIFQG AKRVIAVDHI SYRLDHAYRY YGVEVINFED HDNTGEYIKE
ITHGGADVVI DCVGMDGKAS TLEKIETLLK LQGGSKSAIE IATQAVRKGG TVALVGVYGS
KYNLFPLGDF FSRNITLKMG QCPAHSYVEP IMELIKTGRF DATDIITHRL SLDKGEHAYE
VFDEKKDNCI KVVLKP