Gene Dtox_0752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0752 
Symbol 
ID8427690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp761968 
End bp763266 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content53% 
IMG OID645033110 
Producthistidinol dehydrogenase 
Protein accessionYP_003190285 
Protein GI258514063 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGGA TTATGCAGGC GGGAGACCCC GCCCTGGAAA ATATTATGAA TAGGCATATG 
GCCGGCAAAG AAGCAGCCGC GGCAATCGTG GCAGAAATAA TAAAGACTGT GCGCGAGCAG
GGTGACAGGG CTTTATGCGC TTACACGGCC GATTTTGATC AAGTCAAGTT GACCCCCGGT
CAACTGAAAG TCATACCTGA AGAAATAGAA GAAGCCTATG GCCTGGTGGA AAAGACAACT
CTGGAGTCAA TATGCCTGGC CCGTGACAAT ATAGCTTCTT TCCACCGCAA GCAGTTGGAG
AAATCCTGGT TTGCCCCGTC CGATAACGGT GTTATCCTGG GACAGCTGGT GCGCCCGCTG
GCCAGAGCAG GCATTTATGT CCCGGGCGGT ACTGCTGTGC TCTTTTCTTC AGTGCTGATG
AACGCCATAC CGGCCATGGT GGCCGGGGTG AAAGAAATTG TTATGGTTAC TCCTCCGGGC
AAGGACGGAA AACTCAACCC TTATACACTG GTGGCAGCCG CTGAGGCAGG GGTAACCGAG
ATTTATAAGG CCGGTGGGGC TCAGGCAATA GCGGCACTGG CTTACGGAAC TGAAACAATC
AAGCCGGTAG ATAAAATAAC CGGTCCGGGC AATATCTATG TAACCCTGGC CAAACAGCAG
GTCTACGGGC AGGTGGGTAT TGATATGCTG GCCGGGCCCA GTGAAGTGCT GGTAGTGGCT
GATGAAACGG CGGATGCTTC CTATGTGGCT GCGGATCTGC TCTCTCAGGC CGAGCATGAC
GTGCTGGCTT CCGCTGTGCT GCTGACACCG GTGGAAAGCC TGGCTGAAAA AGTAAGAGAG
GAAGTTGCCA GGCAGACTGA GTTGTTGCCG CGCAGGGATA TCGTCACCCG TTCTTTAACT
GACTACAGCG CTATAGTAAT CACCAGGGAT TTGGCCCAGG CGGTGCAGAT GGCCAATAGG
TTTGCACCTG AACACCTGGA ACTGATGGTA AAAGAGCCTT TTAACCTGTT ATGCCAGATT
ACCAATGCCG GGGCCGTCTT TTTAGGCTCT TACTCACCCG AGCCGGTGGG AGATTACTGG
GCCGGGCCAA ATCACGTGCT GCCTACAGGC GGAACAGCCA GGTTCTACTC GCCTTTGAGC
GTAGATACCT TTATTAAAAA ATCAAGCATA ATTTCCTATA CACGGGAGGC ATTGGAGCAG
GCCGGTGGTC ATATTGCAGA CATGGCTGTT AAAGAGGGAC TGGATGCTCA TGCCAACGCT
ATAAGAATCA GGCTGGAGGG GAAAGGAGAA AATCAATAA
 
Protein sequence
MIRIMQAGDP ALENIMNRHM AGKEAAAAIV AEIIKTVREQ GDRALCAYTA DFDQVKLTPG 
QLKVIPEEIE EAYGLVEKTT LESICLARDN IASFHRKQLE KSWFAPSDNG VILGQLVRPL
ARAGIYVPGG TAVLFSSVLM NAIPAMVAGV KEIVMVTPPG KDGKLNPYTL VAAAEAGVTE
IYKAGGAQAI AALAYGTETI KPVDKITGPG NIYVTLAKQQ VYGQVGIDML AGPSEVLVVA
DETADASYVA ADLLSQAEHD VLASAVLLTP VESLAEKVRE EVARQTELLP RRDIVTRSLT
DYSAIVITRD LAQAVQMANR FAPEHLELMV KEPFNLLCQI TNAGAVFLGS YSPEPVGDYW
AGPNHVLPTG GTARFYSPLS VDTFIKKSSI ISYTREALEQ AGGHIADMAV KEGLDAHANA
IRIRLEGKGE NQ