Gene Dshi_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2034 
SymbolhisD2 
ID5713029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2153718 
End bp2155025 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content66% 
IMG OID641267958 
Producthistidinol dehydrogenase 
Protein accessionYP_001533374 
Protein GI159044580 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.138079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0860001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGAG AGTATCTCAA GAAAGCCACC CTGACCCCCA AGAGCGACGC GGGCGAGACC 
AAGAAGATCG TGCGCGCCAT CCTCGACGAG ATCGAGGCCG GCGGGGACGA CGCCGCACTG
GCCTATGCCC GGAAGTTCGA CAATTACGAA GGCGAGATCC TGCTGAGCCA GGACGCCATC
GACGCGGCCA TCGCGCAGGT GCCCGAGAAG CTCAAGCACG ACATCGATTT CGCCCATGCC
AACGTGAAGC GTTTCGCCGA GGCCCAGCGT GACACGGTCG CCAATTTCGA GATCGAGGTG
GTGCCGGGCC TGATCGCGGG GCAGAAGGCG ATCCCGGTTC ATGCGGCGGG CTGCTACGTG
CCCGGCGGAC GCTACAGCCA TATCGCCAGC GCGATCATGA CCGTGACCAC CGCCAAGGTG
GCGGGCTGCA AGCATATCGT GGCCTGCTCG CCACCGCGAC CGGATGTGGG CATCGCGCCC
GCCATCGTCT ACGCCGCCCA TGTCTGCGGT GCCGACAAGA TCATGGCGAT GGGCGGGGTG
CAGGGCGTGG CGGCGATGAC CTTTGGCCTC TTCGGTCTGC CCAAGGCGAA CATCCTCGTG
GGCCCCGGCA ACCAGTTCGT GGCCGAAGCC AAGCGGATGC TCTTCGGGCG CGTGGGCATC
GACATGATCG CGGGGCCGAC CGACAGCCTG ATCCTGGCGG ATGCGTCCGC CGACCCCATG
GTGGTGGCGG TCGACCTGGT CGGTCAGGCA GAGCATGGCT ACAACTCGCC CGTCTGGCTG
GTGACCGATG ACCGCGCGCT GGCCGAGAAG GTGATGGAAC TGGTCCCCGG CCTGATCGAC
GACCTGCCGG ACGTGAACCG CGAGAACGCG ACCGCCGCCT GGCGCGATTA TGCCGAGGTG
ATCGTCTGTG CCGACCGTGA GGAAATGGCC GCGACCTCGG ACGAGTACGC GCCCGAACAC
CTGACCGTGC AGGCCGAGGA TCTGGATTGG TGGCTGGAGC GGCTGAGCTG CTACGGCTCG
CTGTTTCTGG GCGAAGAGAC CACCGTGGCC TTCGGTGACA AGGCCTCGGG GACGAACCAC
GTGCTGCCGA CCTCGGGGGC CGCGAATTAC ACCGGGGGGC TTTCGGTGCA CAAATACATG
AAGATCGTGA CCTGGCAGCG CTCAACCCGC GAAGGGTCCA AGCCCGTCGC GCTCGCCACG
GCGCGCATTT CGCGACTGGA AGGGATGGAA GGCCACGCTC GCACCGCCGA TATTCGTCTC
AGGAAGTATT TCCCCGACCA GGAATTCGAT CTGACCGGCA ATGACTGA
 
Protein sequence
MSREYLKKAT LTPKSDAGET KKIVRAILDE IEAGGDDAAL AYARKFDNYE GEILLSQDAI 
DAAIAQVPEK LKHDIDFAHA NVKRFAEAQR DTVANFEIEV VPGLIAGQKA IPVHAAGCYV
PGGRYSHIAS AIMTVTTAKV AGCKHIVACS PPRPDVGIAP AIVYAAHVCG ADKIMAMGGV
QGVAAMTFGL FGLPKANILV GPGNQFVAEA KRMLFGRVGI DMIAGPTDSL ILADASADPM
VVAVDLVGQA EHGYNSPVWL VTDDRALAEK VMELVPGLID DLPDVNRENA TAAWRDYAEV
IVCADREEMA ATSDEYAPEH LTVQAEDLDW WLERLSCYGS LFLGEETTVA FGDKASGTNH
VLPTSGAANY TGGLSVHKYM KIVTWQRSTR EGSKPVALAT ARISRLEGME GHARTADIRL
RKYFPDQEFD LTGND