Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2034 |
Symbol | hisD2 |
ID | 5713029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2153718 |
End bp | 2155025 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641267958 |
Product | histidinol dehydrogenase |
Protein accession | YP_001533374 |
Protein GI | 159044580 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.138079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0860001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGAG AGTATCTCAA GAAAGCCACC CTGACCCCCA AGAGCGACGC GGGCGAGACC AAGAAGATCG TGCGCGCCAT CCTCGACGAG ATCGAGGCCG GCGGGGACGA CGCCGCACTG GCCTATGCCC GGAAGTTCGA CAATTACGAA GGCGAGATCC TGCTGAGCCA GGACGCCATC GACGCGGCCA TCGCGCAGGT GCCCGAGAAG CTCAAGCACG ACATCGATTT CGCCCATGCC AACGTGAAGC GTTTCGCCGA GGCCCAGCGT GACACGGTCG CCAATTTCGA GATCGAGGTG GTGCCGGGCC TGATCGCGGG GCAGAAGGCG ATCCCGGTTC ATGCGGCGGG CTGCTACGTG CCCGGCGGAC GCTACAGCCA TATCGCCAGC GCGATCATGA CCGTGACCAC CGCCAAGGTG GCGGGCTGCA AGCATATCGT GGCCTGCTCG CCACCGCGAC CGGATGTGGG CATCGCGCCC GCCATCGTCT ACGCCGCCCA TGTCTGCGGT GCCGACAAGA TCATGGCGAT GGGCGGGGTG CAGGGCGTGG CGGCGATGAC CTTTGGCCTC TTCGGTCTGC CCAAGGCGAA CATCCTCGTG GGCCCCGGCA ACCAGTTCGT GGCCGAAGCC AAGCGGATGC TCTTCGGGCG CGTGGGCATC GACATGATCG CGGGGCCGAC CGACAGCCTG ATCCTGGCGG ATGCGTCCGC CGACCCCATG GTGGTGGCGG TCGACCTGGT CGGTCAGGCA GAGCATGGCT ACAACTCGCC CGTCTGGCTG GTGACCGATG ACCGCGCGCT GGCCGAGAAG GTGATGGAAC TGGTCCCCGG CCTGATCGAC GACCTGCCGG ACGTGAACCG CGAGAACGCG ACCGCCGCCT GGCGCGATTA TGCCGAGGTG ATCGTCTGTG CCGACCGTGA GGAAATGGCC GCGACCTCGG ACGAGTACGC GCCCGAACAC CTGACCGTGC AGGCCGAGGA TCTGGATTGG TGGCTGGAGC GGCTGAGCTG CTACGGCTCG CTGTTTCTGG GCGAAGAGAC CACCGTGGCC TTCGGTGACA AGGCCTCGGG GACGAACCAC GTGCTGCCGA CCTCGGGGGC CGCGAATTAC ACCGGGGGGC TTTCGGTGCA CAAATACATG AAGATCGTGA CCTGGCAGCG CTCAACCCGC GAAGGGTCCA AGCCCGTCGC GCTCGCCACG GCGCGCATTT CGCGACTGGA AGGGATGGAA GGCCACGCTC GCACCGCCGA TATTCGTCTC AGGAAGTATT TCCCCGACCA GGAATTCGAT CTGACCGGCA ATGACTGA
|
Protein sequence | MSREYLKKAT LTPKSDAGET KKIVRAILDE IEAGGDDAAL AYARKFDNYE GEILLSQDAI DAAIAQVPEK LKHDIDFAHA NVKRFAEAQR DTVANFEIEV VPGLIAGQKA IPVHAAGCYV PGGRYSHIAS AIMTVTTAKV AGCKHIVACS PPRPDVGIAP AIVYAAHVCG ADKIMAMGGV QGVAAMTFGL FGLPKANILV GPGNQFVAEA KRMLFGRVGI DMIAGPTDSL ILADASADPM VVAVDLVGQA EHGYNSPVWL VTDDRALAEK VMELVPGLID DLPDVNRENA TAAWRDYAEV IVCADREEMA ATSDEYAPEH LTVQAEDLDW WLERLSCYGS LFLGEETTVA FGDKASGTNH VLPTSGAANY TGGLSVHKYM KIVTWQRSTR EGSKPVALAT ARISRLEGME GHARTADIRL RKYFPDQEFD LTGND
|
| |