Gene Arth_1585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1585 
SymbolhisD 
ID4445882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1766661 
End bp1768037 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content68% 
IMG OID639689400 
Producthistidinol dehydrogenase 
Protein accessionYP_831079 
Protein GI116670146 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCATTT CTCCGGACAG CCCCGTACAG ACCACAGCCC CGGCCATCAA CTACCGCACC 
ATTGACCTGC GCGGTCAGCG GCTCTCGCTC GCCGGACTCC GGGCGGCTGT CCCGCGTGCC
CGGCACCAGA CCATGGCGGA CGCCGAGCAG AAGGTCACCG ACATCATTAC CGCGGTCCGG
TCCCGGGGCT TCGCCGCCCT GAGTGAGTTC GCAAGAACGT TCGACGGCGT CGAGCAGGCA
CACCCGCGTG TGCCGGGCGC CGCGCTGCGC GCGGCGCTGG AGGAGCTGGA CCCGGCAGTG
CGGGCGGCCC TCGAAGAATC GATCAGCAGG GCGCGCAAGT TCGCAGACAG CCAGCGGCCG
GCAAACGTCG ACGTCGAACT GGGCGAGGGC GCCGTCGTTA GCCAGAACTG GGTGCCGGTT
GGTCGCGTGG GGCTGTATGT GCCCGGCGGA CTGGCTGTCT ACCCGTCGTC GGTCATCATG
AACGTCGTCC CGGCGCTGGC GGCCGGCGTT GCCTCCATTG CCCTGGCTTC TCCTCCCCAG
AAAAACTTCG GGGGGCTCCC GCACCCCACC ATCCTTGCCG CGGCGGCGCT GCTTGGCATT
GATGAGGTTT ACGCCATCGG GGGCGCACAG GCCATCGCGG CCTTTGCCTA CGGTGTTCCC
GCTGACGACG CGGAGGCAGC CCTGGATCCG GTGGACGTAG TGACCGGGCC GGGCAACATC
TTTGTTGCCA CGGCCAAACG GCTGGTCAAA GGTGTTGTCG GCATCGATTC CGAGGCCGGC
ACCACGGAAA TCGCCATCCT TGCGGACAGG ACCGCCCGCC CGTCGCTGGT GGCGGCCGAC
CTCATCAGCC AGGCCGAACA CGATCCGAAG GCCGCCTCGG TCCTCATCAC CGACTCAGAG
GATCTTGCCG GGGCCGTCCG CGCCGAGCTC GCCGTGCAGG CTGCGTTGAC CAAGCATTCC
GAACGTGTAC GGGAAGCCTT GTCCGGCCCG CAGTCCGGCG TGGTGCTCGT GGACGACCTT
GAGCAGGGAA TCGCGGCCTG TGATGCCTAC GCCGCGGAGC ACCTTGAAAT CATGACGGCG
GATGCCCCGG CTGTGGCTGC CCGTATCCGC AACGCTGGAG CGATCTTCGT GGGGGACTAC
AGCCCCGTCA GCCTGGGGGA CTACTGTGCA GGGTCCAACC ACGTGCTGCC GACGAGCGGC
ACGGCTGCTT TCTCTTCCGG CCTTAATGTC ACCACGTTCC TTCGTGCCAT CCAGGTGGTC
AACTACAGCA AGCCGGCCCT CCAGAAGGTC AGCGGGCACA TCGTCAGCCT CGCGGGCGCA
GAGGACCTGC CGGCCCACGG CGAGGCCGTG ACGGCGAGGT TCGCCGGAAC GGAATAA
 
Protein sequence
MTISPDSPVQ TTAPAINYRT IDLRGQRLSL AGLRAAVPRA RHQTMADAEQ KVTDIITAVR 
SRGFAALSEF ARTFDGVEQA HPRVPGAALR AALEELDPAV RAALEESISR ARKFADSQRP
ANVDVELGEG AVVSQNWVPV GRVGLYVPGG LAVYPSSVIM NVVPALAAGV ASIALASPPQ
KNFGGLPHPT ILAAAALLGI DEVYAIGGAQ AIAAFAYGVP ADDAEAALDP VDVVTGPGNI
FVATAKRLVK GVVGIDSEAG TTEIAILADR TARPSLVAAD LISQAEHDPK AASVLITDSE
DLAGAVRAEL AVQAALTKHS ERVREALSGP QSGVVLVDDL EQGIAACDAY AAEHLEIMTA
DAPAVAARIR NAGAIFVGDY SPVSLGDYCA GSNHVLPTSG TAAFSSGLNV TTFLRAIQVV
NYSKPALQKV SGHIVSLAGA EDLPAHGEAV TARFAGTE