Gene Namu_3163 details


Gene Information

Locus tag: Namu_3163
Symbol: hisD
ID: 8448777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3483816
End bp: 3485123
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 75%
IMG OID: 645042244
Product: histidinol dehydrogenase
Protein accession: YP_003202485
Protein GI: 258653329
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
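
The length fields can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record itself does not state either convention. The function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 3483816
END_BP = 3485123


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming a complete CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.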


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0133294
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000240605
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTTCAAC GGATCGACCT ACGTGCCGGC CTGCCCCGGG TGCTCGGCGA CGTGCTGCCC 
CGGGCGGCCG TCGACATCGG TTCGGCGACC GCGACGGTCG CCCCGATCAT CGAGGACGTC
CGTCATCGTG GCGCGGCGGC CGTGCTGGAC GCGGCGCAGC GGTTCGACGG CGTGCGCCCG
GCCGCGGTTC GGGTCCCGGT CGAGGTGATC GAGGCGGCCT CCGGCGCGCT GACCGGTTCG
CTGCGGGCCG CGCTGGTGGA GTCCATCGCC CGCGCCCGGG TCGGGCACGC CGCGCAGCTG
CCGGCCGAGA CGGTGACCAC GCTGGCCTCC GGTGCCCTGG TGCGGCAGCG GTGGGTGCCG
GTGCGCCGGG TCGGGCTGTA CGTGCCGGGC GGGCGCGCGC TGTACCCGTC GAGTGTGGTG
ATGAACGTGG TGCCCGCGCA GGTCGCGGGC GTCGACGCGA TCGCCGTCAC CTCGCCGCCG
CAGAAGGACA ACGACGGCTG GCCGGACCGC AACGTGCTGG CCGCCTGCGC CCTGCTGGAC
ATCGATGAGG TCTACGCCGC CGGCGGGGCC CAGGGCATCG CGCTGCTGGC TCTGGGTGCC
GATGGCGTCG AACCGGTCGA CGTGATCACC GGACCGGGCA ACGTCTACGT CACCGCGGCC
AAGCGGCTGC TGCGCGGTGT CGTCGGCATC GACTCGGAGG CCGGCCCCAC CGAGATCGCC
GTGGTCGCTG ACGACTCCGC CGACCCCGAG TACGTCGCCG CCGACCTGAT CTCGCAGGCC
GAGCACGACC CGCTGGCCGC CTCGGTGCTG ATCACCACCT CGACCGAGCT GGCCGACGCG
GTCGACGCCG TGCTGCCGGC CCGGGTGGCC GCGACCAAGC ACAGCGAGCG GATCACCGAG
GCGCTGACCG GCCCGCAGTC CGGCGTGGTG CTGGTCGCCG GCATCGACGA CGCGCTGGCC
GTGGCCGACG CATACGCCGC CGAGCACCTG GAGATCCAGA CCCGGGACGC CGCCGCCGTG
GCCGCCCGGG TGCGCAATGC CGGCGCGGTG TTCGTCGGCG CGTACTCACC GGTATCGCTG
GGCGACTACT GCGCCGGGTC CAACCACGTG CTGCCCACCG GCGGGTCGGC CCGGTTCTCC
GCCGGCCTGG CCGCCACCAC GTTCCTGCGG CAGCAGCAGG TGATCGACTA CTCCGCCGAT
GCGCTGCGCG AGGTGGGTCC ACACGTCGCC GCCCTGTCGG CCGCGGAGGA CCTGCCCGCG
CACGGCGAGG CGGTCGCCGT GCGGCTGACG GCGCGGGACG GCTCATGA
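
A GC percentage like the one in the GC content field can be computed directly from a sequence in this layout. This is a minimal sketch, assuming the blocks of ten bases are separated only by whitespace; `gc_percent` is a hypothetical helper name and is not part of the source database.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGCTTCAAC GGATCGACCT ACGTGCCGGC CTGCCCCGGG TGCTCGGCGA CGTGCTGCCC"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of `first_line` gives the value for the whole CDS.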
 
Protein sequence
MLQRIDLRAG LPRVLGDVLP RAAVDIGSAT ATVAPIIEDV RHRGAAAVLD AAQRFDGVRP 
AAVRVPVEVI EAASGALTGS LRAALVESIA RARVGHAAQL PAETVTTLAS GALVRQRWVP
VRRVGLYVPG GRALYPSSVV MNVVPAQVAG VDAIAVTSPP QKDNDGWPDR NVLAACALLD
IDEVYAAGGA QGIALLALGA DGVEPVDVIT GPGNVYVTAA KRLLRGVVGI DSEAGPTEIA
VVADDSADPE YVAADLISQA EHDPLAASVL ITTSTELADA VDAVLPARVA ATKHSERITE
ALTGPQSGVV LVAGIDDALA VADAYAAEHL EIQTRDAAAV AARVRNAGAV FVGAYSPVSL
GDYCAGSNHV LPTGGSARFS AGLAATTFLR QQQVIDYSAD ALREVGPHVA ALSAAEDLPA
HGEAVAVRLT ARDGS
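
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a plain-Python illustration, not the database's own pipeline. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code; the table differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# NCBI-style codon ordering: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and last three codons of the gene sequence above.
print(translate("ATGCTTCAAC GGATCGACCT A"))
print(translate("GGCTCATGA"))
```

The two fragments correspond to the start (MLQRIDL) and end (GS) of the protein sequence listed above; the terminal TGA is the stop codon and yields no residue.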