Gene Noca_3045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3045 
SymbolhisD 
ID4600162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3241998 
End bp3243296 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content75% 
IMG OID639777651 
Producthistidinol dehydrogenase 
Protein accessionYP_924234 
Protein GI119717269 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCGCC GCATCGACCT GCGAGGCGCC GACCCGGGTG TCGACTACCG GGCAGCCGTG 
CCCCGTGCCG ACTTCGACAT CGAGGCCGCG GTCCCGGCGG TGCATGCGAT CTGCGAGGAC
GTCCGGACCC GCGGGCTGGA CGCGATCCGC GAGCTCTCGG AGCGCTTCGA CGGCGTCGCC
GTGGACGACA TCCGGGTCGC CCCGGAGGCG CTGGCCACCG CGCTCGAGCG GCTCGACCCC
GACATCCGGG CCGCCCTGGA GGAGTCGATC GCGCGGCTGC GGGCCACCTG CGCGAACGAG
CTCGAGCAGG ACGCCGTCAC CGACCTCGGC CCCGGCGCCC GGGTCACCCA CCGCAAGGTG
CCGGTCGGCC GGGTCGGCCT CTACGTCCCC GGCGGGCTGG CCCCGCTGGT CTCCAGCGTG
CTGATGAACG TCGTGCCGGC CCAGACCGCC GGCGTCGGGT CGATCGCGCT CGCGAGCCCG
CCCCAGCGTG AGTTCGCAGG CGCGGTGCAC CCGACGATCC TGGCGGCGTG CGCGCTGCTG
GGGGTCGAGG AGGTGTACGC CGTCGGCGGC GCCCAGGCGA TCGCGATGTT CGCCTACGGC
ACCGGGCCGT GCCGGCGGGT CGACCTGGTG ACCGGGCCCG GCAACATCTA CACGGTCACC
GCCAAGCGGC TGCTCAAGGG CCTGGTCGGT ATCGACTCGG AGGCGGGCCC CACCGAGATC
GCGATCCTCG CCGACGACAC GGCGGACCCG GCGTACGTCG CCGCCGACCT GCTCAGCCAG
GCCGAGCACG ACCCGCTCGC CGCCGCCGTG CTCGTCACGC CCTCCGACCG GCTGGCCGAC
GCGGTCGCGG CCGAGCTCGA GACGCAGGTC GCGGCCACCA AGCACGTCGA ACGGATCCGC
ACCAGCCTCT CCGGGCGGCA GTCCGGGGTC GTCCTCGTCG ACGACCTCGA GCAGGGCCTC
GAGGTCGTGA ACGCCTACGC CGCCGAGCAC CTCGAGATCC ACACCGAGGA CGCCGCGGCG
TACGCCGCCC GGGTCCGCAA CGCCGGCGCG ATCTTCGTCG GCCCCTACGC CCCGGTCAGC
CTCGGCGACT ACTGCGCCGG CTCCAACCAC GTGCTGCCGA CCGCCGGCTG CGCCTGCCAC
TCCTCGGGCC TCTCGGTGCG CGCGTTCACC AAGTCGGTCC ACGTGGTCGA CTACTCCCGC
GCGGCGCTCG ACGCCGTGGC CGGGCACGTC GTCACGCTGG CCGAGGCCGA GGACCTCCCC
GGCCACGGCG CGGCCGTCCG GGTGCGGTTC GGGGGCTGA
 
Protein sequence
MIRRIDLRGA DPGVDYRAAV PRADFDIEAA VPAVHAICED VRTRGLDAIR ELSERFDGVA 
VDDIRVAPEA LATALERLDP DIRAALEESI ARLRATCANE LEQDAVTDLG PGARVTHRKV
PVGRVGLYVP GGLAPLVSSV LMNVVPAQTA GVGSIALASP PQREFAGAVH PTILAACALL
GVEEVYAVGG AQAIAMFAYG TGPCRRVDLV TGPGNIYTVT AKRLLKGLVG IDSEAGPTEI
AILADDTADP AYVAADLLSQ AEHDPLAAAV LVTPSDRLAD AVAAELETQV AATKHVERIR
TSLSGRQSGV VLVDDLEQGL EVVNAYAAEH LEIHTEDAAA YAARVRNAGA IFVGPYAPVS
LGDYCAGSNH VLPTAGCACH SSGLSVRAFT KSVHVVDYSR AALDAVAGHV VTLAEAEDLP
GHGAAVRVRF GG