Gene Dret_1370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1370 
SymbolhisD 
ID8419199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1597877 
End bp1599187 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content61% 
IMG OID645037946 
Producthistidinol dehydrogenase 
Protein accessionYP_003198236 
Protein GI258405494 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.304099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.714962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTGTC GAAGCCTGAC CTACTCTTCT GCCCAGGACT GGGCCGCCAT TCGTGCATGG 
CTCGCTCCGC GGACTGAACC CGACACCTCG GTGGAGGGCC CGGTGCGCGA GATACTCCAA
GAAGTGCAAC AGCACGGCGA TGCCACTTTG GTCAAATACA CCCAGCGTTT CGATTGCCCG
GATTTCCAAG CCCATATGCT GGCGGTGCCC CAGGAACAGA TCGAGGCTGC TGTACAGGAA
ATTCCCGCCG AGGACAAACG GATTATTGAG GAGGCCGCGG CCAACATCAG GGATTATCAC
GCCAAACAGC AGGAAAACTC CTGGTTCACC CCCCAAAGCG GCGGCACGAT ACTGGGCCAG
ATCGTTCGCC CTGTGGACCG GGCTGGGCTC TACGTTCCAG GCGGACAGGG AGGCGATACC
CCGCTCTTGT CCAGTCTGCT CATGAACGCC ATTCCCGCCC AGGTGGCCGG GGTTGGGGAC
ATCTCCCTGG TGACCCCTCC AAGGGTGGAT GGGACCGTCA ATCCCTATAT CCTGTGTACA
GCGGGCATTC TCGGCTTAGA CCGTGTTTTC GCCGTTGGCA GTGCCTGGGC CGTCGCCGCC
TTGGCCTTCG GCACCGAAAC CCTTCCCTGC GTCGACGTCA TTGCCGGACC GGGAAACATC
TTTGTGGCCA CAGCCAAACG GCTGTTGCAG GGCCAGATCG GCATCGACAT GGTCGCCGGT
CCCAGTGAAA TCGCCATCGT AGCGGACGCA AGCGCTTCAG CCGAACGGCT GGCCGCGGAC
ATGCTCTCCC AGGCCGAACA CGACCCCCTG GCGTCGAGTA TTCTGATAAC CGACTCGCAG
GACCTGCTGC AAACCACCCA ACAGGAATTG GAACGCCAGC TCGCCGAATT GCCCCGCAAT
ACCATCGCCC GGCAGTCGCT CTCGGACTGG GGGGCCTGCA TTCGTGTTCC GGACACGGCC
ACCGGACTGG AACTCGCCAA TCGTCTTGCC CCGGAGCACC TTGAACTCTG CCTGGAATCA
CCCTGGCAGT GGATCGATCA AGTCCACCAT GCCGGAGCGG TTTTCCTCGG TCACAGCACC
CCGGAACCTG TTGGCGATTA TTTCGCCGGA CCGAACCACG TCCTGCCGAC CATTGGCACG
GCCCGATTCA GTTCCGCCCT TTCGGTCCAG AATTTCACCA AGAAGACAAG CCTCATCGCC
ACTTCGGACG CCTATATCCA GGAGCATGGG GCCAAGATCG CTCGAATGGC CCGCCTCGAA
GGGCTTGAGG CCCACGCCAG AAGCGTCGAG ACCCGGTATC GGTGTTTGTG A
 
Protein sequence
MTCRSLTYSS AQDWAAIRAW LAPRTEPDTS VEGPVREILQ EVQQHGDATL VKYTQRFDCP 
DFQAHMLAVP QEQIEAAVQE IPAEDKRIIE EAAANIRDYH AKQQENSWFT PQSGGTILGQ
IVRPVDRAGL YVPGGQGGDT PLLSSLLMNA IPAQVAGVGD ISLVTPPRVD GTVNPYILCT
AGILGLDRVF AVGSAWAVAA LAFGTETLPC VDVIAGPGNI FVATAKRLLQ GQIGIDMVAG
PSEIAIVADA SASAERLAAD MLSQAEHDPL ASSILITDSQ DLLQTTQQEL ERQLAELPRN
TIARQSLSDW GACIRVPDTA TGLELANRLA PEHLELCLES PWQWIDQVHH AGAVFLGHST
PEPVGDYFAG PNHVLPTIGT ARFSSALSVQ NFTKKTSLIA TSDAYIQEHG AKIARMARLE
GLEAHARSVE TRYRCL