Gene Hhal_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2112 
SymbolhisD 
ID4710042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2317180 
End bp2318499 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content70% 
IMG OID639856586 
Producthistidinol dehydrogenase 
Protein accessionYP_001003678 
Protein GI121998891 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGACA TCAAGCGGCT GAGCACCACA CAACCCGATT TCTGGGACAG CCTGGACGCC 
CTGACGAGCT GGGGGAGCCA CCCGGGCGCC TCCATTGAGC AGCGCGTCGC CGAGGTGGTC
GAGGGGGTTC GCACCGGCGG TGATGCGGCC CTGCTGGATT ACACGCGGCG TTTCGACGGC
CGCGAGGCCG ATTCGGTGGC CGCGCTGGAG ATCCCGACCG AGCAGGTGGT CGCGGCCCAC
GAACAGATCG CCCCGGAGAT GCGCGAGGCG CTGGAGCAGG CGGCCGCGCG GATCCGCGAC
TACGCGCAGC GTCAGCGCCT GGAGTCCTGG GAGTACCAGG ACGCCGACGG TAACCGCCTC
GGGCAGCAGG TCACCGCCCT CGACCGGGTC GGGGTCTACG TCCCCGGCGG CAAGGCCGCC
TATCCCTCAT CGGTGCTGAT GAACGCCGTG CCGGCCCGGG TGGCCGGCGT GCAGGAGATC
ATCATGACCG TCCCGGCCCC CGGGGGGGAG CTCTCACCAC TGGTGCTGGC CGCTGCCCAT
GTGGCGGGCG TGGATCGCAT CTTCACCTTG GGCGGGGCGC AGGCGGTGGC GGCCCTGGCC
TACGGGACCG AGACCGTACC CGCCGTGGAC AAGATCGTCG GCCCCGGTAA CGCCTATGTT
GCCGAGGCCA AGCGCCGCGT CTACGGGGTG GTGGGCATCG ACATGATCGC CGGCCCCTCC
GAGGTGCTGG TCATCAGCGA CGGCCAGGCC GATCCCGAGT GGATCGCCAT GGACCTGTTC
TCCCAGGCCG AGCACGACGA GGAGGCGCAG GCCCTGCTGG TCTGCCCGGA CTTCGTCTTC
CTCGATCAGG TCCAGGCGGC CATGGAGCGC CTGCTGCCGG ATATGGAGCG TTCGGAGATC
ATCCGCACCT CGCTGGCGGA GCGGGGGGCG TTGATCTGTG TCCGCGATCT GGAAGAGGCG
CAGCAGGTGG CCAACTATGT CGCTCCGGAA CACCTGGAGC TCTCCGTGGC CGAGCCCGAT
CGACTGGCCG AGGGCATCCG TCACGCCGGG GCGATCTTCC TCGGCCATTA CAGCGCCGAG
TCACTGGGTG ATTACTGTGC CGGCCCCAAC CACACGCTGC CGACGTCGCG GACCGCGCGG
TTCGCATCGC CGTTGGGTGT CTACGACTTC CAGAAGCGTT CCACCACGCT GGCCTGCTCA
CCCGCCGGCG CCGCAGCACT GGCCGGGACC GCCGCGGTGA TGGCACGGGG CGAGGGACTG
ACCGCCCATG CCCGCTCAGC GGAGTACCGC GGCCAGGGGG AGGAGCGCAG CGGTGACTGA
 
Protein sequence
MVDIKRLSTT QPDFWDSLDA LTSWGSHPGA SIEQRVAEVV EGVRTGGDAA LLDYTRRFDG 
READSVAALE IPTEQVVAAH EQIAPEMREA LEQAAARIRD YAQRQRLESW EYQDADGNRL
GQQVTALDRV GVYVPGGKAA YPSSVLMNAV PARVAGVQEI IMTVPAPGGE LSPLVLAAAH
VAGVDRIFTL GGAQAVAALA YGTETVPAVD KIVGPGNAYV AEAKRRVYGV VGIDMIAGPS
EVLVISDGQA DPEWIAMDLF SQAEHDEEAQ ALLVCPDFVF LDQVQAAMER LLPDMERSEI
IRTSLAERGA LICVRDLEEA QQVANYVAPE HLELSVAEPD RLAEGIRHAG AIFLGHYSAE
SLGDYCAGPN HTLPTSRTAR FASPLGVYDF QKRSTTLACS PAGAAALAGT AAVMARGEGL
TAHARSAEYR GQGEERSGD