Gene Rpal_5022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5022 
SymbolhisD 
ID6412715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5404690 
End bp5405985 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content67% 
IMG OID642714906 
Producthistidinol dehydrogenase 
Protein accessionYP_001993986 
Protein GI192293381 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCTTC GTCTCGATAA CGCCAGTCCC GATTTCGCTT CCAAATTCAA AGCCTTCCTG 
GCGATGAAGC GGGAGGTTGC GGCGGATATC GAGGCCGCCA CCCGGGCCAT CGTCGACGAC
GTCGCGCATC GCGGCGACGC TGCGCTGCTG GAAGCGACCG AAAAGTTCGA CCGGCTGACG
CTGGATGCCG CCGGCATGCG GGTCGGCGAG GCCGAGGTCG AAGCCGCCGT GAAGGCGTGC
GATTCCGAGA CGATCGACGC GCTGAAGCTG GCGCGCGACC GCATCGAATT TTTCCATCGC
CGGCAATTGC CGAAGGACGA TCGCTTCACC GATCCGCTCG GCGTCGAGCT CGGCTGGCGC
TGGAGCGCGA TCGAGGCGGT CGGCCTGTAC GTGCCCGGCG GCACCGCGGC GTATCCGTCG
TCGGTGCTGA TGAATGCGAT TCCAGCCAAG GTCGCCGGCG TCGAGCGCGT GGTGATGGTG
GTGCCGTCGC CCGGCGGCAC GCTCAATCCG CTGGTGCTGG CCGCGGCGCA GCTCGCTGGC
GCCACCGAGA TCTACCGCAT CGGCGGCGCA CAGGCGGTGG CTGCGTTGGC CTACGGCACC
GCGACGATTG CGCCGGTTGC CAAGATCGTC GGCCCCGGCA ACGCTTATGT CGCTGCAGCC
AAGCGCCTGG TGTTCGGCCG CGTCGGCATC GACATGATCG CTGGCCCGTC CGAGGTCGTC
GTCGTCGCAG ACAAGACCGC CAATCCGGAT TGGATTGCGG CCGATTTGTT GGCGCAGGCC
GAGCACGACG CCAATGCGCA ATCGATTCTG ATTACCGATA GCGCGGTACT CGCCGCCGAC
GTCGAGCGCG CCTTGGCGGC GCAGCTCACC ACGCTGCCGC GCGTCAAGAT CGCGCGCGCC
TCGTGGGACG AGTTCGGCGC CATCATCAAG GTCGCCAAGC TGGAGGATGC GGTCCCGCTC
GCCAACGCGA TCGCGGCCGA GCATCTCGAG ATCATGACGG CCGATCCGGA AGCGTTCGCC
GACAAGATCC GTAATGCCGG CGCGATCTTC CTCGGCGGCC ATACGCCGGA AGCGATCGGC
GACTATGTCG GCGGCTCCAA CCACGTGCTG CCGACCGCCC GTTCGGCGCG GTTCTCGTCG
GGTCTCGGCG TGCTCGACTT CATGAAGCGC ACCTCGATCC TGAAGTGCGG TCCCGAACAG
CTCGCCGCGC TCGGCCCGGC CGCGATGGCG CTCGGCAAGG CCGAAGGATT GGACGCGCAT
GCGCGATCGG TGGGACTGCG TCTGAATCAG CGATGA
 
Protein sequence
MPLRLDNASP DFASKFKAFL AMKREVAADI EAATRAIVDD VAHRGDAALL EATEKFDRLT 
LDAAGMRVGE AEVEAAVKAC DSETIDALKL ARDRIEFFHR RQLPKDDRFT DPLGVELGWR
WSAIEAVGLY VPGGTAAYPS SVLMNAIPAK VAGVERVVMV VPSPGGTLNP LVLAAAQLAG
ATEIYRIGGA QAVAALAYGT ATIAPVAKIV GPGNAYVAAA KRLVFGRVGI DMIAGPSEVV
VVADKTANPD WIAADLLAQA EHDANAQSIL ITDSAVLAAD VERALAAQLT TLPRVKIARA
SWDEFGAIIK VAKLEDAVPL ANAIAAEHLE IMTADPEAFA DKIRNAGAIF LGGHTPEAIG
DYVGGSNHVL PTARSARFSS GLGVLDFMKR TSILKCGPEQ LAALGPAAMA LGKAEGLDAH
ARSVGLRLNQ R