Gene RPD_4279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4279 
SymbolhisD 
ID4024801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4741840 
End bp4743165 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content68% 
IMG OID637964486 
Producthistidinol dehydrogenase 
Protein accessionYP_571397 
Protein GI91978738 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0461088 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCCTTT GTGCCGCCGG AAGCTTCCAA ATGCCAATTC GTCTCGACAG CGCCAGCACC 
GATTTCGCCT CTCAATTCAA GGCCTTCCTG GCCATGAAGC GCGAGGTTGC GGCCGACATC
GAGGCGGCCA CCCGGGCCAT CGTCGACGAC GTCGCGGCGC GCGGTGATGC GGCGCTGCTC
GAAGCCACCG CGAAGTTCGA CCGTTTGAAG CTGGAAGCCA GCGGCCTGCG CGTCAGCGCC
GCCGAGGTCG ACGCCGCGGT CAAGCTGTGC GACCCGGATA CCATCGACGC GCTGAAATTC
GCGCGCGACC GCATCGAATT CTTCCACCGC CGACAATTGC CGAAGGACGA CCGCTTCACC
GATCCGCTCG GCGTCGAGCT CGGCTGGCGC TGGAGCGCGA TCGAGGCGGT CGGCCTCTAT
GTCCCCGGCG GCACCGCGGC CTATCCGTCC TCGGTGCTGA TGAACGCGAT CCCCGCCAAG
GTCGCCGGCG TCGAGCGCGT GGTGATGGTG GTGCCGTCAC CCGACGGCAA GCTCGCTCCG
CTGGTACTCG CCGCGGCGCA GCTCGGCGGC GTAACTGAGA TCTACCGCGT CGGCGGCGCC
CAGGCGGTGG CCGCGCTGGC TTACGGCACC GCGACGATCG CGCCGGTCTC CAAGATCGTC
GGCCCCGGCA ACGCCTATGT GGCCGCCGCC AAGCGGCTGG TGTTCGGTCG TGTCGGCATC
GACATGATCG CGGGTCCGTC AGAAGTCGTG GTGGTCGCCG ACAACACCGG CAATCCGGAC
TGGATCGCGG CCGATCTTCT GGCGCAGGCC GAGCACGACG CCAATGCGCA GTCGATCCTG
ATCACCGACG ATGCGGCGCT GGCGGCCGAG GTCGAGCGCG CCGTCGCGGC GCAGTTGAAG
ACGCTGTCGC GCGAAAAGAT CGCGCGCGCC TCGTGGGACG CGTTCGGCGC CATCATCAAG
GTCGCCCGGC TCGACGAGGC GGTCGGCCTC GCGGACGCCA TCGCCGCCGA GCATCTCGAG
ATCATCACTG CCGATCCGGA AGCCTTCGCG GCCAAGATCC GCAATGCCGG CGCGATCTTC
CTCGGCGCGC ATACGCCGGA GGCGATCGGC GACTATGTCG GAGGCTCCAA CCACGTGCTG
CCGACCGCGC GTTCGGCGCG GTTCTCCTCC GGGCTCGGCG TGCTCGATTT CATGAAGCGG
ACCTCGATCC TGAAATGCGG TCCGGAGCAG CTCGCCGCCC TCGGACCCGC GGCGATGACG
CTCGGTCACG CCGAGGGACT GGATGCTCAT GCGCGCTCGG TGGGACTGCG TCTGAATCGG
CGATGA
 
Protein sequence
MGLCAAGSFQ MPIRLDSAST DFASQFKAFL AMKREVAADI EAATRAIVDD VAARGDAALL 
EATAKFDRLK LEASGLRVSA AEVDAAVKLC DPDTIDALKF ARDRIEFFHR RQLPKDDRFT
DPLGVELGWR WSAIEAVGLY VPGGTAAYPS SVLMNAIPAK VAGVERVVMV VPSPDGKLAP
LVLAAAQLGG VTEIYRVGGA QAVAALAYGT ATIAPVSKIV GPGNAYVAAA KRLVFGRVGI
DMIAGPSEVV VVADNTGNPD WIAADLLAQA EHDANAQSIL ITDDAALAAE VERAVAAQLK
TLSREKIARA SWDAFGAIIK VARLDEAVGL ADAIAAEHLE IITADPEAFA AKIRNAGAIF
LGAHTPEAIG DYVGGSNHVL PTARSARFSS GLGVLDFMKR TSILKCGPEQ LAALGPAAMT
LGHAEGLDAH ARSVGLRLNR R