Gene RPB_4350 details


Gene Information

Locus tag: RPB_4350
Symbol: hisD
ID: 3912164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4933854
End bp: 4935197
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 70%
IMG OID: 637886255
Product: histidinol dehydrogenase
Protein accession: YP_487948
Protein GI: 86751452
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
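
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4933854, 4935197)
print(length_bp)                  # 1344
print(protein_length(length_bp))  # 447
```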


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.376502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.174658
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCATTC GCGGCCTTGT GGGCCTTTGT GACTCCGGAA GCTTCCAAAT GCCAGTTCGC 
CTCGACGCCG CCAGTACCGA TTTTCCTCAG CAGTTCAAGG CCTTCCTGGC CATGAAGCGC
GAGGTCGCGG CGGATATCGA GGCCGCCACG AGGGCCATCG TCGACGACGT CGCGGCGCGC
GGCGACGCCG CGCTGCTGGA CGCCACTGCG AAGTTCGACC GGCTGACGCT GGACGCGGCC
GGCCTGCGCG TCACCGCGGC CGACATCGAT GCCGCGGTGC AGGCCTGCGA CCCTGATACC
GTCGACGCGC TGAAATTCGC CCGCGACCGC ATCGAGTTCT TCCACCGCCG GCAATTGCCC
AAGGACGACC GCTTCACCGA TGCGCTGGGC GTCGAACTCG GCTGGCGCTG GAGCTCGATC
GAGGCGGTGG GTCTGTACGT GCCCGGCGGC ACCGCGGCCT ACCCGTCCTC GGTGCTGATG
AACGCGATCC CGGCCAAAGT CGCCGGCGTC GACCGCGTCG TGATGGTGGT GCCGTCGCCG
GACGGCAAGC TCAACCCGCT GGTGCTGGCC GCCGCTTCGC TCGGCGGCGT CACCGAGATC
TACCGCGTCG GCGGCGCCCA GGCGGTGGCG GCGCTGGCTT ACGGCACCGC GACGATCGCG
CCGGTGTCGA AGATCGTCGG CCCCGGCAAC GCCTATGTGG CGGCCGCCAA GCGGCAGGTG
TTCGGCCGCG TCGGCATCGA CATGATCGCC GGCCCGTCCG AGGTGGTGGT GGTCGCCGAC
AGCACCGGCA ATCCGGACTG GATCGCCGCG GACCTGCTGG CGCAGGCCGA GCACGACGCC
AATGCGCAGT CGATCCTGAT CACCGACGAC GCCGAACTCG CCGACGCCGT CGAACGCGCC
GTAACGGCGC AACTCACGAC GCTGCCGCGC GCCGAGATCG CCCGCGCCTC GTGGGAGGCC
TATGGCGCCA TCATCAAGGT CGCGCGTCTC GACGATGCGG TGGCGCTGGC GGACGCGATC
GCGGCCGAGC ATCTCGAGAT CATCACCGCC GATCCGGAAG CCTTCGCCGC GAAGATCCGC
AATGCGGGCG CGATCTTCCT GGGCGCGCAT ACGCCGGAGG CGATCGGTGA CTATGTCGGC
GGCTCCAACC ACGTGCTGCC GACGGCGCGC TCGGCGCGGT TCTCGTCTGG GCTCGGCGTG
CTCGACTTCA TGAAGCGCAC CTCGATCCTG AAATGCGGCC CCGAGCAGCT CGCCGCGCTG
GGCCCGGCTG CGATGACGCT CGGCCATGCC GAGGGTCTGG AGGCTCACGC GCGCTCGGTG
GGACTGCGTC TGAATCGGCG ATGA
 
Protein sequence
MSIRGLVGLC DSGSFQMPVR LDAASTDFPQ QFKAFLAMKR EVAADIEAAT RAIVDDVAAR 
GDAALLDATA KFDRLTLDAA GLRVTAADID AAVQACDPDT VDALKFARDR IEFFHRRQLP
KDDRFTDALG VELGWRWSSI EAVGLYVPGG TAAYPSSVLM NAIPAKVAGV DRVVMVVPSP
DGKLNPLVLA AASLGGVTEI YRVGGAQAVA ALAYGTATIA PVSKIVGPGN AYVAAAKRQV
FGRVGIDMIA GPSEVVVVAD STGNPDWIAA DLLAQAEHDA NAQSILITDD AELADAVERA
VTAQLTTLPR AEIARASWEA YGAIIKVARL DDAVALADAI AAEHLEIITA DPEAFAAKIR
NAGAIFLGAH TPEAIGDYVG GSNHVLPTAR SARFSSGLGV LDFMKRTSIL KCGPEQLAAL
GPAAMTLGHA EGLEAHARSV GLRLNRR
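
The sequences above are printed in blocks of 10 characters, 60 per line. A minimal sketch for joining such a listing back into one contiguous string, assuming whitespace is the only formatting present. `join_blocks` and `looks_like_cds` are hypothetical helper names, and the example uses only the last line of the gene sequence.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(listing: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(listing.split()).upper()


def looks_like_cds(seq: str) -> bool:
    """Rough sanity check: whole number of codons, ending in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


tail = join_blocks("GGACTGCGTC TGAATCGGCG ATGA")
print(len(tail))  # 24
print(tail[-3:])  # TGA
```

Applied to the full gene sequence, the same check should agree with the listed gene length of 1344 bp.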