Gene RPD_2047 details


Gene Information

Locus tag: RPD_2047
Symbol:
ID: 4022529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2294807
End bp: 2296000
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 64%
IMG OID: 637962240
Product: 3-hydroxyisobutyrate dehydrogenase
Protein accession: YP_569183
Protein GI: 91976524
COG category: [I] Lipid transport and metabolism
COG ID: [COG2084] 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases
TIGRFAM ID:
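
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts one terminal stop codon; both are assumptions about this record's conventions, not something the record states.

```python
# Values copied from the Gene Information fields.
start_bp = 2294807
end_bp = 2296000
reported_gene_length = 1194     # bp
reported_protein_length = 397   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon that encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length matches:", gene_length == reported_gene_length)
print("whole number of codons:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under those two assumptions all three checks hold for this record.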


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.996651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.228122
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTTCT CCTCCGCCGA TCCGCGCCAG ACCGAACATA TGGGGTGGTT GGGTTCAAGG 
ACAGAATTCA ACTCTCACCC GCCCTCGCCA TCGTTACGGC TGGTCTTACA GCAGCGGATG
ATCTCGGCGA AACGGTCTTT TGCACTGCAA GAGATACGGC CCCAGCAATT CGATCGAAGA
CAATTGCCCC CGTCAGCCCC TTGCCCGTCG AGCCTCGCGG CCTATGCTGA CTCACCTGCC
GCGGGCCTGC ATCAGGCTAC GCTGACTGCC TCAACTCGAT TCGGCCGGCA TTCGATCGGG
CCGCGCAATC TGAAGGGTAC AGACATGGCC AAAGTCGCTT TCCTCGGTCT CGGCGTGATG
GGTTTTCCGA TGGCCGGACA TCTCGTCAAA AAGGGAGGGC ACGACGTCAC CGTCTATAAT
CGTACCGCCG CAAAAGCAAA GAGCTGGGCC GATCAGTTTG GAGGCCGCAC GGCGGCGACG
CCGGCTGAAG CGGCCAAGGA TCAGGACTTC GTGATGGCCT GCGTCGGCAA CGACCACGAC
TTGCGGGCAG TGACCACAGG CGACGACGGC GCGTTCGCGG CGATGAAATC CGGCGCGATC
TTCGTCGATC ACACCACCGC GTCCGCCGAG GTCGCGCGCG AGCTGGATGC GGCCGCGACC
AAGGCCGGCT TCGCCTTCAT TGATGCGCCG GTGTCGGGCG GCCAGGCCGG CGCCGAGAAC
GGCGTCCTGA CGGTGATGTG CGGCGGCAGC GACGGGGCCT ATGCCAAGGC CGAGCCGGTG
ATCGCGTCCT ATGCGCGGAT GTGCAAGCTG CTCGGACCGG CCGGCTCCGG CCAGCTCACC
AAGATGGTCA ATCAGATCTG CATCGCCGGG CTGGTCCAGG GGCTGTCGGA AGGCATCCAC
TTCGCCAAGA AGGCGGGCCT CGACGTCAAC GCCGTGATCG ACACCATCTC CAAGGGCGCC
GCGCAGTCCT GGCAGATGGA GAACCGGCAC AAGACGATGA ACGACGGCAA ATACGATTTC
GGCTTCGCGG TCGAATGGAT GCGCAAGGAC CTGTCGATCT GCCTGGCCGA GTCCCGCCGC
AACGGCGCCA GCCTGCCGGT GACCGCGCTG GTGGATGCCT TCTACGCCGA AGTCGAAAAG
ATCGGCGGAC GCCGTTGGGA CACCTCCAGC CTGCTGGCAC GGCTCGAACG CTGA
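
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before anything is computed from it. As a minimal sketch, the GC content field could be recomputed as follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed row is used as sample input, so the printed value describes that row rather than the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed row of the gene sequence, used only as sample input.
first_row = "ATGGACTTCT CCTCCGCCGA TCCGCGCCAG ACCGAACATA TGGGGTGGTT GGGTTCAAGG"
bases = clean_sequence(first_row)
print(len(bases), "bases, GC fraction", round(gc_fraction(bases), 2))
```

Passing all of the sequence rows joined together to `clean_sequence` gives a value that can be compared with the reported GC content.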
 
Protein sequence
MDFSSADPRQ TEHMGWLGSR TEFNSHPPSP SLRLVLQQRM ISAKRSFALQ EIRPQQFDRR 
QLPPSAPCPS SLAAYADSPA AGLHQATLTA STRFGRHSIG PRNLKGTDMA KVAFLGLGVM
GFPMAGHLVK KGGHDVTVYN RTAAKAKSWA DQFGGRTAAT PAEAAKDQDF VMACVGNDHD
LRAVTTGDDG AFAAMKSGAI FVDHTTASAE VARELDAAAT KAGFAFIDAP VSGGQAGAEN
GVLTVMCGGS DGAYAKAEPV IASYARMCKL LGPAGSGQLT KMVNQICIAG LVQGLSEGIH
FAKKAGLDVN AVIDTISKGA AQSWQMENRH KTMNDGKYDF GFAVEWMRKD LSICLAESRR
NGASLPVTAL VDAFYAEVEK IGGRRWDTSS LLARLER
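
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons it permits. The function name `translate_cds` is hypothetical, and the code assumes the input contains only the unambiguous bases A, C, G and T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()   # drop the 10-base block spacing
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]   # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate_cds("ATGGACTTCT CCTCCGCCGA TCCGCGCCAG"))  # MDFSSADPRQ
```

The printed residues match the first ten residues of the protein sequence listed above.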