Gene RPB_4226 details

Gene Information

Locus tag: RPB_4226
Symbol:
ID: 3912034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4802195
End bp: 4803190
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 67%
IMG OID: 637886129
Product: D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding
Protein accession: YP_487828
Protein GI: 86751332
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID:
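
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4802195
END_BP = 4803190


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # 996 bp; 331 aa
```

Under these assumptions the computed values agree with the listed Gene Length (996 bp) and Protein Length (331 aa).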


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.800657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.563491
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGTCG CCATTCTCGA CGATTACTTC GACACGCTGC GCACGCTGAA CTGCTTCGGC 
CGGCTGCAAG GCCACGACGT CACGGTGTTC AACGATCACG TCCAGGACAC CGATGCGCTG
GCTTCGCGGC TGCGCGACAC CGAGGCGCTG GTGCTGATCC GCGAGCGGAC GCAGATTCGT
GCCGCCCTGC TGGAGAAGCT GCCGCGGTTG AAGCTGATCA GCCAGCGCGG CGTCTATCCC
CACATCGATG TCGACGCCTG CACGCGGCTC GGAATCGTCG TGTCGTCGAA CATGAGCGCC
GGCGCGCCGT CCTATGCGGC GGCGGAATTG ACCTGGGGCC TGGTGCTGGG GGCGATGCGG
CAGATCCCGC AGCAGATGGC GGCGCTGAAG GCCGGCGTCT GGCAGATCGG CGTCGGTCAC
ACGCTGCGTG ACAAGACGCT CGGCATCTAC GGCTACGGCC GGATCGGCCG CGTCGTGGCG
GGCTACGGCC GCGCCTTCGG CATGACCGTG CTGGTCTGGG CGCGCGAGCC CAATCTCGCC
GAGGCGCGCG CCGACGGTTA TCAGATCGCC GGCAGCAAGG AAGACTTGTT TGCCCACAGT
GACGTGCTGT CGCTGCACAT GCGCTTGATC GACGCCACCC GCGGCATCGT CACGCGCGCG
GATCTGGCGC GGATGAAGCC GACGGCGCTG CTGGTCAACA CCAGCCGCGC CGGACTGATC
GAGCAGGGGG CCCTCGTCGC GGCGCTCCGC GCCGGGCGTC CCGGCATGGC GGCGATCGAT
GTGTTCGACA CCGAGCCGCT GCGCGATCCG CAGGATCCGC TACTGGCGAT GGACAACGTC
GTTGCCACGC CGCATATCGG CTACGTGTCG CGTGACGAAT ACGAGCTGCA ATTCGGCGAT
ATCTTCGAGC AGATCGTCGC CTATGCGGCG GGCGAGCCGA TCAATGTGGT CAACCCCGCA
TCACTGTCCT CGTCGCGGTC CTCGTCGCGG CGCTGA
 
Protein sequence
MKVAILDDYF DTLRTLNCFG RLQGHDVTVF NDHVQDTDAL ASRLRDTEAL VLIRERTQIR 
AALLEKLPRL KLISQRGVYP HIDVDACTRL GIVVSSNMSA GAPSYAAAEL TWGLVLGAMR
QIPQQMAALK AGVWQIGVGH TLRDKTLGIY GYGRIGRVVA GYGRAFGMTV LVWAREPNLA
EARADGYQIA GSKEDLFAHS DVLSLHMRLI DATRGIVTRA DLARMKPTAL LVNTSRAGLI
EQGALVAALR AGRPGMAAID VFDTEPLRDP QDPLLAMDNV VATPHIGYVS RDEYELQFGD
IFEQIVAYAA GEPINVVNPA SLSSSRSSSR R
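
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch. It builds the codon table from the standard NCBI amino acid string (translation table 11 uses the same codon assignments as the standard code and differs in its set of start codons) and translates a start codon in the first position as M, which is why the GTG at the start of this gene corresponds to the leading M of the protein. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool; the two test strings are the first and last lines of the gene sequence as printed in this record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq, is_cds_start=True):
    """Translate DNA in frame 0 and stop at the first stop codon.

    If is_cds_start is true and the first codon is a table 11 start codon,
    it is translated as M regardless of its usual amino acid.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


FIRST_LINE = "GTGAAAGTCG CCATTCTCGA CGATTACTTC GACACGCTGC GCACGCTGAA CTGCTTCGGC"
LAST_LINE = "TCACTGTCCT CGTCGCGGTC CTCGTCGCGG CGCTGA"

print(translate(FIRST_LINE))                      # MKVAILDDYFDTLRTLNCFG
print(translate(LAST_LINE, is_cds_start=False))   # SLSSSRSSSRR
```

The two outputs match the first 20 and last 11 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.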