Gene RPB_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2026 
Symbol 
ID3909840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2303502 
End bp2304929 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content67% 
IMG OID637883920 
ProductD-lactate dehydrogenase (cytochrome) 
Protein accessionYP_485645 
Protein GI86749149 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID[TIGR00387] glycolate oxidase, subunit GlcD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0174906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.155014 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCCTGA CCATCACCAA CACGCCGAAG CGCGCCGAGC CGCAGGCCGT GGCCAGCGCC 
ATTGAAGCGC TCGCGGCGCG GTTCGGCAAC CGTCTCGTCA CGTCGCTCGC GGTTCGCGAA
CAGCACGGCC ACACCACCAC CTGGCTGCCG AACCAGCCGC CGGACGCGGT GGTGATGGCG
CAGGAGACGG CGGACATCCA GGACGTGGTG CGCATTTGTG CCAAATACCG CGTGCCGGTG
ATCGCGTTCG GCACCGGCAC CTCGCTGGAG GGCCACGTCA ACGCCCCGGC TGGCGGCATT
TCGATCGACC TGCGCGACAT GAACAAGATC CTCAGCGTTC ATGCCGAGGA TCTCGACTGC
GTGATCCAGC CCGGCGTCAC CCGCAAGGCT CTGAACGAGG ACCTTCGCGA CCAGGGCCTG
TTCTTCCCGA TCGATCCGGG CGCCGACGCC TCGATCGGCG GCATGGCGGC GACGCGCGCC
TCCGGCACCA ATGCGGTCCG CTACGGCACC ATGCGCGACA ACGTGCTGGC GCTGAAAGTC
GTTCGCGGCG ACGGCGAGAT CATCACCACC GGCACCCGCG CCAAAAAGTC CGCCGCCGGC
TACGACCTGA CGCATCTGTT CGTCGGCAGC GAGGGCACGC TCGGCATCAT TTCGGAACTG
ACCATCAAGC TGCGCGGCAT CCCCGAGGTG ATCGCGGCGG CCTCGTGCTC GTTTTCGTCG
GTCACCGACG CCTGCCAGGC GGTGATCCTG GCGATCCAGA CCGGCATCCC GCTGGCGCGG
ATCGAGCTGC TCAGCGAGTC CCAGGTCAGG GCCGTCAACG CCTATTCCAA GCTGACGCTG
CCGGAGACGC CGCTGCTGCT GCTGGAATTC CACGGCAGCG AGGTCGAGGT CGGCGAGCAG
TCGAAGAATT TCGGCGCGAT CGCCAAGGAT TGCGGCGGCG GCGACTTCAC CTGGACGACG
CGGCCCGAAG ACCGCAACAA GCTCTGGCAG GCCCGGCACG ACGCCTATTG GTCGGTGCGG
GCGCTGCGGC CCGGCGACGG CGTCGGCGTG GTCGCCACCG ATGTCTGCGT GCCGATCTCC
CGGCTGGCCG ATTGCGTCGC CGAGACTGAG CAGGACATGG CGCGGCTCGG CCTGCTGGCG
CCGATCGTCG GCCATGTCGG CGACGGCAAT TTCCATTGCT CGCTGCTATG CGACGTCAAC
GACGCCGACG AGATGGCGCG CGCCGATGAG TTCATGCACC GTCTGGTCGA GCGGGCGCAG
GCAATGGACG GCACCTGCAC CGGCGAACAC GGCATCGGTC AGGGCAAGCA GAAATATCTT
CAAGCCGAAC TCGGCATCGA GGCGCTGCAG GCGATGCGCG CGATCAAGCA GGCGCTTGAC
CCGCAAAACA TCTTCAATCC CGGCAAGATC CTGCCGCAAG GGCTTTGA
 
Protein sequence
MGLTITNTPK RAEPQAVASA IEALAARFGN RLVTSLAVRE QHGHTTTWLP NQPPDAVVMA 
QETADIQDVV RICAKYRVPV IAFGTGTSLE GHVNAPAGGI SIDLRDMNKI LSVHAEDLDC
VIQPGVTRKA LNEDLRDQGL FFPIDPGADA SIGGMAATRA SGTNAVRYGT MRDNVLALKV
VRGDGEIITT GTRAKKSAAG YDLTHLFVGS EGTLGIISEL TIKLRGIPEV IAAASCSFSS
VTDACQAVIL AIQTGIPLAR IELLSESQVR AVNAYSKLTL PETPLLLLEF HGSEVEVGEQ
SKNFGAIAKD CGGGDFTWTT RPEDRNKLWQ ARHDAYWSVR ALRPGDGVGV VATDVCVPIS
RLADCVAETE QDMARLGLLA PIVGHVGDGN FHCSLLCDVN DADEMARADE FMHRLVERAQ
AMDGTCTGEH GIGQGKQKYL QAELGIEALQ AMRAIKQALD PQNIFNPGKI LPQGL