Gene RPB_3567 details


Gene Information

Locus tag: RPB_3567
Symbol:
ID: 3911369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4088395
End bp: 4089675
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 72%
IMG OID: 637885469
Product: hydroxypyruvate reductase
Protein accession: YP_487173
Protein GI: 86750677
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2379] Putative glycerate kinase
TIGRFAM ID:
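
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4088395
END_BP = 4089675
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, not counting the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_len = gene_length(START_BP, END_BP)
computed_protein_len = protein_length(computed_gene_len)

# Both comparisons hold for this record.
print(computed_gene_len == GENE_LENGTH_BP)        # True
print(computed_protein_len == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 4089675 - 4088395 + 1 = 1281 bp, and 1281 / 3 - 1 = 426 aa, which agrees with the listed Gene Length and Protein Length.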


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.187616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.669404
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACC AACGTCCCCT GCTCCGCGCG CTGTTCGACG CCGCCGTCGC GGCCGCGCAT 
CCGGACAGCA TTCTCGCCGC GCATCTGCCG CCGCTGCCGC GCGGCCGGAT CATCTGCCTC
GCCGCCGGCA AGGGTGCCGC CGCGATGGCC GCCGCCGCGG AGCGGCATTA TCTCGACACG
CTCGGGCTCG CGCCCTCACG TCTGATCGGC ATCGCCACCA CCCGCCACGG CCATCGCGTG
GCGACCCGCG CCATCGACGT GATCGAGGCC GGGCACCCGA TGCCCGACGC CGAAGGGCTG
CGCGGTGCCG AAGCGAGCCT GAAGCTCGCC GCCACCGCGA CCGCCGACGA TCTGCTGCTG
GTGCTGCTGT CGGGCGGCGG CTCGGCGAAC TGGATCCTGC CGGCCGACGG CATCACGCTC
GCGCAAAAGC AGGCCACCAC GCGCGCGCTG CTGCGCTCCG GCGCGCCGAT CGGCGAGGTC
AACACCGTCC GCAAGCATCT GTCGCGGATC AAGGGCGGCC GCCTCGCTTG CGCCGGCAGC
AGCGCCGCCG AAATCGTGAC GCTGGCGATT TCCGACGTGC CGCGCGACGA GGCATCGGCG
ATAGCGTCCG GGCCGACCGT GCCCGATCCG ACGACGCTGG ACGACGCCCG CGCACTGGTG
GCGCGCTACA AGCTCGACAT CGACGACGCA GTCCATGCCG CGCTGAATGA TCCACGCAAC
GAAAGCTGCA AGCCGGGCGA CGCCGCTTTC GCCCGCGCCC GCTTCGCCAT CATCGCGCGG
CCGCGGCAAT CGCTGGACGC CGCGATCAAG CTGGCGCGCG ATTCCGGCTA TGCGATCGCC
GATCTCGGCG CCGATCTCGA AGGCGAAGCC CGCGACGTGG CTGCCGCCCA CGCCCGGCTC
GCGCGCGAGG CCCGTGCGGC CGGCAGGCGG CTCGCGATCA TCTCCGGCGG CGAACTCACC
GTCACCGTGC GCGGCAACGG CCGCGGCGGC CCCAACCAGG AATATGCGCT GGCGCTGGCG
CAGCACCTGC GCGACCTGCC GGACATCGCA GCCCTCGCCG CCGACACCGA CGGCGCCGAC
GGCGGCGCCG GCCACGCCAC CGACCCCGCC GGCGCGCTGA TCGACGCCCG CACCTTCGCG
AAGATCGACG AGCGCGATCT CGACCCTACC GCCTATCTGG CGAACAACGA CGCTACCGGC
TTCTTCGACC AGACCGGCGA CCTGCTCGTC ACCGGCCCGA CGCTGACCAA CGTCAACGAT
ATCCGGGTGA TCCTGGTGTA G
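
The GC content field reported for this gene can be recomputed from a sequence printed in 10-base blocks like the one above. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the total sequence length; `gc_content` is a hypothetical helper, and only the final row of the gene sequence is used here for brevity.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# Final row of the gene sequence, copied with its block spacing intact.
LAST_ROW = "ATCCGGGTGA TCCTGGTGTA G"
print(f"{gc_content(LAST_ROW):.0%}")
```

Passing the full 1281 bp sequence (all rows joined) to the same function gives the whole-gene value to compare with the GC content field.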
 
Protein sequence
MTDQRPLLRA LFDAAVAAAH PDSILAAHLP PLPRGRIICL AAGKGAAAMA AAAERHYLDT 
LGLAPSRLIG IATTRHGHRV ATRAIDVIEA GHPMPDAEGL RGAEASLKLA ATATADDLLL
VLLSGGGSAN WILPADGITL AQKQATTRAL LRSGAPIGEV NTVRKHLSRI KGGRLACAGS
SAAEIVTLAI SDVPRDEASA IASGPTVPDP TTLDDARALV ARYKLDIDDA VHAALNDPRN
ESCKPGDAAF ARARFAIIAR PRQSLDAAIK LARDSGYAIA DLGADLEGEA RDVAAAHARL
AREARAAGRR LAIISGGELT VTVRGNGRGG PNQEYALALA QHLRDLPDIA ALAADTDGAD
GGAGHATDPA GALIDARTFA KIDERDLDPT AYLANNDATG FFDQTGDLLV TGPTLTNVND
IRVILV
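
The protein sequence can be derived from the gene sequence using the translation table listed for this record (table 11). The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which table 11 shares with the standard code for internal codons; it does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The `translate` function is hypothetical and written without external libraries.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final row of the gene sequence in this record.
print(translate("ATGACCGACC AACGTCCCCT GCTCCGCGCG"))  # MTDQRPLLRA
print(translate("ATCCGGGTGA TCCTGGTGTA G"))           # IRVILV
```

The two outputs match the first ten and the last six residues of the protein sequence shown above; the final row starts in frame because the preceding 1260 bases are a multiple of three.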