Gene RPB_3134 details


Gene Information

Locus tag: RPB_3134
Symbol:
ID: 3910935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3581229
End bp: 3582812
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 70%
IMG OID: 637885036
Product: hypothetical protein
Protein accession: YP_486741
Protein GI: 86750245
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
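
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3581229
end_bp = 3582812
gene_length_bp = 1584
protein_length_aa = 527

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which encodes no residue,
# so a CDS of N codons yields N - 1 amino acids.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp, codons, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, and the 528 codons correspond to the listed 527 residues plus one stop codon.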


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.673748
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.335659
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTCG CGGCACCGGC GGCGTCCGCC ATGCAATCGA TGACGGGAGC AGACCGGCTT 
TGCGATGCGC TACTGGCGCA CGAGGTCGAT GTCTGCTTCG CCAATCCTGG CACGTCGGAG
ATGCATTTCG TGGCGGCGCT CGATCGCCAG CCGCGGATGC GGTGCATCCT GGGTCTGTTC
GAGGGCGTCG TCACCGGCGC TGCGGACGGT TACGCCCGGA TGCGCGACCG GCCGGCCGCG
ACGCTGCTGC ATCTCGGCCC GGGCCTCGCC AACGGCCTAG CCAACATCCA CAACGCCCGC
CGCGCGCATT CGCCGATGAT CAACGTCGTC GGCGATCACG CGGTCGAGCA TCTGCCGCTG
GACTCGCCGC TGACCAGCGA CATCGACTCG CTGGCGCGCC CGATGTCGCG CTGGGTCAAA
CGGATCGGCG ATCCGACGCG GATCGATCTC GACGTGGCGG AGGCCTGGTC GCAGAGCATG
GCGCCCGGGA TTTCCACCCT GATCCTGCCG GCGGACATGG CCTGGAACGA GCATGTCTAT
CGCGCGCCGC CGGCGCGGGG CAGGCGGCTG CCGCAAGCGC CGCTTCAGCA GGAGACGGTG
GCGCGGATCG GCGCCGCGAT CCGCACCAAG CGGCGTGTCG TGCTGCTGCT GACCGGCGCC
GCCTTGCGTG AGCAGGCACT GTCGATTGCG GACGGCATCC GCCGCAGCGC CGGTGTCGAG
ATCTACGCGC AGGGCGCCAA CGGCCGGATC GAACGTGGCC GCGGCCGCGC GCCGATCGCG
AAGCTGGCGA TGACCCACGA GATCGGCGGC CGGATGCTGC AGGACGTCGA CATGGTGGTG
CTGATCGGCG CCAAGGAGCC GGTGTCGTTC TTCGCCTATC CGAACAGGCC GGGAAGGCTG
GCGCCGGCGA ATTGCGAGAT CGTCCGGCTG GCGGGGCCGG ATCAGGATTT GGTACAGGCG
CTCGAATGGC TGGCCGACGA GGTCGATGCC CCGCGCGCGC CGGTCGCGGT CGCGCCGCGC
GCCGCTGCGG CGCCGGTCGC CAGCGGGGCG TTGACCGTCG ATATGGTGAA CCGCCTCGTC
GCCGCGCGGC TGCCCGAGCA AGCGATCGTC TGCGACGAGG CACTGACCTC GTCGGGCTTC
TTCGACCTGT CTTACGACGC CCCGCCGCAC GACTATCTGC AGATCACCGG TGGCGCGATC
GGGATCGGCA TTCCGATGGC CGCGGGCGCC GCGGTCGCGT GTCCCGACCG CAAGGTGATC
AACCTGCAGG CCGACGGCAG CGGCATGTAC ACGGTGCAGG GTCTGTGGAC CCACGCGCGC
GAGAATCTCG ACGTTCTCAC CATCGTATTT TCGAACCGCT CCTACGCCAC GCTGTGGGGC
GAGATGAGCA AGGTCGGTGC GCAGACGCCG GGACGCAATG CGCAGCGCAT GCTGCAGCTC
GATCAGCCCG AGCTCGACTG GACCAAGCTG GCTGCGGGGC TCGGCGTCGA ATCCCGCCGA
GTCACGACGG CGTCGGAGTT CTCGCGCGCG TTCGATGCCG CGCTCGGCCG GCGTGGTCCT
GCCGTGATCG AGGCAATGGT GTGA
 
Protein sequence
MAVAAPAASA MQSMTGADRL CDALLAHEVD VCFANPGTSE MHFVAALDRQ PRMRCILGLF 
EGVVTGAADG YARMRDRPAA TLLHLGPGLA NGLANIHNAR RAHSPMINVV GDHAVEHLPL
DSPLTSDIDS LARPMSRWVK RIGDPTRIDL DVAEAWSQSM APGISTLILP ADMAWNEHVY
RAPPARGRRL PQAPLQQETV ARIGAAIRTK RRVVLLLTGA ALREQALSIA DGIRRSAGVE
IYAQGANGRI ERGRGRAPIA KLAMTHEIGG RMLQDVDMVV LIGAKEPVSF FAYPNRPGRL
APANCEIVRL AGPDQDLVQA LEWLADEVDA PRAPVAVAPR AAAAPVASGA LTVDMVNRLV
AARLPEQAIV CDEALTSSGF FDLSYDAPPH DYLQITGGAI GIGIPMAAGA AVACPDRKVI
NLQADGSGMY TVQGLWTHAR ENLDVLTIVF SNRSYATLWG EMSKVGAQTP GRNAQRMLQL
DQPELDWTKL AAGLGVESRR VTTASEFSRA FDAALGRRGP AVIEAMV
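
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch, not the method used to produce the record: the function names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is written out by hand using the standard amino acid assignments, which translation table 11 shares with the standard code for elongation codons (the two differ in permitted start codons). Only the first and last rows of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence above; each full row holds 60 bases,
# so every row starts in frame.
first_row = "ATGGCAGTCG CGGCACCGGC GGCGTCCGCC ATGCAATCGA TGACGGGAGC AGACCGGCTT"
last_row = "GCCGTGATCG AGGCAATGGT GTGA"

print(translate(first_row))  # MAVAAPAASAMQSMTGADRL
print(translate(last_row))   # AVIEAMV
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and GC content.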