Gene RPB_4478 details

Gene Information

Locus tag: RPB_4478
Symbol:
ID: 3912294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5067751
End bp: 5069289
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 68%
IMG OID: 637886381
Product: carboxylyase-like protein
Protein accession: YP_488072
Protein GI: 86751576
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
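
The start, end, gene length and protein length fields in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5067751, 5069289)
print(length)                  # 1539
print(protein_length(length))  # 512
```

Both values match the Gene Length (1539 bp) and Protein Length (512 aa) fields of the record.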


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0662178
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAGCC GCGTCAAACC GCCCTTTCCG GACCTCCGCG CTTTCGCCGC CTATCTGGAA 
TCCCGCGGGC AGTTGCACCG GATCAAGAAG CCGGTCTCGG TGGTCCACGA ACTGACCGAA
ATCCATCGCC GCGTGCTGCA CGCCGGCGGC CCGGCGCTGC TGATCGAACA TCCGGTCAAG
GCCGACGGCA CGCCGTCGGA GATGCCGATT CTGGTCAATT TGTTCGGCAC CGTCGAGCGG
GTGGCGTGGG GGCTCGGCAT CGAACCGCAA AATCTGTCGG CGCTGGGCGA AGCGCTCGCC
GAAATGCGCG AGCCGGCGCC GCCGCAAAGC CTCACCGACG CATTGAGCAA ATTGCCGATG
GCCCGCGCAG CGCTGTCGAT GCGGCCGAAG ACCGCCAAGA CCGCACCGGC GCAGGACGTG
GTGCTGACCG GGGACGCGGT CGATCTCGGC CGGCTGCCGA TCCAGATTCC GTGGCCGGGC
GAGCCGGCGC CGCTGATCAC CTGGCCGCTG GTGTTCACCA GGCCGCCACC CGGCGCGCCC
GGCACCGACA ATGTCGGCGT CTACCGCATC CAGGTGCTGG GCAAGGATCG CATCATCATG
CGCTGGCTGG CGCATCGCGG CGGCGCCAAG CATCACCACC AATGGAAGGC CGAGGGGCGC
GAGATGCCGG TCGCGATCGT GATCGGCGCC GACCCGGCGA TGATCCTGTC GGCGGTGCTG
CCGCTGCCGG AGAATATCTC CGAGATCAAA TTCTCCGGGC TGCTGCGCGG CGACCGCCCG
AGCATGACGC CCTGCGTCGG GATTCCGCTG AACGTGCCGG CCGACGCCGA GATCGTGCTG
GAAGGCTTCG TGTCGCCGAC CGAGACCGCG CCGGAGGGCC CCTATGGCGA CCACACCGGC
TATTACAACG CGGTCGAGGA ATTCCCGGTG ATGCGGATCA CCGCGATCAC GATGCGGCGC
AGTCCGATCT ATCTGTCGAC CTATACGGGG CGTCCGCCGG ACGAGCCGTC GCGGCTCGGC
GAGGCGTTCA ACGACGTGTT CCTCCCCGTC GCGCGGCGGC AGTTTCCGGA GATCGTCGAT
CTGTGGCTGC CGCCGGAGGC GTGCTCCTAT CGAATTGCGG TCGCCTCGAT CAAGAAACGC
TATCCCGGCC AGGCGCGGCG GCTGATGATG GGGCTGTGGT CGATGCTGCC GCAGTTCAGC
TACACCAAGC TTTTGATCAT CGTCGACGAC GACGTCGACG TGCGCGACTG GGCCGACGTG
ATGTGGGCGG TGTCGACCCG CGCCGACACC TCGCGCGACA TGCTGTCGAT CAGCGATACG
CCGATCGACT ATCTCGATTT CGCCTCGCCG AAATCCGGGC TCGGCGGCAA GCTGGGCATC
GACGCCACCA ACAAGATCGG CACCGAGACC GAGCGCGAAT GGGGCAAGGT GCTGGAGATG
GACAAGGACG TGATCGCGCG GGTCGACGCA ATGTGGTCGA GCCTCGGGCT GCCCCCGGCG
CCGACGCCGG CCCTGGCGCA ACGCAGGCTG CTGCGATGA
 
Protein sequence
MLSRVKPPFP DLRAFAAYLE SRGQLHRIKK PVSVVHELTE IHRRVLHAGG PALLIEHPVK 
ADGTPSEMPI LVNLFGTVER VAWGLGIEPQ NLSALGEALA EMREPAPPQS LTDALSKLPM
ARAALSMRPK TAKTAPAQDV VLTGDAVDLG RLPIQIPWPG EPAPLITWPL VFTRPPPGAP
GTDNVGVYRI QVLGKDRIIM RWLAHRGGAK HHHQWKAEGR EMPVAIVIGA DPAMILSAVL
PLPENISEIK FSGLLRGDRP SMTPCVGIPL NVPADAEIVL EGFVSPTETA PEGPYGDHTG
YYNAVEEFPV MRITAITMRR SPIYLSTYTG RPPDEPSRLG EAFNDVFLPV ARRQFPEIVD
LWLPPEACSY RIAVASIKKR YPGQARRLMM GLWSMLPQFS YTKLLIIVDD DVDVRDWADV
MWAVSTRADT SRDMLSISDT PIDYLDFASP KSGLGGKLGI DATNKIGTET EREWGKVLEM
DKDVIARVDA MWSSLGLPPA PTPALAQRRL LR
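
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below illustrates this under stated assumptions: it uses the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which this sketch does not handle), and it treats the spaces in the listing as grouping only. The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Strip the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCTGAGCC GCGTCAAACC GCCCTTTCCG"))  # MLSRVKPPFP
print(translate("CTGCTGCGATGA"))                      # LLR
```

The two printed fragments agree with the start (MLSRVKPPFP) and end (LLR) of the protein sequence listed in the record. Passing the full gene sequence to `gc_content` would give a value to compare against the reported GC content field.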