Gene RPD_4328 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4328 
Symbol 
ID4024852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4794723 
End bp4796261 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content68% 
IMG OID637964537 
Productcarboxylyase-like protein 
Protein accessionYP_571446 
Protein GI91978787 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.568607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAACC GCGTTAAACC GCCTTTTCCG GACCTTCGCG CGTTCGCCGC CTATCTGGAA 
TCCCGCGGAC AGTTGCACCG CATCCGCAAG CCGGTCTCGG TGGTGCACGA CCTCACCGAG
ATTCATCGCC GCGTGCTGCA CGCCGGCGGC CCGGCGCTGA TGATCGAACA GCCGATCAAG
GCCGATGGCA CGCCGTCCGA GATGCCGATG CTGGTCAATC TGTTCGGCAC CGTCGAGCGG
GTGGCGTGGG GCCTCGGCAT CCTGCCCGAG AATCTGCCGC AGCTCGGCGA GGCACTCGCC
GAAATGCGCG AGCCGGCGCC GCCGCAGAGC CTGACCGACG CGCTGAGCAA GCTGCCGATG
GCGAGGGCCG CCTTGTCGAT GCGCCCGAAG ATGGCCCGGA CGGCGCCCGC GCAGGACGTG
GTGCTGACCG GCGACGCGGT CGATCTCGGC CGGCTGCCGG TGCAGATCCC GTGGCCGGGC
GAGCCGGCGC CGCTGATCAC CTGGCCGCTG GTGTTCACCA AGCCGCCGCC CGGCGTGGCC
GGCACCGACA ATGTCGGCGT TTACCGGATG CAGGTGCTGG GCAAGGATCG CGTCATCATG
CGCTGGCTGG CGCATCGCGG CGGCGCCAAG CATCACCATC AATGGGCGGC GCAGAAGCGC
GAGATGCCGG TCGCGGTGGT GATCGGCGCC GATCCGTCGA TGATCCTGTC GGCGGTGCTG
CCGCTGCCGG AGACGGTGTC CGAAATCAAG TTCTCGGGCC TGCTGCGCGG CGACCGGCCG
AGCCTGACGC CCTGCGTCAG CATTCCGCTC AACGTGCCGG CCGATGCCGA GATCGTGCTG
GAGGGCCTGG TGTCGCCGAC CGAGACCGCG CCGGAGGGGC CGTATGGCGA CCACACCGGC
TATTACAACG CGGTCGAGGA ATTCCCGGTG ATGCGGATCA CCGCGATCAC GATGCGGCGC
AACCCGATCT ATCTCTCGAC CTACACCGGA CGGCCGCCGG ACGAGCCGTC GCGGCTCGGC
GAAGCGCTCA ACGACGTGTT TCTGCCGGTG GCGCGGCGGC AGTTTCCCGA GATCGTCGAT
CTGTGGCTGC CGCCGGAAGC GTGCTCGTAC CGAATCGCGG TCGCCTCGAT CAAGAAACGT
TATCCCGGCC AGGCGCGGCG GCTGATGATG GGGCTGTGGT CGATGCTGCC GCAGTTCAGC
TACACCAAGC TCCTGATCAT CGTCGACGAC GACGTCGACG TCCGCGAATG GGCCGATGTG
ATGTGGGCGG TGTCGACCCG CTGCGACACC TCGCGCGACA TGGTGGCGAT CAGCGACACC
CCGATCGACT ATCTCGACTT CGCCTCGCCG AAATCCGGAC TCGGGGGCAA ACTCGGCATC
GACGCCACCA ACAAGATCGG CACCGAGACC GAGCGCGAAT GGGGCAAGGT GCTGGAGATG
GACGACGATG TGATCGCGCG CGTCGACGCG ATGTGGTCCG GCCTCGGGCT CGCGCCCGCG
TTGACGCCGG CCGCGGCGCA ACGCAAGCTG CTGCGATGA
 
Protein sequence
MLNRVKPPFP DLRAFAAYLE SRGQLHRIRK PVSVVHDLTE IHRRVLHAGG PALMIEQPIK 
ADGTPSEMPM LVNLFGTVER VAWGLGILPE NLPQLGEALA EMREPAPPQS LTDALSKLPM
ARAALSMRPK MARTAPAQDV VLTGDAVDLG RLPVQIPWPG EPAPLITWPL VFTKPPPGVA
GTDNVGVYRM QVLGKDRVIM RWLAHRGGAK HHHQWAAQKR EMPVAVVIGA DPSMILSAVL
PLPETVSEIK FSGLLRGDRP SLTPCVSIPL NVPADAEIVL EGLVSPTETA PEGPYGDHTG
YYNAVEEFPV MRITAITMRR NPIYLSTYTG RPPDEPSRLG EALNDVFLPV ARRQFPEIVD
LWLPPEACSY RIAVASIKKR YPGQARRLMM GLWSMLPQFS YTKLLIIVDD DVDVREWADV
MWAVSTRCDT SRDMVAISDT PIDYLDFASP KSGLGGKLGI DATNKIGTET EREWGKVLEM
DDDVIARVDA MWSGLGLAPA LTPAAAQRKL LR