Gene RPB_4061 details


Gene Information

Locus tag: RPB_4061
Symbol:
ID: 3911868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4632059
End bp: 4633186
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 70%
IMG OID: 637885965
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_487665
Protein GI: 86751169
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
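
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The Python sketch below is an illustration only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4632059, 4633186)
print(length_bp)                  # 1128
print(protein_length(length_bp))  # 375
```

Both values agree with the Gene Length and Protein Length fields reported for RPB_4061.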


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCGC AGGATTTCAC CATCCGCAGC ATCGAGGCGA TGTGTTTCCG CTATCCGTTG 
TCGCTGCCGG TTGTGACCTC ATTCGGGCGG ATGACGACCC GCCCCGCGGT TTTCATCCGT
GTCACGGATG AAGACGGCAT CAGCGGCTGG GGCGAGGCGT GGGCGAACTT TCCCGCCACC
GGAGCCGAGC ATCGCGCCCG AACCATCAAC GAGGTGCTGG CGCCGCTCGC GGCGGGCGCC
CGCATCGCGC ATCCGTCCGA ATTCTTCGAG GCGCTGACGC AGCGCACGGC TGTGCTGGCG
CTGCAATCCG GCGAGCAAGG GCCGTTCGCG CAGGCGATCG CCGGCATCGA TCTGGCGCTG
TGGGATCTGT TCGCGCGACG CCGCGCCACG CCGCTGTGGC GGCTGCTCGG CGGCGCGAAT
GCCACGATCA AGGTCTATGC CAGCGGCATC AATCCCACCG GCACGCGGGA GATGGCCGAA
ACCGCGCTGG CGCGCGGCCA TCGCGCGCTC AAGCTGAAGA TCGGCTTCGG CGCCGAGATC
GATCACCCCA ATCTCGCCGC GCTGCGCGCG CTGGCCGGCG ACGGCACGCT GGCCGCCGAC
GCCAATCAGG CCTGGACGCT GCAGCAGGCA TGCGAGGCGG CACCGCATTT GCGCGACTAC
AATCTCGCCT GGCTGGAGGA GCCGATCCGC GCCGATCGTC CGTGGCCGGA ATGGCAGGCG
CTGCGCCGCG CCGCCACGAT GCCGCTCGCC GCCGGCGAGA ATTTCGCGAG CCGCGAGAGC
TTTCAGCAAG CGCTGTCCGA CGACACGCTC GGCGTCATCC AGCCCGATAT CGCCAAATGG
GGCGGGCTGT CGGCCTGCGC GCCGATCGCC CGCGACATCG TGGCCGCCGG CAAGCGGTTC
TGCCCGCATT ATCTCGGCGG CGGCATCGGT CTGCTGGCAT CGGCGCATCT GCTCGCCGGC
ATCGGCGGTG ACGGCCTGCT GGAGGTCGAC GCCAACGACA ATCCGCTGCG CGAAGCGTTC
TGTGGCCCCG TCGCCGCGAT CAGCGACGGC GCCATCACGC TCGGCGATGC GCCCGGCCTC
GGAGTCGAGC CGGACCTCGT CGGCATCGCG CAGTATCGAA CCGTATAG
 
Protein sequence
MTAQDFTIRS IEAMCFRYPL SLPVVTSFGR MTTRPAVFIR VTDEDGISGW GEAWANFPAT 
GAEHRARTIN EVLAPLAAGA RIAHPSEFFE ALTQRTAVLA LQSGEQGPFA QAIAGIDLAL
WDLFARRRAT PLWRLLGGAN ATIKVYASGI NPTGTREMAE TALARGHRAL KLKIGFGAEI
DHPNLAALRA LAGDGTLAAD ANQAWTLQQA CEAAPHLRDY NLAWLEEPIR ADRPWPEWQA
LRRAATMPLA AGENFASRES FQQALSDDTL GVIQPDIAKW GGLSACAPIA RDIVAAGKRF
CPHYLGGGIG LLASAHLLAG IGGDGLLEVD ANDNPLREAF CGPVAAISDG AITLGDAPGL
GVEPDLVGIA QYRTV
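
The sequences above are displayed in blocks of ten characters, so the whitespace has to be removed before they can be used. The Python sketch below shows one way to do that and to translate the DNA into protein. It is a minimal illustration under stated assumptions: the helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, and the codon table is built from the standard genetic code, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG).

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First displayed line of the gene sequence.
first_line = "ATGACCGCGC AGGATTTCAC CATCCGCAGC ATCGAGGCGA TGTGTTTCCG CTATCCGTTG"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MTAQDFTIRSIEAMCFRYPL
```

The translated residues match the first two blocks of the protein sequence. Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field.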