Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4061 |
Symbol | |
ID | 3911868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4632059 |
End bp | 4633186 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637885965 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_487665 |
Protein GI | 86751169 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGC AGGATTTCAC CATCCGCAGC ATCGAGGCGA TGTGTTTCCG CTATCCGTTG TCGCTGCCGG TTGTGACCTC ATTCGGGCGG ATGACGACCC GCCCCGCGGT TTTCATCCGT GTCACGGATG AAGACGGCAT CAGCGGCTGG GGCGAGGCGT GGGCGAACTT TCCCGCCACC GGAGCCGAGC ATCGCGCCCG AACCATCAAC GAGGTGCTGG CGCCGCTCGC GGCGGGCGCC CGCATCGCGC ATCCGTCCGA ATTCTTCGAG GCGCTGACGC AGCGCACGGC TGTGCTGGCG CTGCAATCCG GCGAGCAAGG GCCGTTCGCG CAGGCGATCG CCGGCATCGA TCTGGCGCTG TGGGATCTGT TCGCGCGACG CCGCGCCACG CCGCTGTGGC GGCTGCTCGG CGGCGCGAAT GCCACGATCA AGGTCTATGC CAGCGGCATC AATCCCACCG GCACGCGGGA GATGGCCGAA ACCGCGCTGG CGCGCGGCCA TCGCGCGCTC AAGCTGAAGA TCGGCTTCGG CGCCGAGATC GATCACCCCA ATCTCGCCGC GCTGCGCGCG CTGGCCGGCG ACGGCACGCT GGCCGCCGAC GCCAATCAGG CCTGGACGCT GCAGCAGGCA TGCGAGGCGG CACCGCATTT GCGCGACTAC AATCTCGCCT GGCTGGAGGA GCCGATCCGC GCCGATCGTC CGTGGCCGGA ATGGCAGGCG CTGCGCCGCG CCGCCACGAT GCCGCTCGCC GCCGGCGAGA ATTTCGCGAG CCGCGAGAGC TTTCAGCAAG CGCTGTCCGA CGACACGCTC GGCGTCATCC AGCCCGATAT CGCCAAATGG GGCGGGCTGT CGGCCTGCGC GCCGATCGCC CGCGACATCG TGGCCGCCGG CAAGCGGTTC TGCCCGCATT ATCTCGGCGG CGGCATCGGT CTGCTGGCAT CGGCGCATCT GCTCGCCGGC ATCGGCGGTG ACGGCCTGCT GGAGGTCGAC GCCAACGACA ATCCGCTGCG CGAAGCGTTC TGTGGCCCCG TCGCCGCGAT CAGCGACGGC GCCATCACGC TCGGCGATGC GCCCGGCCTC GGAGTCGAGC CGGACCTCGT CGGCATCGCG CAGTATCGAA CCGTATAG
|
Protein sequence | MTAQDFTIRS IEAMCFRYPL SLPVVTSFGR MTTRPAVFIR VTDEDGISGW GEAWANFPAT GAEHRARTIN EVLAPLAAGA RIAHPSEFFE ALTQRTAVLA LQSGEQGPFA QAIAGIDLAL WDLFARRRAT PLWRLLGGAN ATIKVYASGI NPTGTREMAE TALARGHRAL KLKIGFGAEI DHPNLAALRA LAGDGTLAAD ANQAWTLQQA CEAAPHLRDY NLAWLEEPIR ADRPWPEWQA LRRAATMPLA AGENFASRES FQQALSDDTL GVIQPDIAKW GGLSACAPIA RDIVAAGKRF CPHYLGGGIG LLASAHLLAG IGGDGLLEVD ANDNPLREAF CGPVAAISDG AITLGDAPGL GVEPDLVGIA QYRTV
|
| |