Gene RPC_1451 details


Gene Information

Locus tag: RPC_1451
Symbol:
ID: 3973429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 1577784
End bp: 1578779
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 69%
IMG OID: 637924566
Product: mandelate racemase/muconate lactonizing enzyme
Protein accession: YP_531332
Protein GI: 90422962
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
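
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record; variable names are hypothetical.
start_bp = 1577784
end_bp = 1578779
reported_gene_length = 996      # bp
reported_protein_length = 331   # aa

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 996 331
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.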


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.470419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCCA GCAAATTATC GCGACTGATC GCACGGATCG AAAGATGGCC GATTGCGGGC 
GCTTTCACGA TTAGTCGCGG TTCGAAGACC GAGGCGGTGG TGGTGGTGGC GGAGATCAGC
CGTGGCGGCT ACGTCGGGCG GGGTGAATGC GTACCTTACG CGCGCTATGG CGAGACTCCA
GATGCGACAT TGGCGATGAT TGAGGCGTTA GCAGGGCCGC TTGGCCGCGG AATGGATCGC
CAGGCGCTGC AGGCTGCGCT GCCCGCCGGC GCCGCCCGCA ACGCGCTGGA CTGCGCGCTG
CTCGATCTCG AGGCCAAGAG CAGCGGCGGC CGGGTCTGGG ATCTGCTCGG CCGTGCCGCG
CCGCGCCCCT GCACCACCGC CTATACGATT TCGCTGGGAA CGCCCGAGGC GATGGCCGCC
GCGGCCGCCA AGGCTGCCGG GCGGCCGCTG TTGAAGGTCA AGCTCGGCGG CACCGAGGAC
GGAGCCAGGA TCGCGGCGGT GCGCCGGGCG GCGCCGGAAT CCGAATTGAT CGTCGATGCC
AACGAGGCCT GGACCGCGGA CAACCTCGAA CAGAATCTGG CCGAATGCGC CGAGGTCGGC
GTCACCCTGG TGGAGCAGCC GCTGCCCGCC GACAACGACG CGGCGCTGGC CCGGATCCGC
CGGCCGATGG CGGTCTGCGC CGACGAGAGC GTGCATGATC TGGCGTCGCT CGAGGGTTTG
CGCGAGCGCT ATGACGCCAT CAACATCAAG CTCGACAAGG CCGGCGGATT GACCGAGGCG
ATAGCGATGG CCGACGCGGC GCGGGCGCAG GGCTTGGAGA TCATGGTCGG CTGCATGGTG
GCGACCTCGC TTGCGATGGC GCCGGCGATG CTGCTGGCGC AGCAGGCCCG CTTCGTCGAC
CTCGACGGCC CACTGCTGCT GGCCGGCGAC CGTGACGACG GGCTGCGCTA CGACGGCAGC
ACGGTCTATC CGCCGGACCC GGAGCTTTGG GGCTGA
 
Protein sequence
MTSSKLSRLI ARIERWPIAG AFTISRGSKT EAVVVVAEIS RGGYVGRGEC VPYARYGETP 
DATLAMIEAL AGPLGRGMDR QALQAALPAG AARNALDCAL LDLEAKSSGG RVWDLLGRAA
PRPCTTAYTI SLGTPEAMAA AAAKAAGRPL LKVKLGGTED GARIAAVRRA APESELIVDA
NEAWTADNLE QNLAECAEVG VTLVEQPLPA DNDAALARIR RPMAVCADES VHDLASLEGL
RERYDAINIK LDKAGGLTEA IAMADAARAQ GLEIMVGCMV ATSLAMAPAM LLAQQARFVD
LDGPLLLAGD RDDGLRYDGS TVYPPDPELW G
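
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_content` are hypothetical, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the tables differ in which codons may act as start codons). Only the first 60 bases of the gene sequence are pasted in as a demonstration.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGACTTCCA GCAAATTATC GCGACTGATC GCACGGATCG AAAGATGGCC GATTGCGGGC"
print(translate(first_line))  # MTSSKLSRLI ARIERWPIAG without the spacing
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the GC content field.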