Gene RPC_0437 details


Gene Information

Locus tag: RPC_0437
Symbol:
ID: 3970199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 471210
End bp: 472259
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 69%
IMG OID: 637923553
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_530331
Protein GI: 90421961
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0702] Predicted nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
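
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 471210
end_bp = 472259
protein_length_aa = 349

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon, three bases each.
expected_gene_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp, expected_gene_length_bp)  # prints "1050 1050"
```

Both figures agree with the 1050 bp gene length reported in the record.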


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGAGC GCCACCAAGG CGCTGCCGCG CCGCGGCCTC GATTTTTCGC GATAGGATTC 
CAGCCCATGG CCTCGAATTT GGACACCCTC GTCACGGTTT TCGGCGGTTC GGGATTCATC
GGCCGGCATG TCGTCGGCGC GCTGGCCAAA CGCGATTTCC GCATCCGGGT CGCGGTGCGC
CGGCCGGATC TCACCGGGCA TCTGCAGCCG CTCGGCAAGG TCGGCCAGAT CCACGCCGTG
CAGGCCAACC TGCGCTATCC CGATTCGGTG CAGGCCGCGG TGCGCGACGC CGGCATCGTG
GTCAATCTGG TCGGCATCCT GGCCGAGGGC GGGGCGCAGA AATTCCAGGC GGTGCAGGCG
CAGGGCGCCG GCGCCATTGC GCAGGCCGCA GCCGCGGTCG GCGCCCGCAT GGTGCATGTC
TCGGCGATCG GCGCCGACGC GCAGTCAGCG TCGCTCTATG CCCGCTCCAA GGCCGCCGGA
GAGCAGGCGG TGCTCGCCGC GGTGCCGCAG GCTGTGATTT TCCGGCCCTC GGTGGTGTTC
GGCCCCGAGG ACCAGTTCAC CAACCGATTC GCCGGGCTGG CGCGGATGTC GGCAGTGGTG
CCGCTGATCG GCGGCGGCGC CACCAAATTG CAGCCGGTCT ATGTCGGCGA CGTCGCCACC
GCGGTGGCGC AGGCGGTCGA CGGCAAGGCC AAGCCGGGCG CCACCTACGA GCTCGGCGGC
CCGGAAGTGC TGACCATGCG GCAGGTGATC GAGATCATCC TCGACGTCAT CCAGCGCCGC
CGCATCCTGC TGTCATTGCC GTTCGGGCTG GCGCGGCTGC AGGCGCAGCT GCTGCAATTC
GCCCCCGGTC CGCTGAAGCT GACCCCCGAC CAAGTGGCCT TGCTGCAGGT CGACAATGTA
GTGTCGGAGG CCGCCCAGGC AGCCGGGCTG ACGCTGCAGG GGCTCGGCAT CCCGCCGGAT
TCGCTGCAGG CGATCGCGCC GTCCTATCTG TGGCGATTCC GTGCCACCGG CCAGTTCCAG
CGCAAGATCG TCGAGCCGAA GAACTCCTGA
 
Protein sequence
MQERHQGAAA PRPRFFAIGF QPMASNLDTL VTVFGGSGFI GRHVVGALAK RDFRIRVAVR 
RPDLTGHLQP LGKVGQIHAV QANLRYPDSV QAAVRDAGIV VNLVGILAEG GAQKFQAVQA
QGAGAIAQAA AAVGARMVHV SAIGADAQSA SLYARSKAAG EQAVLAAVPQ AVIFRPSVVF
GPEDQFTNRF AGLARMSAVV PLIGGGATKL QPVYVGDVAT AVAQAVDGKA KPGATYELGG
PEVLTMRQVI EIILDVIQRR RILLSLPFGL ARLQAQLLQF APGPLKLTPD QVALLQVDNV
VSEAAQAAGL TLQGLGIPPD SLQAIAPSYL WRFRATGQFQ RKIVEPKNS
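
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is only a sketch: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which this sketch ignores). Only the first line of the gene sequence is used as input here.

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above (60 bases, 20 codons).
first_line = "ATGCAAGAGC GCCACCAAGG CGCTGCCGCG CCGCGGCCTC GATTTTTCGC GATAGGATTC"
print(translate(first_line))  # prints "MQERHQGAAAPRPRFFAIGF"
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content.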