Gene RPD_1020 details


Gene Information

Locus tag: RPD_1020
Symbol:
ID: 4021495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1158692
End bp: 1159960
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 64%
IMG OID: 637961211
Product: fumarylacetoacetase
Protein accession: YP_568159
Protein GI: 91975500
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR01266] fumarylacetoacetase
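
The coordinates and lengths listed above agree with each other if one assumes 1-based, inclusive coordinates and a gene length that includes the stop codon. Neither convention is stated in this record, so the following Python sketch treats both as assumptions; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1158692, 1159960)
print(length, protein_length(length))  # prints "1269 422"
```

Under these assumptions the computed values match the listed Gene Length (1269 bp) and Protein Length (422 aa).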


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.406627
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.926144
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCCCA ATGATCCGCG CTTGCGCTCC TTCATCGAGG TCGATCCGAC CTCGGATTTC 
CCCATTCAGA ATCTGCCCTA TGGCGTGTTC TCGACGGCGA GCACGCCGAC GCCGCGCGTC
GGCGTCGCGA TCGGCGACTA CGTCCTCGAT CTCGCCGTGT TGCAGGCTTC ACGCCTGATC
GATCTGCCGG ATGGCGTGTT CGCGCAATCC TCGATCAACG CTTTCATGGC GCTCGGGCCG
CAGCAGTGGA GCAAGACGCG GGCGCGGATC AGCGAGTTGC TAAGGCACGA CAGCGCCGAG
CTGCGCGACA ACGCCGCGCT GCGGCAGCAG GCGTTGATTC CGTTGCGCGA AGCGAAGCTG
CATTTGCCGC TGCGGGTCGA GGGCTTCACC GATTTCTATT CGTCGAAGGA GCACGCCACC
AATGTCGGCA CGATGTTCCG CGACAAGACC AATCCGCTGC TGCCGAACTG GCTGCACATC
CCGATCGGCT ACAATGGTCG CGCGTCCACC GTGGTGGTCA GCGGCGTCGG AATTCACCGC
CCGCGCGGGC AGTTGAAGCC GCCTTCCGTC GAGCTGCCGA GCTTCGGCCC GTGCAAGCGG
CTCGACTTCG AGCTGGAGAT CGGAGTGGTC GTCGGGCAAT CATCGGCGAT GGGCGCGATG
TTGACCGAGG CGCAGGCCGA ACAGATGATC TTCGGCTTCA CGCTGCTCAA CGACTGGAGC
GCGCGTGATA TCCAGCAATG GGAGTATGTC CCGCTCGGGC CGTTCCAGGC CAAGGCGTTC
GCGACCTCGA TCAGCCCGTG GATCGTGACG CGCGAGGCGC TGGAGCCGTT TCGCGTGCAC
GGCCCCGAGC AGCAACCGAC ACCCTTGGAC TATCTGCGGC AGAAGGGCGC CAACAATTAC
GACATGGCGC TGGAAGTCAG CCTGCGTACA CCCGCGATGG CCACGCCGGC GCGGATCAGC
GCCACCAATT TCAAATACAT GTACTGGTCT TCGGTGCAGC AACTGGTGCA TCACGCCTCG
AGCGGCTGCG CGATGAATAT CGGCGATCTG CTCGGCTCCG GCACCGTCAG CGGCCCGGAG
AAGGATCAAC TCGGCAGTCT GCTGGAGTTG AGCTGGAACG GCGCGGAGCC GGTCCAGCTT
CCGGGCGGCG AGCAGCGCGG CTTTCTCGAA GACGGCGACT CCCTGCTGAT GCGCGGCTGG
TGCCAGGGCG ACGGCTATCG GATCGGCTTC GGCGAAGTCG AAGGGACGAT TCTGCCGGCG
GGCAACTAA
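
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below is a minimal illustration of removing that whitespace and computing a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCATCCCA ATGATCCGCG CTTGCGCTCC TTCATCGAGG TCGATCCGAC CTCGGATTTC"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Applying the same two functions to the full sequence gives a value that can be compared with the GC content listed under Gene Information.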
 
Protein sequence
MHPNDPRLRS FIEVDPTSDF PIQNLPYGVF STASTPTPRV GVAIGDYVLD LAVLQASRLI 
DLPDGVFAQS SINAFMALGP QQWSKTRARI SELLRHDSAE LRDNAALRQQ ALIPLREAKL
HLPLRVEGFT DFYSSKEHAT NVGTMFRDKT NPLLPNWLHI PIGYNGRAST VVVSGVGIHR
PRGQLKPPSV ELPSFGPCKR LDFELEIGVV VGQSSAMGAM LTEAQAEQMI FGFTLLNDWS
ARDIQQWEYV PLGPFQAKAF ATSISPWIVT REALEPFRVH GPEQQPTPLD YLRQKGANNY
DMALEVSLRT PAMATPARIS ATNFKYMYWS SVQQLVHHAS SGCAMNIGDL LGSGTVSGPE
KDQLGSLLEL SWNGAEPVQL PGGEQRGFLE DGDSLLMRGW CQGDGYRIGF GEVEGTILPA
GN