Gene Rpal_5151 details


Gene Information

Locus tag: Rpal_5151
Symbol:
ID: 6412851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5549115
End bp: 5550383
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 66%
IMG OID: 642715041
Product: fumarylacetoacetase
Protein accession: YP_001994114
Protein GI: 192293509
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR01266] fumarylacetoacetase
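
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are inclusive, 1-based coordinates, and that Protein Length excludes the stop codon. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 5549115
end_bp = 5550383
gene_length_bp = 1269
protein_length_aa = 422

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is the stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print(span, remainder, coding_codons)  # prints: 1269 0 422
```

Under those assumptions the three fields agree: 1269 bp is 423 codons, and 423 minus the stop codon gives 422 aa.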


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.389186
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACCCCA ACGACCCGCG CCTGCGCTCC TTCGTCGATG TGAAACTGGA ATCGGACTTT 
CCGATCCAGA ACCTGCCTTA CGGCGTTGTC TCGACGGTCG ACGATCCTGG TCCCCGCGTC
GGCGTCGCGA TCGGCGACTT CGTGCTCGAC CTCGCCATGC TGCAAACTGC GAAGCTGCTC
GACCTGCCGG AGGGCGTGTT CACGCAATCG TCGATCAATG CCTTCATGGC GCTCGGGCCG
AACGTCTGGC GCAGCACGCG GGCGCGGATC AGCGCGCTGC TTCGGCACGA CAATCCCGAG
CTGCGCGACC ATGCGGAGTT GCGCGCCAAG GCGCTGCTGC CGATGAGCCA AGTGAAGCTG
CATCTGCCGC TCCGCGTCGA GGGCTTCACC GATTTCTATT CGTCGAAGGA ACACGCCACC
AACGTCGGCA CCATGTTCCG GGACAAGACC AATCCGCTGC TGCCGAACTG GCTGCACATC
CCGATCGGCT ACAACGGCCG CGCCTCGACC GTGGTGGTCA GCGGCACCCA GATCCATCGC
CCGCGCGGCC AGCTCAAGCC GCCGTCGGCG GAGCTGCCGA GTTTCGGACC GTGCAAGCGG
CTGGATTTCG AACTCGAGAT CGGCGTGGTG GTCGGGCAGT CGTCGGCGAT GGGCACGATG
CTGACCGAAC AGCAGGCCGA GGAGATGATC TTCGGCTTCA CGCTGCTCAA CGATTGGAGC
GCGCGCGACA TCCAGCAATG GGAATACGTG CCGCTAGGGC CGTTCCAGGC CAAGGCGTTC
GCGACCTCGA TCAGCCCGTG GATCGTCACC CGCGAGGCAC TTGAGCCGTT TCGCGTTCAC
GGCCCCGTGC AGGATCCGGC GCCGCTGCCT TACTTGCAGC AGAAGGGCGC CAACAATTAC
GACATGGCGC TGGAAGTCGC CTTGCGGACG CCGACGATGC AACAGCCGGC GCGGATCAGC
GCGACCAATT TCAAATACAT GTACTGGTCG TCGGTGCAGC AGCTGGTGCA CCATGCCTCC
AGCGGCTGCG CGATGAGTGT CGGCGACCTG CTCGGCTCCG GGACGGTGTC GGGTCCGGAG
AAGGATCAGC TCGGCAGCCT GCTGGAATTG AGCTGGAACG GCACGGAGCC GCTGCAACTG
CCGGGCGGCG AGAGCCGCGG CTTCCTCGAA GACGGCGACA GCCTGGTGAT GCGCGGCTGG
TGCCAGGGCG ACGGCTACCG CGTCGGCTTC GGCGAGGTCG AGGGCACGGT GCTGGCGGCG
AGCGAGTAG
 
Protein sequence
MHPNDPRLRS FVDVKLESDF PIQNLPYGVV STVDDPGPRV GVAIGDFVLD LAMLQTAKLL 
DLPEGVFTQS SINAFMALGP NVWRSTRARI SALLRHDNPE LRDHAELRAK ALLPMSQVKL
HLPLRVEGFT DFYSSKEHAT NVGTMFRDKT NPLLPNWLHI PIGYNGRAST VVVSGTQIHR
PRGQLKPPSA ELPSFGPCKR LDFELEIGVV VGQSSAMGTM LTEQQAEEMI FGFTLLNDWS
ARDIQQWEYV PLGPFQAKAF ATSISPWIVT REALEPFRVH GPVQDPAPLP YLQQKGANNY
DMALEVALRT PTMQQPARIS ATNFKYMYWS SVQQLVHHAS SGCAMSVGDL LGSGTVSGPE
KDQLGSLLEL SWNGTEPLQL PGGESRGFLE DGDSLVMRGW CQGDGYRVGF GEVEGTVLAA
SE
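
The sequences above are printed in blocks of 10 characters separated by spaces. The sketch below shows one way to join such blocks and translate the DNA. It is not the database's own tooling: the helper names (join_blocks, gc_percent, translate) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons. Alternative start codons are not handled; this gene starts with ATG, so that does not matter here. Only the first printed line of each sequence is used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same amino acids for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove the spaces and line breaks between 10-character blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First printed line of the gene and protein sequences above.
first_gene_line = "ATGCACCCCA ACGACCCGCG CCTGCGCTCC TTCGTCGATG TGAAACTGGA ATCGGACTTT"
first_protein_line = "MHPNDPRLRS FVDVKLESDF"

dna = join_blocks(first_gene_line)
print(translate(dna))  # prints: MHPNDPRLRSFVDVKLESDF
```

The first 60 bases translate to the first 20 residues of the listed protein, and the final three codons of the gene (AGC GAG TAG) give the closing residues SE followed by a stop. Passing the full joined gene sequence to gc_percent would give a value to compare against the GC content field.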