Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5151 |
Symbol | |
ID | 6412851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5549115 |
End bp | 5550383 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642715041 |
Product | fumarylacetoacetase |
Protein accession | YP_001994114 |
Protein GI | 192293509 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR01266] fumarylacetoacetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.389186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCCCA ACGACCCGCG CCTGCGCTCC TTCGTCGATG TGAAACTGGA ATCGGACTTT CCGATCCAGA ACCTGCCTTA CGGCGTTGTC TCGACGGTCG ACGATCCTGG TCCCCGCGTC GGCGTCGCGA TCGGCGACTT CGTGCTCGAC CTCGCCATGC TGCAAACTGC GAAGCTGCTC GACCTGCCGG AGGGCGTGTT CACGCAATCG TCGATCAATG CCTTCATGGC GCTCGGGCCG AACGTCTGGC GCAGCACGCG GGCGCGGATC AGCGCGCTGC TTCGGCACGA CAATCCCGAG CTGCGCGACC ATGCGGAGTT GCGCGCCAAG GCGCTGCTGC CGATGAGCCA AGTGAAGCTG CATCTGCCGC TCCGCGTCGA GGGCTTCACC GATTTCTATT CGTCGAAGGA ACACGCCACC AACGTCGGCA CCATGTTCCG GGACAAGACC AATCCGCTGC TGCCGAACTG GCTGCACATC CCGATCGGCT ACAACGGCCG CGCCTCGACC GTGGTGGTCA GCGGCACCCA GATCCATCGC CCGCGCGGCC AGCTCAAGCC GCCGTCGGCG GAGCTGCCGA GTTTCGGACC GTGCAAGCGG CTGGATTTCG AACTCGAGAT CGGCGTGGTG GTCGGGCAGT CGTCGGCGAT GGGCACGATG CTGACCGAAC AGCAGGCCGA GGAGATGATC TTCGGCTTCA CGCTGCTCAA CGATTGGAGC GCGCGCGACA TCCAGCAATG GGAATACGTG CCGCTAGGGC CGTTCCAGGC CAAGGCGTTC GCGACCTCGA TCAGCCCGTG GATCGTCACC CGCGAGGCAC TTGAGCCGTT TCGCGTTCAC GGCCCCGTGC AGGATCCGGC GCCGCTGCCT TACTTGCAGC AGAAGGGCGC CAACAATTAC GACATGGCGC TGGAAGTCGC CTTGCGGACG CCGACGATGC AACAGCCGGC GCGGATCAGC GCGACCAATT TCAAATACAT GTACTGGTCG TCGGTGCAGC AGCTGGTGCA CCATGCCTCC AGCGGCTGCG CGATGAGTGT CGGCGACCTG CTCGGCTCCG GGACGGTGTC GGGTCCGGAG AAGGATCAGC TCGGCAGCCT GCTGGAATTG AGCTGGAACG GCACGGAGCC GCTGCAACTG CCGGGCGGCG AGAGCCGCGG CTTCCTCGAA GACGGCGACA GCCTGGTGAT GCGCGGCTGG TGCCAGGGCG ACGGCTACCG CGTCGGCTTC GGCGAGGTCG AGGGCACGGT GCTGGCGGCG AGCGAGTAG
|
Protein sequence | MHPNDPRLRS FVDVKLESDF PIQNLPYGVV STVDDPGPRV GVAIGDFVLD LAMLQTAKLL DLPEGVFTQS SINAFMALGP NVWRSTRARI SALLRHDNPE LRDHAELRAK ALLPMSQVKL HLPLRVEGFT DFYSSKEHAT NVGTMFRDKT NPLLPNWLHI PIGYNGRAST VVVSGTQIHR PRGQLKPPSA ELPSFGPCKR LDFELEIGVV VGQSSAMGTM LTEQQAEEMI FGFTLLNDWS ARDIQQWEYV PLGPFQAKAF ATSISPWIVT REALEPFRVH GPVQDPAPLP YLQQKGANNY DMALEVALRT PTMQQPARIS ATNFKYMYWS SVQQLVHHAS SGCAMSVGDL LGSGTVSGPE KDQLGSLLEL SWNGTEPLQL PGGESRGFLE DGDSLVMRGW CQGDGYRVGF GEVEGTVLAA SE
|
| |