Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1020 |
Symbol | |
ID | 4021495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1158692 |
End bp | 1159960 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637961211 |
Product | fumarylacetoacetase |
Protein accession | YP_568159 |
Protein GI | 91975500 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR01266] fumarylacetoacetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.406627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.926144 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCCCA ATGATCCGCG CTTGCGCTCC TTCATCGAGG TCGATCCGAC CTCGGATTTC CCCATTCAGA ATCTGCCCTA TGGCGTGTTC TCGACGGCGA GCACGCCGAC GCCGCGCGTC GGCGTCGCGA TCGGCGACTA CGTCCTCGAT CTCGCCGTGT TGCAGGCTTC ACGCCTGATC GATCTGCCGG ATGGCGTGTT CGCGCAATCC TCGATCAACG CTTTCATGGC GCTCGGGCCG CAGCAGTGGA GCAAGACGCG GGCGCGGATC AGCGAGTTGC TAAGGCACGA CAGCGCCGAG CTGCGCGACA ACGCCGCGCT GCGGCAGCAG GCGTTGATTC CGTTGCGCGA AGCGAAGCTG CATTTGCCGC TGCGGGTCGA GGGCTTCACC GATTTCTATT CGTCGAAGGA GCACGCCACC AATGTCGGCA CGATGTTCCG CGACAAGACC AATCCGCTGC TGCCGAACTG GCTGCACATC CCGATCGGCT ACAATGGTCG CGCGTCCACC GTGGTGGTCA GCGGCGTCGG AATTCACCGC CCGCGCGGGC AGTTGAAGCC GCCTTCCGTC GAGCTGCCGA GCTTCGGCCC GTGCAAGCGG CTCGACTTCG AGCTGGAGAT CGGAGTGGTC GTCGGGCAAT CATCGGCGAT GGGCGCGATG TTGACCGAGG CGCAGGCCGA ACAGATGATC TTCGGCTTCA CGCTGCTCAA CGACTGGAGC GCGCGTGATA TCCAGCAATG GGAGTATGTC CCGCTCGGGC CGTTCCAGGC CAAGGCGTTC GCGACCTCGA TCAGCCCGTG GATCGTGACG CGCGAGGCGC TGGAGCCGTT TCGCGTGCAC GGCCCCGAGC AGCAACCGAC ACCCTTGGAC TATCTGCGGC AGAAGGGCGC CAACAATTAC GACATGGCGC TGGAAGTCAG CCTGCGTACA CCCGCGATGG CCACGCCGGC GCGGATCAGC GCCACCAATT TCAAATACAT GTACTGGTCT TCGGTGCAGC AACTGGTGCA TCACGCCTCG AGCGGCTGCG CGATGAATAT CGGCGATCTG CTCGGCTCCG GCACCGTCAG CGGCCCGGAG AAGGATCAAC TCGGCAGTCT GCTGGAGTTG AGCTGGAACG GCGCGGAGCC GGTCCAGCTT CCGGGCGGCG AGCAGCGCGG CTTTCTCGAA GACGGCGACT CCCTGCTGAT GCGCGGCTGG TGCCAGGGCG ACGGCTATCG GATCGGCTTC GGCGAAGTCG AAGGGACGAT TCTGCCGGCG GGCAACTAA
|
Protein sequence | MHPNDPRLRS FIEVDPTSDF PIQNLPYGVF STASTPTPRV GVAIGDYVLD LAVLQASRLI DLPDGVFAQS SINAFMALGP QQWSKTRARI SELLRHDSAE LRDNAALRQQ ALIPLREAKL HLPLRVEGFT DFYSSKEHAT NVGTMFRDKT NPLLPNWLHI PIGYNGRAST VVVSGVGIHR PRGQLKPPSV ELPSFGPCKR LDFELEIGVV VGQSSAMGAM LTEAQAEQMI FGFTLLNDWS ARDIQQWEYV PLGPFQAKAF ATSISPWIVT REALEPFRVH GPEQQPTPLD YLRQKGANNY DMALEVSLRT PAMATPARIS ATNFKYMYWS SVQQLVHHAS SGCAMNIGDL LGSGTVSGPE KDQLGSLLEL SWNGAEPVQL PGGEQRGFLE DGDSLLMRGW CQGDGYRIGF GEVEGTILPA GN
|
| |