Gene Information |
Locus tag | RPB_0909 |
Symbol | |
ID | 3909762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1046404 |
End bp | 1047672 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882802 |
Product | fumarylacetoacetase |
Protein accession | YP_484531 |
Protein GI | 86748035 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR01266] fumarylacetoacetase |
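The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute to the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1046404
end_bp = 1047672

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1269 422
```

Under these assumptions the result agrees with the listed Gene Length (1269 bp) and Protein Length (422 aa).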
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCCCA ACGATCCGCG CCTGCGCTCC TTCGTCGATG TGAAACCGGA ATCGGACTTT CCGATCCAGA ACCTTCCCTA CGGCGTGATC TCGACCGCGT CCGACCCTTC CCCGCGTGTC GGCGTCGCGA TCGGCGATTT CGTGCTCGAT CTCGCGGCGC TGCAGGCGGC CAAGCTGCTC GATCTGCCGG ACGGCGTGTT CGCGCAATCG TCGATCAACG CCTTCATGGC GCTCGGGCTC GCAATGTGGA GCACGACACG GGCGCGGATC AGTGCGTTGC TGCGTCACGA CAATCCCGAG CTGCGCGACG ACGCCGCGCT GCGCGCGCGG GCGCTTGTTC CGATGAGTGA CGCGAAGTTG CATCTGCCGC TGCGCGTCGA AGGCTTCACC GATTTCTACT CGTCGAAGGA ACACGCCACC AATGTCGGCA CGATGTTCCG CGACAAGACC AATCCGCTGC TGCCGAACTG GCTGCACATC CCGATCGGCT ACAACGGCCG CGCCTCGACC GTCGTGGTCA GCGGCACCCA GATCCATCGT CCGCGCGGGC AGCTCAAGCC ACCATCCGCC GAGCTGCCGA GCTTCGGCCC GTGCAAGCGG CTCGATTTCG AGCTGGAGAT CGGCGTCGTG ATCGGGCAGC CGTCGGCGAT GGGCACGACG CTGACCGAAC AGCAGGCCGA GGAGATGATC TTCGGCTTCA CGCTGTTGAA CGACTGGAGC GCGCGCGACA TCCAGCAATG GGAGTATGTG CCGCTCGGGC CGTTCCAGGC GAAAGCGTTC GCCACCTCGA TCAGCCCGTG GATCGTGACG CGCGAGGCGC TGGAGCCGTT TCGGGTTCAC GGGCCCACGC AGGATCCTGT GCCTCTGCCC TATCTGCAGC AGCAGGGGCC TAACAACTAC GACATGGCGC TGGAAGTGAA CCTGCGCACG CCGGCCATGA ACGCGCCGGC GCGGATCAGC GCGACGAATT TCAAATACAT GTACTGGTCG TCAGTGCAGC AGCTGGTGCA CCATGCCTCC AGCGGCTGCG CGATGAATGT CGGCGACCTG CTCGGCTCCG GCACCGTCTC GGGGCCGGCG AAGGATCAGC TCGGCAGCCT GCTGGAGCTG AGCTGGAACG GCGCCGAACC GGTGCAGCTC CCCGGCGGCG AGACCCGCGG CTTCCTCGAC GACGGCGATT CGCTGATCAT GCGCGGCTGG TGCCAGGCCG ACGGCTACCG CGTCGGTTTC GGCGAGGTCG AGGGGACGAT TCTGGCGGCG AAGAGCTGA
|
Protein sequence | MHPNDPRLRS FVDVKPESDF PIQNLPYGVI STASDPSPRV GVAIGDFVLD LAALQAAKLL DLPDGVFAQS SINAFMALGL AMWSTTRARI SALLRHDNPE LRDDAALRAR ALVPMSDAKL HLPLRVEGFT DFYSSKEHAT NVGTMFRDKT NPLLPNWLHI PIGYNGRAST VVVSGTQIHR PRGQLKPPSA ELPSFGPCKR LDFELEIGVV IGQPSAMGTT LTEQQAEEMI FGFTLLNDWS ARDIQQWEYV PLGPFQAKAF ATSISPWIVT REALEPFRVH GPTQDPVPLP YLQQQGPNNY DMALEVNLRT PAMNAPARIS ATNFKYMYWS SVQQLVHHAS SGCAMNVGDL LGSGTVSGPA KDQLGSLLEL SWNGAEPVQL PGGETRGFLD DGDSLIMRGW CQADGYRVGF GEVEGTILAA KS
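Both sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch of how they relate, the code below joins the blocks, computes a GC fraction, and translates DNA codon by codon. Translation table 11 assigns amino acids to codons in the same way as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here. The helper names `ungroup`, `gc_fraction`, and `translate` are hypothetical and are not taken from the database.

```python
from itertools import product

# Standard codon table in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments; only the allowed start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks."""
    return "".join(seq.split())


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
excerpt = ungroup("ATGCACCCCA ACGATCCGCG CCTGCGCTCC")
print(translate(excerpt))  # MHPNDPRLRS
```

The translated excerpt matches the first block of the protein sequence. Applying `gc_fraction` to the full ungrouped gene sequence gives a value that can be compared with the GC content field.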