Gene Information |
Locus tag | RPD_3591 |
Symbol | |
ID | 4024105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 4002506 |
End bp | 4004062 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637963795 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_570715 |
Protein GI | 91978056 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
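The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4002506
end_bp = 4004062

# Assumption: coordinates are 1-based and inclusive,
# so the span includes both end points.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1557 518
```

Under these assumptions the result agrees with the Gene Length (1557 bp) and Protein Length (518 aa) fields.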
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.943045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.901126 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGG CCGCCACCGC CGGGACGAAC GATCCGGCCG CCGCGTTTCG GGCCAATCAG GAGCGCGCTG CGCCGCTGTT GAAGCGACTG ACCGCCGACG GCATCGGCCA CCTGATCGAC GGCGCGATCG TGCCGTCGTC ATCGGGCGAC GTGTTCGAGA CACATTCGCC GATCGACAAC ACGGTGCTGG CGCAGGTCTC GCGCGGCACA ATTGAGGACA TCGACCGCGC CGCGCAGGCG GCGAAGCGGG CGTTTCCGGC GTGGCGCGAC ATGCCGGCGC CGGCGCGGCG CAAGCTGCTG CACAGGGTCG CGGACGCGAT CGAGGCGCGC GCAGACGACA TCGCCGTGCT GGAATGCATC GACACCGGGC AAGCTCACCG CTTCATGGCG AAGGCCGCGA TCCGCGCCGC CGAGAATTTC CGTTTCTTCG CCGACAAATG CACCGAGGCG CGCGACGGCC TCAACACGCC GAGCGACGAG CATTGGAACG TTTCGACCCG GGTGCCGATC GGCCCGGTCG GGGTGATCAC GCCGTGGAAC ACGCCGTTTA TGCTGTCGAC CTGGAAGATC GCGCCTGCAC TCGCGGCCGG CTGCACTGTG GTGCACAAGC CGGCGGAATG GTCGCCGGTG ACCGCCGATC TGTTGTCGCA ACTCTGCAGG CAGGCCGGCC TGCCCGACGG CGTGCTCAAC ACCGTGCACG GTTTCGGCGA GGAAACCGGC AAGGCCTTGA CCGAGCATCC CGCCATCAAG GCGATCGCCT TCGTCGGCGA AACCGCCACG GGCGCTGCGA TCATGGCGCA GGGCGCGCCG ACGCTGAAGC GCGTGCATTT CGAACTCGGC GGCAAGAACC CGGTGATCGT GTTCGACGAC GCCGATCTCG ACCGCGCGCT CGACGCCGTG GTGTTCATGA TCTACTCGCT CAACGGCGAG CGCTGCACCT CGTCGAGCCG GCTGCTGATC CAGCAATCGA TCGCCGACAC CTTCATCGAC AGGCTCGCGG CCCGCGTGCG CACACTGAAG GTCGGCCATC CGCTCGATCC CGCGACCGAG ATCGGCCCGC TGATCCATCA GCGTCATCTC GACAAGGTCT GCTCCTATTT CGATATCGCC CGAAAGGCCG GCGCGACCAT CGCGGTCGGC GGCGCGCGGC ATGACGGGCC GGGCGGCGGC CATTACGTGC AGCCGACGCT GGTGACCGGC GCGCGCAGCG ACATGCAGGT CGCGCAGGAG GAAGTGTTCG GGCCGTTCCT CACCGTGATC CCGTTCCGCG ACGAGGCGGA CGCGATCCGT ATCGCCAATG ATGTCCGCTA CGGCCTCGCC GGCTATGTCT GGACCGCCGA CATCGGCCGC GCGCTCCGCG TCGCCGACGC GCTGGAGGCC GGGATGATCT GGCTGAACTC GGAGAACGTC CGCCATCTGC CGACCCCGTT CGGCGGCATG AAGCAATCCG GCATCGGCCG CGACGGCGGC GACTACTCGT TCGAGTTCTA CATGGAAACC AAACACGTCT CGCTCGCGCG CGGCACGCAC AAGATCCAGA GACTGGGGGC TGTGTAG
|
Protein sequence | MAEAATAGTN DPAAAFRANQ ERAAPLLKRL TADGIGHLID GAIVPSSSGD VFETHSPIDN TVLAQVSRGT IEDIDRAAQA AKRAFPAWRD MPAPARRKLL HRVADAIEAR ADDIAVLECI DTGQAHRFMA KAAIRAAENF RFFADKCTEA RDGLNTPSDE HWNVSTRVPI GPVGVITPWN TPFMLSTWKI APALAAGCTV VHKPAEWSPV TADLLSQLCR QAGLPDGVLN TVHGFGEETG KALTEHPAIK AIAFVGETAT GAAIMAQGAP TLKRVHFELG GKNPVIVFDD ADLDRALDAV VFMIYSLNGE RCTSSSRLLI QQSIADTFID RLAARVRTLK VGHPLDPATE IGPLIHQRHL DKVCSYFDIA RKAGATIAVG GARHDGPGGG HYVQPTLVTG ARSDMQVAQE EVFGPFLTVI PFRDEADAIR IANDVRYGLA GYVWTADIGR ALRVADALEA GMIWLNSENV RHLPTPFGGM KQSGIGRDGG DYSFEFYMET KHVSLARGTH KIQRLGAV
|
| |
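The Gene sequence field is printed in space-separated blocks of 10 bases. The following is a rough Python sketch of how such a grouped string could be flattened and its GC percentage computed for comparison with the GC content field. The function name `gc_percent` is hypothetical, and the short input shown is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_percent(grouped_sequence: str) -> float:
    """Return the percentage of G and C bases in a whitespace-grouped sequence."""
    # Remove the spaces (and any line breaks) that separate the 10-base blocks.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)

# First two blocks of the Gene sequence field, used only as sample input.
sample = "ATGGCTGAGG CCGCCACCGC"
print(gc_percent(sample))
```

Passing the full Gene sequence string instead of the sample gives a value that can be compared with the rounded figure reported in the record.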