Gene RPD_3591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3591 
Symbol 
ID4024105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4002506 
End bp4004062 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content68% 
IMG OID637963795 
Product5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 
Protein accessionYP_570715 
Protein GI91978056 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.943045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.901126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGG CCGCCACCGC CGGGACGAAC GATCCGGCCG CCGCGTTTCG GGCCAATCAG 
GAGCGCGCTG CGCCGCTGTT GAAGCGACTG ACCGCCGACG GCATCGGCCA CCTGATCGAC
GGCGCGATCG TGCCGTCGTC ATCGGGCGAC GTGTTCGAGA CACATTCGCC GATCGACAAC
ACGGTGCTGG CGCAGGTCTC GCGCGGCACA ATTGAGGACA TCGACCGCGC CGCGCAGGCG
GCGAAGCGGG CGTTTCCGGC GTGGCGCGAC ATGCCGGCGC CGGCGCGGCG CAAGCTGCTG
CACAGGGTCG CGGACGCGAT CGAGGCGCGC GCAGACGACA TCGCCGTGCT GGAATGCATC
GACACCGGGC AAGCTCACCG CTTCATGGCG AAGGCCGCGA TCCGCGCCGC CGAGAATTTC
CGTTTCTTCG CCGACAAATG CACCGAGGCG CGCGACGGCC TCAACACGCC GAGCGACGAG
CATTGGAACG TTTCGACCCG GGTGCCGATC GGCCCGGTCG GGGTGATCAC GCCGTGGAAC
ACGCCGTTTA TGCTGTCGAC CTGGAAGATC GCGCCTGCAC TCGCGGCCGG CTGCACTGTG
GTGCACAAGC CGGCGGAATG GTCGCCGGTG ACCGCCGATC TGTTGTCGCA ACTCTGCAGG
CAGGCCGGCC TGCCCGACGG CGTGCTCAAC ACCGTGCACG GTTTCGGCGA GGAAACCGGC
AAGGCCTTGA CCGAGCATCC CGCCATCAAG GCGATCGCCT TCGTCGGCGA AACCGCCACG
GGCGCTGCGA TCATGGCGCA GGGCGCGCCG ACGCTGAAGC GCGTGCATTT CGAACTCGGC
GGCAAGAACC CGGTGATCGT GTTCGACGAC GCCGATCTCG ACCGCGCGCT CGACGCCGTG
GTGTTCATGA TCTACTCGCT CAACGGCGAG CGCTGCACCT CGTCGAGCCG GCTGCTGATC
CAGCAATCGA TCGCCGACAC CTTCATCGAC AGGCTCGCGG CCCGCGTGCG CACACTGAAG
GTCGGCCATC CGCTCGATCC CGCGACCGAG ATCGGCCCGC TGATCCATCA GCGTCATCTC
GACAAGGTCT GCTCCTATTT CGATATCGCC CGAAAGGCCG GCGCGACCAT CGCGGTCGGC
GGCGCGCGGC ATGACGGGCC GGGCGGCGGC CATTACGTGC AGCCGACGCT GGTGACCGGC
GCGCGCAGCG ACATGCAGGT CGCGCAGGAG GAAGTGTTCG GGCCGTTCCT CACCGTGATC
CCGTTCCGCG ACGAGGCGGA CGCGATCCGT ATCGCCAATG ATGTCCGCTA CGGCCTCGCC
GGCTATGTCT GGACCGCCGA CATCGGCCGC GCGCTCCGCG TCGCCGACGC GCTGGAGGCC
GGGATGATCT GGCTGAACTC GGAGAACGTC CGCCATCTGC CGACCCCGTT CGGCGGCATG
AAGCAATCCG GCATCGGCCG CGACGGCGGC GACTACTCGT TCGAGTTCTA CATGGAAACC
AAACACGTCT CGCTCGCGCG CGGCACGCAC AAGATCCAGA GACTGGGGGC TGTGTAG
 
Protein sequence
MAEAATAGTN DPAAAFRANQ ERAAPLLKRL TADGIGHLID GAIVPSSSGD VFETHSPIDN 
TVLAQVSRGT IEDIDRAAQA AKRAFPAWRD MPAPARRKLL HRVADAIEAR ADDIAVLECI
DTGQAHRFMA KAAIRAAENF RFFADKCTEA RDGLNTPSDE HWNVSTRVPI GPVGVITPWN
TPFMLSTWKI APALAAGCTV VHKPAEWSPV TADLLSQLCR QAGLPDGVLN TVHGFGEETG
KALTEHPAIK AIAFVGETAT GAAIMAQGAP TLKRVHFELG GKNPVIVFDD ADLDRALDAV
VFMIYSLNGE RCTSSSRLLI QQSIADTFID RLAARVRTLK VGHPLDPATE IGPLIHQRHL
DKVCSYFDIA RKAGATIAVG GARHDGPGGG HYVQPTLVTG ARSDMQVAQE EVFGPFLTVI
PFRDEADAIR IANDVRYGLA GYVWTADIGR ALRVADALEA GMIWLNSENV RHLPTPFGGM
KQSGIGRDGG DYSFEFYMET KHVSLARGTH KIQRLGAV