Gene RPD_3302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3302 
Symbol 
ID4023812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3656747 
End bp3658360 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content65% 
IMG OID637963506 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_570427 
Protein GI91977768 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.959449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTTC AAGGTGATCT CTGCATTTTG GCAGGCGTCT CACGCGCGCA GGCCCTTCCC 
AAAGATCAAA AAAGGCTGTT CAAGGGGACC AGAAAACAGC CTTTCAGGAG GATACGGATG
CGCACGATCG GGCACTTCAT CGGCGGCAAA GAGGTCGAAG GCAAGTCGGG ACGTTTCGCC
GACGTATTCG AGCCGATGAC CGGCGAGGTC AAAGCCAAGG TCGCGCTCGC CACCCGCGCC
GAGCTGCGCG CCGCCGTCGA GAACGCCAGG GCCGCGCAGC CGGAATGGGG CGCGACCAAC
CCGCAGCGCC GCGCCCGCGT GATGATGAAG TTCCTCGACC TGGTGCAGCG CGACTACGAC
AAGCTCGCCG AGTTGCTGGC GCGCGAGCAC GGCAAGACCA TTCCGGACGC CAAGGGCGAC
ATCCAGCGCG GTCTCGAAGT CGCCGAGTTC GCCTGCGGCA TTCCGCATCT GATGAAGGGC
GAATACACCG AAGGCGCCGG CCCCGGCATC GACATCTATT CGATGCGCCA GCCGCTCGGC
GTGGTCGCCG GCATCACCCC GTTCAACTTC CCGGCGATGA TCCCGATGTG GAAGTTCGCC
CCCGCGATCG CCTGCGGCAA CGCTTTCATC CTGAAGCCGT CGGAGCGCGA TCCCGGCGTG
CCGATGGCGC TTGCGGCTTT GATGATCGAG GCGGGGCTGC CGCCCGGCAT CCTCAACGTC
GTCAACGGCG ACAAGGAGGC GGTCGACGCC ATTCTCGATG ACGCCGACAT CCGCGCGGTC
GGCTTCGTCG GCTCGTCGCC GATCGCGCAA TATATTTACG AGCGCGCGGC GGCGACCGGC
AAGCGCGCCC AGTGCTTCGG CGGCGCCAAG AATCACGCCA TCATCATGCC CGACGCCGAC
ATCGACCAGA CCGTCGACGC GCTGATCGGC GCGGGTTTCG GCTCGGCCGG CGAACGCTGC
ATGGCGATCT CGGTCGCGGT GCCGGTCGGT AAGGCGACCG CGGACAGGCT GATGGAAAAG
CTGATCCCGC GCGTCGAAGC GCTGAAGATC GGTCCCTCGA CCGATCCATC GGCCGATTTC
GGTCCGCTGG TGACGCGTGA GGCGCTGGAG CGCGTCAAGA ACTACGTCGA TATCGGCGTC
AAGGAGGGCG CGACGTTGGC CGTCGACGGC CGCAATTTCA AGCTGCAAGG CTATGAGAAC
GGCTTCTACA TGGGCGGCTG CCTGTTCGAC AACGTCACCC GCGACATGCG GATCTACAAG
GAAGAGATCT TCGGCCCGGT GCTGAGCGTG GTGCGCGCCC ACGATTACGC CGAAGCGCTC
GCGCTGCCCT CGGACCATGA TTACGGCAAC GGCGTTGCGA TCTTCACCCG CGACGGCGAC
GCCGCCCGCG ACTTCGCGGC GAAGGTCAAT GTCGGCATGG TCGGCATCAA CGTGCCGATC
CCGGTGCCGA TCGCCTACTA CACGTTCGGC GGCTGGAAGA AGTCCGGCTT CGGCGATCTC
AACCAGCACG GTCCGGATTC GGTGCGGTTC TACACCAAGA CCAAGACCGT CACCTCGCGC
TGGCCCTCCG GCGTCAAGGA AGGCGCGGAG TTCTCGATCC CGTTGATGAA GTAG
 
Protein sequence
MPLQGDLCIL AGVSRAQALP KDQKRLFKGT RKQPFRRIRM RTIGHFIGGK EVEGKSGRFA 
DVFEPMTGEV KAKVALATRA ELRAAVENAR AAQPEWGATN PQRRARVMMK FLDLVQRDYD
KLAELLAREH GKTIPDAKGD IQRGLEVAEF ACGIPHLMKG EYTEGAGPGI DIYSMRQPLG
VVAGITPFNF PAMIPMWKFA PAIACGNAFI LKPSERDPGV PMALAALMIE AGLPPGILNV
VNGDKEAVDA ILDDADIRAV GFVGSSPIAQ YIYERAAATG KRAQCFGGAK NHAIIMPDAD
IDQTVDALIG AGFGSAGERC MAISVAVPVG KATADRLMEK LIPRVEALKI GPSTDPSADF
GPLVTREALE RVKNYVDIGV KEGATLAVDG RNFKLQGYEN GFYMGGCLFD NVTRDMRIYK
EEIFGPVLSV VRAHDYAEAL ALPSDHDYGN GVAIFTRDGD AARDFAAKVN VGMVGINVPI
PVPIAYYTFG GWKKSGFGDL NQHGPDSVRF YTKTKTVTSR WPSGVKEGAE FSIPLMK