Gene Rsph17029_1609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1609 
Symbol 
ID4897296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1691426 
End bp1692925 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content68% 
IMG OID640112200 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001043491 
Protein GI126462377 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0517581 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAC TCAGCCACTG GATCGACGGC AAGCGCGTGA AGGGCACCTC CGGCCGCTTC 
GCCGATGTCT TCAACCCGGC CACCGGCGAG GTGCAGGCGC GCGTGCCGCT CGCCTCGAAG
GACGAACTCG ACGCCGCCGT GGCCTCGGCC GCCGCCGCCC AGCCGAAATG GGCCGCCACC
AACCCGCAGC GCCGCGCCCG CGTGATGATG GAGGTCGTGC GCCTCCTCAA CCGCGACATG
GACAAGCTGG CCGAGGCGCT CTCGCGCGAG CACGGCAAGA CCATCCCCGA CGCCAAGGGC
GACGTGCAGC GCGGCCTCGA GGTGATCGAA TTCTGCATCG GCGCGCCGCA TCTGCTGAAG
GGCGAGTTCA CCGACAGCGC GGGCCCCGGC ATCGACATGT ATTCGATGCG CCAGCCGCTC
GGCGTGGCTG CGGGCATCAC GCCCTTCAAC TTCCCGGCAA TGATCCCGCT GTGGAAGATG
GGCCCCGCGC TTGCCGCCGG CAACGCCTTC ATCCTGAAGC CGTCCGAGCG CGATCCGTCC
GTGCCGCTGA TGCTGGCCGA GATCTTCCAG GAGGCGGGCC TGCCCGACGG CGTCCTGCAG
GTGGTGAACG GCGACAAGGA GTCGGTCGAC GCGATCCTCG ACAACCCGAC CATCGCGGCG
GTGGGCTTCG TGGGCTCGAC CCCGATCGCG GAATATATCT ATTCCCGCGG CTGCGCGAAC
GGCAAGCGCG TGCAGTGCTT CGGCGGTGCC AAGAACCACA TGATCATCAT GCCGGATGCC
GACCTCGATC AGGCGGCCGA TGCGCTGGTG GGCGCGGGCT ACGGCGCTGC AGGCGAGCGC
TGCATGGCGA TCTCGGTCGC GGTCCCGGTG GGCGACGAGA CGGCCGATGC GCTCATCGAG
CGGCTGATCC CGCGCATCGA GAAGCTGAAG GTCGGCCCCT ACACCGCCGG CAACGACGTG
GATTACGGCC CGGTCGTGAC CGCCGCCGCG CGCGAGAACA TCCTGCGCCT CGTGCAGTCG
GGCGTGGATC AGGGCGCGAA GCTCGTGGTT GACGGTCGCA ACTTCTCGCT CCAAGGCTAC
GAGAAGGGCT TCTTCGTCGG TCCGCACCTC TTCGACCATG TCCGGCCCGA CATGGACATC
TACCGCAAGG AGATCTTCGG CCCGGTCCTC TCGACCGTCC GCGCGGCCTC TTACGAAGAG
GCGCTGAGCC TTGCCATGGA TCATGAGTAC GGCAACGGCA CCGCGATCTA CACCCGCGAC
GGCGACGCCG CCCGCGACTT CGCGGCGCGC GTGAATGTGG GGATGATCGG GATCAACGTG
CCGATCCCGG TGCCGCTGGC CTACCACACC TTCGGCGGCT GGAAGAAATC GGCCTTCGGC
GACCTGAACC AGCACGGCCC CGACTCCTTC CGCTTCTACA CCCGGACCAA GACGATCACC
TCGCGCTGGC CCTCGGGCAT CAAGGAGGGC TCCGCCTTCA ACTTCAAGGC CATGGACTGA
 
Protein sequence
MEELSHWIDG KRVKGTSGRF ADVFNPATGE VQARVPLASK DELDAAVASA AAAQPKWAAT 
NPQRRARVMM EVVRLLNRDM DKLAEALSRE HGKTIPDAKG DVQRGLEVIE FCIGAPHLLK
GEFTDSAGPG IDMYSMRQPL GVAAGITPFN FPAMIPLWKM GPALAAGNAF ILKPSERDPS
VPLMLAEIFQ EAGLPDGVLQ VVNGDKESVD AILDNPTIAA VGFVGSTPIA EYIYSRGCAN
GKRVQCFGGA KNHMIIMPDA DLDQAADALV GAGYGAAGER CMAISVAVPV GDETADALIE
RLIPRIEKLK VGPYTAGNDV DYGPVVTAAA RENILRLVQS GVDQGAKLVV DGRNFSLQGY
EKGFFVGPHL FDHVRPDMDI YRKEIFGPVL STVRAASYEE ALSLAMDHEY GNGTAIYTRD
GDAARDFAAR VNVGMIGINV PIPVPLAYHT FGGWKKSAFG DLNQHGPDSF RFYTRTKTIT
SRWPSGIKEG SAFNFKAMD