Gene Rsph17025_1389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1389 
Symbol 
ID5083063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1418493 
End bp1419992 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content68% 
IMG OID640482947 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001167591 
Protein GI146277432 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAC TGAGCCACTG GATCGACGGC AAGCACGTCA AGGGCAGCTC GGGCCGCTTT 
GCCGATGTCT TCAACCCCGC CACCGGCGAG GTGCAGGCCC GGGTGCCGCT CGCCTCGAAG
GCCGAACTCG ACGCCGCCGT GGCCTCGGCC GCCGAGGCGC AGGTGAAATG GGGCGCCACC
AACCCGCAGC GCCGCGCCCG CGTGATGATG GAGGTGGTGC GCCTTCTGAA CCGCGACATG
GACAAGCTGG CCGAGGCGCT GAGCCGCGAG CACGGCAAGA CCCTGCCCGA CGCCAAGGGC
GACGTGCAGC GCGGGCTTGA GGTGATCGAG TTCTGCATCG GCGCGCCGCA CCTTCTGAAG
GGCGAGTTCA CCGACAGCGC GGGCCCCGGC ATCGACATGT ATTCGATGCG CCAGCCGCTG
GGCGTGGCCG CGGGCATCAC GCCGTTCAAC TTCCCGGCCA TGATCCCGCT CTGGAAGATG
GGCCCGGCGC TGGCCGCGGG CAACGCCTTC ATCCTGAAGC CCTCCGAGCG CGACCCCTCG
GTTCCCCTGA TGCTGGCCGA GATCTTCCAG GAGGCCGGCC TGCCCGATGG CGTCCTGCAG
GTGGTGAACG GCGACAAGGA GGCGGTGGAC GCCATCCTCG ACAATCCCAC GATCGCCGCC
GTGGGCTTCG TCGGCTCGAC CCCGATCGCG GAATACATCT ATTCCCGCGG CTGCGCGAAC
GGCAAGCGCG TGCAGTGCTT CGGCGGCGCC AAGAACCACA TGATCATCAT GCCGGACGCC
GACCTGGATC AGGCGGCCGA TGCGCTGGTG GGCGCGGGCT ACGGCGCGGC GGGGGAACGC
TGCATGGCGA TCTCGGTCGC GGTGCCGGTG GGCGACGAGA CCGCCGACGC GCTGATCGAG
CGGCTGATCC CGCGGATCGA GAAGCTGAAG GTCGGCCCCT ATACGGGCGG CAACGACGTG
GACTACGGCC CGGTGGTCAC CGCGGCGGCG AAGGAGAACA TCCTGCGCCT TGTGAACTCG
GGCATCGAGC AGGGCGCGAA GCTGGTGGTG GACGGGCGCA ACTTCGCGCT GCAGGGCTAC
GAGAGCGGCT TCTTCGTCGG CCCGCATCTC TTCGACCACG TCACGCCCGA GATGGACATC
TACCGCAAGG AGATCTTCGG CCCGGTGCTT TCGACCGTCC GCGCGGCCTC CTACGAAGAG
GCGCTCGGCC TTGCGATGCA CCACGAATAC GGCAACGGCA CGGCGATCTT CACCCGCGAC
GGCGACGCGG CGCGCGACTT CGCCAACCGG GTGAACGTGG GGATGATCGG GATCAACGTG
CCGATCCCGG TGCCGCTGGC CTATCACACC TTTGGCGGCT GGAAGAAATC GGCCTTCGGC
GACCTGAACC AGCACGGGCC GGACGCCTTC CGCTTCTACA CCCGCACCAA GACCATCACC
TCGCGCTGGC CGAGCGGGAT CAAGGAAGGC TCGGCCTTCA ACTTCAAGGC GATGGACTGA
 
Protein sequence
MEELSHWIDG KHVKGSSGRF ADVFNPATGE VQARVPLASK AELDAAVASA AEAQVKWGAT 
NPQRRARVMM EVVRLLNRDM DKLAEALSRE HGKTLPDAKG DVQRGLEVIE FCIGAPHLLK
GEFTDSAGPG IDMYSMRQPL GVAAGITPFN FPAMIPLWKM GPALAAGNAF ILKPSERDPS
VPLMLAEIFQ EAGLPDGVLQ VVNGDKEAVD AILDNPTIAA VGFVGSTPIA EYIYSRGCAN
GKRVQCFGGA KNHMIIMPDA DLDQAADALV GAGYGAAGER CMAISVAVPV GDETADALIE
RLIPRIEKLK VGPYTGGNDV DYGPVVTAAA KENILRLVNS GIEQGAKLVV DGRNFALQGY
ESGFFVGPHL FDHVTPEMDI YRKEIFGPVL STVRAASYEE ALGLAMHHEY GNGTAIFTRD
GDAARDFANR VNVGMIGINV PIPVPLAYHT FGGWKKSAFG DLNQHGPDAF RFYTRTKTIT
SRWPSGIKEG SAFNFKAMD