Gene RPC_3095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3095 
Symbol 
ID3974046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3436151 
End bp3437647 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content66% 
IMG OID637926203 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_532956 
Protein GI90424586 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.769898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.126836 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTCAA TCGGGCATTT TATCGCTGGC CGCGAGGTTG CTGGAACGTC GGGTCGGTTT 
GCCGACGTGT TCGAACCGAT GACCGGCGAG GTGCAGGCGC AGGTTGCGCT CGCGACCAAA
GCCGAGCTGC GCGCCGCGGT GGAAGATGCC AAGCAAGCGC AGCTGGCGTG GGGCGCCACC
AACCCGCAGC GTCGCGCCCG GGTGATGATG AAATTCCTCG AGTTGGCGCA GCGCGACTAC
GACAAGCTCG CCGAGTTGCT CGCCAGCGAG CACGGCAAGA CCGTGCCGGA CGCCAAGGGT
GACATCCAGC GCGGCCTCGA AGTGGTGGAA TTCGCCTGCG GCATTCCGCA CCTGATGAAG
GGCGAATACA CCGAAGGCGC CGGCCCCGGC ATCGACATCT ATTCGATGCG GCAGCCGCTC
GGCGTGGTCG CCGGCATCAC CCCGTTCAAT TTCCCGGCGA TGATCCCGAT GTGGAAATTC
GCCCCGGCGA TCGCCTGCGG CAACGCCTTC ATCCTCAAGC CCTCGGAGCG CGACCCCGGG
GTTCCGATGG CGCTCGCCGC GTTGATGATC GAGGCCGGAT TGCCGCCGGG CATCCTCAAC
GTCGTCAACG GCGACAAAGA GGCGGTCGAC GCCATCCTGG ACGACCCGGA TATTCGCGCG
GTCGGCTTCG TCGGTTCCTC GCCGATCGCG CAATACATCT ATGAGCGGGC GGCGGCGACC
GGCAAGCGCG CGCAATGCTT CGGCGGCGCC AAGAACCACG CCATCATCAT GCCCGACGCC
GATCTCGACC ACACCGTCGA CGCCCTGATC GGCGCCGGCT ACGGCTCCGC CGGCGAGCGC
TGCATGGCGA TCTCGGTCGC GGTGCCGGTC GGCAAGGCCA CCGCCGACGC GCTGATGGAG
AAGCTGATCC CGCGGGTCGA GGCGCTGAAG ATCGGGCCGT CCACCGATCC GTCGGCCGAT
TTTGGCCCGC TGGTGACCAA GGCGGCGCTG CAGCGCGTCA AGGACTACGT CAAGGTCGGG
ATCGAGGAGG GCGCGACGCT CGCGGTCGAC GGCCGCGACT TCACGCTTCA GGGCTATGAG
AACGGCTTCT ATATGGGCGG CTGCCTGTTC GACAACGTCA CCAAAGACAT GCGGATCTAC
AAAGAGGAGA TTTTTGGACC CGTGCTGAGC GTGGTCCGTG CGCACGACTA CGCCGAAGCG
CTGGCGCTGC CGTCCGACCA CGACTACGGC AATGGCGTTG CGATCTTCAC CCGCGACGGC
GACGCCGCGC GCGACTTCGC CGCCAAGGTG AATGTCGGCA TGGTCGGTAT CAACGTGCCG
ATCCCGGTGC CGATCGCCTA CTACACCTTC GGCGGCTGGA AGAAATCCGG CTTCGGCGAT
CTCAACCAGC ACGGCCCGGA TTCTATTCGC TTCTACACCA AGACCAAGAC CGTCACCGCG
CGCTGGCCAA GCGGCGTCAA AGAAGGCGCC GAATTCTCGA TCCCGACGAT GAAGTGA
 
Protein sequence
MRSIGHFIAG REVAGTSGRF ADVFEPMTGE VQAQVALATK AELRAAVEDA KQAQLAWGAT 
NPQRRARVMM KFLELAQRDY DKLAELLASE HGKTVPDAKG DIQRGLEVVE FACGIPHLMK
GEYTEGAGPG IDIYSMRQPL GVVAGITPFN FPAMIPMWKF APAIACGNAF ILKPSERDPG
VPMALAALMI EAGLPPGILN VVNGDKEAVD AILDDPDIRA VGFVGSSPIA QYIYERAAAT
GKRAQCFGGA KNHAIIMPDA DLDHTVDALI GAGYGSAGER CMAISVAVPV GKATADALME
KLIPRVEALK IGPSTDPSAD FGPLVTKAAL QRVKDYVKVG IEEGATLAVD GRDFTLQGYE
NGFYMGGCLF DNVTKDMRIY KEEIFGPVLS VVRAHDYAEA LALPSDHDYG NGVAIFTRDG
DAARDFAAKV NVGMVGINVP IPVPIAYYTF GGWKKSGFGD LNQHGPDSIR FYTKTKTVTA
RWPSGVKEGA EFSIPTMK