Gene Rpal_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3940 
Symbol 
ID6411621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4226990 
End bp4228486 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content66% 
IMG OID642713821 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001992911 
Protein GI192292306 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.90589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACGG TCGGGCATTT CATCGGCGGC AAAGAGGTCG AGGGCAAGTC GGGGCGTTTC 
GCCGACGTGT TCGAGCCGAT GACCGGCGAG GTGAAGGCCA AAGTCGCCCT CGCCACCAAA
GCCGAGCTCC GCGCAGCGGT TGAAAACGCC AAGGCCGCGC AGCCGGAATG GGGCGCCACC
AACCCGCAGC GCCGCGCCCG CGTGCTGATG AAGTTCCTCG AATTGGTGCA GCGCGATTAC
GACAAGCTCG CCGAGCTGCT CGCGCGCGAA CATGGCAAGA CCATCCCCGA CGCCAAGGGT
GACATTCAGC GCGGCCTCGA AGTCGCCGAG TTCGCCTGCG GCATTCCGCA TCTGATGAAG
GGCGAATACA CCGAGGGCGC CGGCCCCGGC ATCGACATCT ATTCGATGCG CCAGCCGCTC
GGCGTCGTCG CCGGCATCAC CCCGTTCAAC TTCCCGGCGA TGATCCCGAT GTGGAAGTTC
GCCCCGGCGA TCGCCTGCGG CAACGCCTTC ATCCTGAAGC CGTCGGAGCG TGACCCCGGC
GTGCCGATGG CGCTGGCGGC GCTGATGCTC GAAGCCGGTC TGCCGCCGGG CATCCTCAAC
GTCGTCAACG GCGACAAGGA AGCGGTCGAC GCCATCCTCG ACGATCCGGA CATCAAGGCG
GTCGGCTTCG TCGGCTCCTC GCCGATCGCG CAGTACATCT ATGAGCGTGC GGCGCAGACC
GGCAAGCGCG CGCAATGCTT CGGCGGTGCC AAGAACCACG CCATCATCAT GCCGGATGCC
GATATCGACC AGACCGTCGA CGCGCTGATC GGTGCCGGCT ACGGCTCGGC CGGTGAGCGC
TGCATGGCGA TCTCGGTCGC GGTGCCGGTC GGCAAGGCCA CCGCGGAAGC GCTGATGAGC
AAGCTGATCC CGCGCGTCGA AGCGCTGAAG ATCGGTCCGT CCACCGATCC GACCGCCGAT
TACGGTCCGC TGGTCACCAA GGAAGCGCTG GAGCGCGTCA AGAACTACGT CGATATCGGC
GTCAAGGAAG GCGCGACGCT CGCGGTCGAC GGCCGCGGCT TCAAGATGCA GGGCTACGAG
AACGGCTTCT ACATGGGCGG CTGTCTGTTC GACAACGTCA CCAAGGACAT GCGGATCTAC
AAGGAAGAGA TCTTCGGCCC CGTCCTGAGC GTCGTCCGCG CCCACGACTA TGCCGAAGCG
CTGGCGCTGC CGTCCGACCA CGACTACGGC AACGGCGTCG CGATCTTCAC CCGCGACGGT
GACGCCGCCC GCGACTTCGC CGCCAAGGTC AATGTCGGCA TGGTCGGGAT CAACGTGCCG
ATCCCGGTGC CGATCGCCTA CTACACCTTC GGCGGCTGGA AGAAGTCCGG CTTCGGCGAC
CTCAACCAGC ACGGCCCGGA CTCGATCCGA TTCTACACCA AGACCAAGAC CGTCACCTCG
CGCTGGCCGT CGGGCGTGAA GGAAGGCGCG GAGTTTTCGA TCCCGCTGAT GAAGTAA
 
Protein sequence
MRTVGHFIGG KEVEGKSGRF ADVFEPMTGE VKAKVALATK AELRAAVENA KAAQPEWGAT 
NPQRRARVLM KFLELVQRDY DKLAELLARE HGKTIPDAKG DIQRGLEVAE FACGIPHLMK
GEYTEGAGPG IDIYSMRQPL GVVAGITPFN FPAMIPMWKF APAIACGNAF ILKPSERDPG
VPMALAALML EAGLPPGILN VVNGDKEAVD AILDDPDIKA VGFVGSSPIA QYIYERAAQT
GKRAQCFGGA KNHAIIMPDA DIDQTVDALI GAGYGSAGER CMAISVAVPV GKATAEALMS
KLIPRVEALK IGPSTDPTAD YGPLVTKEAL ERVKNYVDIG VKEGATLAVD GRGFKMQGYE
NGFYMGGCLF DNVTKDMRIY KEEIFGPVLS VVRAHDYAEA LALPSDHDYG NGVAIFTRDG
DAARDFAAKV NVGMVGINVP IPVPIAYYTF GGWKKSGFGD LNQHGPDSIR FYTKTKTVTS
RWPSGVKEGA EFSIPLMK