Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2117 |
Symbol | |
ID | 3908531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2408594 |
End bp | 2410090 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884010 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_485734 |
Protein GI | 86749238 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.591091 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCACCA TCGGGCACTT CATCGGCGGC AAAGAGGTCG AGGGGAAGTC GGGCCGTTTC GCCGACGTGT TCGAGCCGAT GACCGGCGAG GTCAAGGCCA AGGTCGCGCT CGCCACCAAG GCGGAGATGC GCGCCGCGAT CGAAAACGCC AAGGCCGCGC AGCCGGAATG GGGCGCCACC AACCCGCAGC GCCGCGCCCG CGTGCTGATG AAGTTCCTCG ACCTGGTGCA GCGCGACTAC GACAAGCTCG CCGAGTTGCT GGCGCGCGAG CACGGCAAGA CCATTCCCGA CGCCAAGGGC GACATCCAGC GCGGCCTCGA AGTCGCCGAA TTCGCCTGCG GCATTCCGCA TCTGATGAAG GGTGAATACA CCGAGGGCGC CGGCCCGGGC ATCGACATCT ATTCGATGCG GCAACCCCTG GGAGTCGTCG CCGGCATCAC CCCGTTCAAC TTCCCGGCGA TGATCCCGAT GTGGAAGTTC GCGCCCGCCA TCGCCTGCGG CAACGCCTTC ATCCTGAAGC CGTCGGAGCG CGATCCCGGC GTGCCGATGG CGCTGGCCGC CCTGATGCTC GAAGCCGGCC TGCCGCCGGG CATCCTCAAC GTCGTCAACG GCGACAAGGA AGCGGTCGAC GCCATTCTCG ACGACGCCGA CATCCGCGCC GTCGGCTTCG TCGGCTCGTC GCCGATCGCG CAATACATCT ACGAGCGCGC GGCGGCGACC GGCAAGCGCG CGCAGTGCTT CGGCGGCGCC AAGAACCACG CCATCATCAT GCCCGACGCC GACCTGGACC AGACCGTCGA CGCGCTGATC GGCGCCGGCT ACGGCTCGGC CGGCGAGCGC TGCATGGCGA TCTCGGTCGC GGTGCCGGTC GGCAAGTCGA CCGCCGACAG GCTGATGGAA AAACTGATCC CGCGCGTCGA GGCGCTGAAG ATCGGTCCCT CGACCGATCC GTCCGCCGAT TTCGGCCCGC TGGTCACCAG GGAAGCGCTG GAGCGCGTCA AGACCTACGT CGAGATCGGC GTCCAGGAAG GCGCCACGCT CGCCGTCGAC GGTCGCGGCT TCAAGATGCA GGGCTACGAG AACGGCTTCT ACATGGGCGG CTGTCTGTTC GACAACGTCA CGAAGGATAT GCGGATCTAC AAGGAAGAGA TCTTCGGTCC GGTGCTCAGC GTGGTCCGCG CCCACGACTA CGCCGAAGCG CTGGCGCTGC CGTCCGACCA CGACTACGGC AACGGCGTCG CGATCTTCAC CCGCGACGGC GACGCCGCGC GCGACTTCGC CGCGAAGGTG AATGTCGGCA TGGTCGGCAT CAACGTGCCG ATCCCGGTGC CGATCGCCTA CTACACGTTC GGCGGCTGGA AGAAGTCCGG CTTCGGCGAC CTCAACCAGC ACGGCCCGGA TTCGGTGCGG TTCTATACCA AGACCAAGAC CGTGACCTCG CGCTGGCCCT CCGGCGTCAA GGAAGGCGCG GAGTTCTCGA TCCCGCTGAT GAAGTGA
|
Protein sequence | MRTIGHFIGG KEVEGKSGRF ADVFEPMTGE VKAKVALATK AEMRAAIENA KAAQPEWGAT NPQRRARVLM KFLDLVQRDY DKLAELLARE HGKTIPDAKG DIQRGLEVAE FACGIPHLMK GEYTEGAGPG IDIYSMRQPL GVVAGITPFN FPAMIPMWKF APAIACGNAF ILKPSERDPG VPMALAALML EAGLPPGILN VVNGDKEAVD AILDDADIRA VGFVGSSPIA QYIYERAAAT GKRAQCFGGA KNHAIIMPDA DLDQTVDALI GAGYGSAGER CMAISVAVPV GKSTADRLME KLIPRVEALK IGPSTDPSAD FGPLVTREAL ERVKTYVEIG VQEGATLAVD GRGFKMQGYE NGFYMGGCLF DNVTKDMRIY KEEIFGPVLS VVRAHDYAEA LALPSDHDYG NGVAIFTRDG DAARDFAAKV NVGMVGINVP IPVPIAYYTF GGWKKSGFGD LNQHGPDSVR FYTKTKTVTS RWPSGVKEGA EFSIPLMK
|
| |