Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3302 |
Symbol | |
ID | 4023812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3656747 |
End bp | 3658360 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637963506 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_570427 |
Protein GI | 91977768 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.959449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCTTC AAGGTGATCT CTGCATTTTG GCAGGCGTCT CACGCGCGCA GGCCCTTCCC AAAGATCAAA AAAGGCTGTT CAAGGGGACC AGAAAACAGC CTTTCAGGAG GATACGGATG CGCACGATCG GGCACTTCAT CGGCGGCAAA GAGGTCGAAG GCAAGTCGGG ACGTTTCGCC GACGTATTCG AGCCGATGAC CGGCGAGGTC AAAGCCAAGG TCGCGCTCGC CACCCGCGCC GAGCTGCGCG CCGCCGTCGA GAACGCCAGG GCCGCGCAGC CGGAATGGGG CGCGACCAAC CCGCAGCGCC GCGCCCGCGT GATGATGAAG TTCCTCGACC TGGTGCAGCG CGACTACGAC AAGCTCGCCG AGTTGCTGGC GCGCGAGCAC GGCAAGACCA TTCCGGACGC CAAGGGCGAC ATCCAGCGCG GTCTCGAAGT CGCCGAGTTC GCCTGCGGCA TTCCGCATCT GATGAAGGGC GAATACACCG AAGGCGCCGG CCCCGGCATC GACATCTATT CGATGCGCCA GCCGCTCGGC GTGGTCGCCG GCATCACCCC GTTCAACTTC CCGGCGATGA TCCCGATGTG GAAGTTCGCC CCCGCGATCG CCTGCGGCAA CGCTTTCATC CTGAAGCCGT CGGAGCGCGA TCCCGGCGTG CCGATGGCGC TTGCGGCTTT GATGATCGAG GCGGGGCTGC CGCCCGGCAT CCTCAACGTC GTCAACGGCG ACAAGGAGGC GGTCGACGCC ATTCTCGATG ACGCCGACAT CCGCGCGGTC GGCTTCGTCG GCTCGTCGCC GATCGCGCAA TATATTTACG AGCGCGCGGC GGCGACCGGC AAGCGCGCCC AGTGCTTCGG CGGCGCCAAG AATCACGCCA TCATCATGCC CGACGCCGAC ATCGACCAGA CCGTCGACGC GCTGATCGGC GCGGGTTTCG GCTCGGCCGG CGAACGCTGC ATGGCGATCT CGGTCGCGGT GCCGGTCGGT AAGGCGACCG CGGACAGGCT GATGGAAAAG CTGATCCCGC GCGTCGAAGC GCTGAAGATC GGTCCCTCGA CCGATCCATC GGCCGATTTC GGTCCGCTGG TGACGCGTGA GGCGCTGGAG CGCGTCAAGA ACTACGTCGA TATCGGCGTC AAGGAGGGCG CGACGTTGGC CGTCGACGGC CGCAATTTCA AGCTGCAAGG CTATGAGAAC GGCTTCTACA TGGGCGGCTG CCTGTTCGAC AACGTCACCC GCGACATGCG GATCTACAAG GAAGAGATCT TCGGCCCGGT GCTGAGCGTG GTGCGCGCCC ACGATTACGC CGAAGCGCTC GCGCTGCCCT CGGACCATGA TTACGGCAAC GGCGTTGCGA TCTTCACCCG CGACGGCGAC GCCGCCCGCG ACTTCGCGGC GAAGGTCAAT GTCGGCATGG TCGGCATCAA CGTGCCGATC CCGGTGCCGA TCGCCTACTA CACGTTCGGC GGCTGGAAGA AGTCCGGCTT CGGCGATCTC AACCAGCACG GTCCGGATTC GGTGCGGTTC TACACCAAGA CCAAGACCGT CACCTCGCGC TGGCCCTCCG GCGTCAAGGA AGGCGCGGAG TTCTCGATCC CGTTGATGAA GTAG
|
Protein sequence | MPLQGDLCIL AGVSRAQALP KDQKRLFKGT RKQPFRRIRM RTIGHFIGGK EVEGKSGRFA DVFEPMTGEV KAKVALATRA ELRAAVENAR AAQPEWGATN PQRRARVMMK FLDLVQRDYD KLAELLAREH GKTIPDAKGD IQRGLEVAEF ACGIPHLMKG EYTEGAGPGI DIYSMRQPLG VVAGITPFNF PAMIPMWKFA PAIACGNAFI LKPSERDPGV PMALAALMIE AGLPPGILNV VNGDKEAVDA ILDDADIRAV GFVGSSPIAQ YIYERAAATG KRAQCFGGAK NHAIIMPDAD IDQTVDALIG AGFGSAGERC MAISVAVPVG KATADRLMEK LIPRVEALKI GPSTDPSADF GPLVTREALE RVKNYVDIGV KEGATLAVDG RNFKLQGYEN GFYMGGCLFD NVTRDMRIYK EEIFGPVLSV VRAHDYAEAL ALPSDHDYGN GVAIFTRDGD AARDFAAKVN VGMVGINVPI PVPIAYYTFG GWKKSGFGDL NQHGPDSVRF YTKTKTVTSR WPSGVKEGAE FSIPLMK
|
| |