Gene Information |
Locus tag | Acid345_1533 |
Symbol | |
ID | 4073021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1870788 |
End bp | 1872260 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983542 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_590609 |
Protein GI | 94968561 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
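The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1870788
END_BP = 1872260
GENE_LENGTH_BP = 1473
PROTEIN_LENGTH_AA = 490


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # True
print(computed_protein_length == PROTEIN_LENGTH_AA)  # True
```

The strand field does not affect this check, because the interval length is the same on either strand.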
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.84974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACCA CAGCACAAAC CGAGCTTCTC CAGGTCCAGA ATTTTGTCAA CGGCGAGTGG CGGACCAGCC ACTCTGGCGA TGTTCTCGAG ATTTATAACC CCGCCACGGC TGAGCCGTTG GCGCACGTGC CCCTCTCCGG CGCCGACGAA GTGAATGAAG CGGTGCGCGC GGCCGCTGCC GCGTGGCCCG CGTGGCGCGA GACGCCGCCC GGCGACCGCA TTCAATACAT CTTCAAGCTG AAACAGCTAA TGGAGGAGCA CTTCGAAGAA ATCGCCCGAA CGGTGACGAT CGAGAACGGC AAGACGCTCA CCGAAGCACG AGGAGAAGTG CGTCGCGGCA TTGAGAACGT TGAAGTCGCC TGCGGAATTC CGCTGATGAT GCAGGGCTAC AACCTCGAAA ACATATCGCG TGGCATTGAC GAAATCATGT ATCGCCATCC AATCGGAGTC GTAGCCGCGA TCACACCGTT CAACTTTCCG GCAATGATTC CCTTCTGGTA CCTGCCCTAC GCAATCGCTA CTGGGAACTG TTTCATCCTG AAGCCAAGTG AGCGCGTACC GTTCACAATG CAGAAGGTTT TCGAACTGAT TCATCAGATA GGATTGCCGA AGGGCGTCAT CAACCTGCTG AACGGAGGTA AACCCGCGGT GGACGCTCTG CTCGACCACC CTGAGGTGCG CGCCATCAGC TTCGTCGGGT CGACGCCCGT AGCCCGTTAC ATCTACGAGC GCGCTGCAAA GAACGGCAAG CGCGTGCAAT GCCAGGGCGG CGCAAAGAAC TACGCGGTCA TCCTGCCCGA TGCCGACATG AAAGTGGCGA CGAACATCGT GGGCGAGAGC GCCTTCGGTT GTGCGGGACA ACGATGCCTG GCATTGAGTG TTGGAGTCAC TGTCGGTGAG GCCCAGAAGG GTTTCCGCGA AGCCGTCTCT GAGTTCGCCG CGCACCTCAA GACCGGCAAC GGACTCGAAG CAGGGACACA GATGGGGCCC GTGATCACCG CGCAGAGCAA ATCACGAATC GAAGAAGTTA TTGACCATGC TGTGAAGCAA GGTGCAAAAG CCGTGACCGA TGGCCGCGGC TACAGGGTTG CGAATCATGA GCGCGGCAAC TTCCTTGCGC CGACGATTCT CGATGAAGTG CCCGCCGACA GCGATGTGCC ACAGACTGAA ATCTTCGGCC CCGTGTTGAG CCTGGTGCAC GCCGACAGTC TCGAGCATGC GATTGAGCTG CTTTCCAAGA GCGCGTACGG TAATGCCGCA TCTCTCTTCA CCACCAATGG AGCGCACGCG CGACGTTTCC GCCATGAAGC GCCAGCTGGA AACATTGGCA TCAATATCGG TGTCCCTGCA CCTGTCGCCT ACTTCCCTTT CAGCGGCTGG AAGGAGAGCT TCTTCGGCGA CCTCCACGGC CAAGGTCGTG ATGCGATCGA GTTTTACACC GACAAGAAAG TCGTCATCGA GCGCTGGAGC TAA
|
Protein sequence | MSTTAQTELL QVQNFVNGEW RTSHSGDVLE IYNPATAEPL AHVPLSGADE VNEAVRAAAA AWPAWRETPP GDRIQYIFKL KQLMEEHFEE IARTVTIENG KTLTEARGEV RRGIENVEVA CGIPLMMQGY NLENISRGID EIMYRHPIGV VAAITPFNFP AMIPFWYLPY AIATGNCFIL KPSERVPFTM QKVFELIHQI GLPKGVINLL NGGKPAVDAL LDHPEVRAIS FVGSTPVARY IYERAAKNGK RVQCQGGAKN YAVILPDADM KVATNIVGES AFGCAGQRCL ALSVGVTVGE AQKGFREAVS EFAAHLKTGN GLEAGTQMGP VITAQSKSRI EEVIDHAVKQ GAKAVTDGRG YRVANHERGN FLAPTILDEV PADSDVPQTE IFGPVLSLVH ADSLEHAIEL LSKSAYGNAA SLFTTNGAHA RRFRHEAPAG NIGINIGVPA PVAYFPFSGW KESFFGDLHG QGRDAIEFYT DKKVVIERWS
|
| |
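The sequence fields are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned up, checked for GC content, and translated is shown below. It uses only the first three blocks of the gene sequence from this record, and the helper names are hypothetical. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs only in the start codons it permits, which this sketch does not model.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Remove the whitespace between printed blocks and upper-case the result."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
cds_prefix = clean("ATGAGTACCA CAGCACAAAC CGAGCTTCTC")
print(translate(cds_prefix))  # MSTTAQTELL
```

The translated prefix matches the first block of the protein sequence field. Because the gene sequence as listed already begins with ATG, it appears to be given in coding orientation even though the gene lies on the minus strand, so no reverse complement is applied here.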