Gene Information |
Locus tag | Mext_1643 |
Symbol | |
ID | 5832584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1833190 |
End bp | 1834152 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641367441 |
Product | malate dehydrogenase, NAD-dependent |
Protein accession | YP_001639113 |
Protein GI | 163851070 |
COG category | [C] Energy production and conversion |
COG ID | [COG0039] Malate/lactate dehydrogenases |
TIGRFAM ID | [TIGR01763] malate dehydrogenase, NAD-dependent |
|
|
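The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported lengths) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1833190
end_bp = 1834152

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 963
print(protein_length)  # 320
```

Both results agree with the Gene Length (963 bp) and Protein Length (320 aa) fields.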
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.160131 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.329919 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGCA GCAAGATCGC GCTCATCGGC GCCGGACAGA TCGGCGGCAC GCTGGCCCAT CTCGCCGGCC TCAAGGAACT CGGTGACGTG GTGCTGTTCG ACATCGTCGA CGGCGTGCCG CAGGGCAAGG CGCTCGACAT TGCCGAGTCC GCTCCCGTCG ACGGCTTCGA CGCCAAGTAT TCGGGCGCCA GCGACTATTC CGCGATCGCC GGGGCCGACG TGGTGATCGT GACGGCGGGC GTGCCGCGCA AGCCCGGCAT GAGCCGTGAC GACCTCATCG GCATCAACCT GAAGGTGATG GAGGCGGTCG GCGCCGGCAT CAAGGAGCAC GCGCCCGACG CCTTCGTGAT CTGCATCACC AACCCGCTCG ACGCGATGGT GTGGGCGCTG CAGAAGTTCT CGGGGCTGCC CACCAACAAG GTCGTCGGCA TGGCCGGCGT GCTCGACTCC GCCCGCTTCC GCCACTTCCT GGCCGAAGAG TTCGGCGTCT CGGTCGAGGA CGTGACCGCC TTCGTGCTCG GCGGCCACGG CGACGACATG GTGCCGCTGA CCCGCTACTC GACGGTGGCC GGCGTGCCGC TGACCGATCT GGTCAAGCTC GGCTGGACCA CCCAGGAGAA GCTCGACGCC ATGGTCGAGC GCACCCGCAA GGGCGGCGGC GAGATCGTCA ACCTCCTGAA GACCGGCTCG GCCTTCTACG CGCCCGCCGC CTCCGCCATC GCCATGGCCG AGAGCTACCT GCGCGACAAG AAGCGGGTTC TGCCCTGCGC CGCCTACCTC GACGGCCAGT ACGGCATCGA CGGCCTCTAT GTCGGCGTGC CGGTGGTGAT CGGCGAGAAC GGCGTCGAGC GTGTGCTCGA GGTGACCTTC AACGACGACG AGAAGGCGAT GTTCGAGAAG TCGGTCAATT CGGTGAAGGG CTTGATCGAG GCCTGCAAGA GCGTCAACGA CAAGCTCGCG TAA
|
Protein sequence | MARSKIALIG AGQIGGTLAH LAGLKELGDV VLFDIVDGVP QGKALDIAES APVDGFDAKY SGASDYSAIA GADVVIVTAG VPRKPGMSRD DLIGINLKVM EAVGAGIKEH APDAFVICIT NPLDAMVWAL QKFSGLPTNK VVGMAGVLDS ARFRHFLAEE FGVSVEDVTA FVLGGHGDDM VPLTRYSTVA GVPLTDLVKL GWTTQEKLDA MVERTRKGGG EIVNLLKTGS AFYAPAASAI AMAESYLRDK KRVLPCAAYL DGQYGIDGLY VGVPVVIGEN GVERVLEVTF NDDEKAMFEK SVNSVKGLIE ACKSVNDKLA
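The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that spacing, computes a GC fraction, and translates a coding sequence with the codon assignments used by translation table 11 (which, for amino acid assignments, match the standard code). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the example uses only the first 30 bases of the gene sequence.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 shares these assignments with the standard code; it differs only
# in which codons may act as start codons, which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGGCACGCA GCAAGATCGC GCTCATCGGC"
print(translate(prefix))  # MARSKIALIG
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.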