Gene Information |
Locus tag | Cthe_0345 |
Symbol | |
ID | 4808494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 434110 |
End bp | 435066 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105759 |
Product | malate dehydrogenase (NAD) |
Protein accession | YP_001036776 |
Protein GI | 125972866 |
COG category | [C] Energy production and conversion |
COG ID | [COG0039] Malate/lactate dehydrogenases |
TIGRFAM ID | [TIGR01771] L-lactate dehydrogenase |
|
|
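The coordinates and lengths listed in the Gene Information table are mutually consistent, and the arithmetic can be checked with a short Python sketch. It assumes that the start and end positions are 1-based and inclusive, and that the 957 bp include the terminal stop codon. The function names are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(434110, 435066)
print(length_bp)                  # 957
print(protein_length(length_bp))  # 318
```

Both values match the Gene Length and Protein Length rows for Cthe_0345.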
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.569498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATGG TAAAAAGTAG GTCAAAAGTT GCAATCATTG GTGCTGGTTT TGTAGGTGCG TCTGCAGCCT TCACAATGGC TTTGCGGCAA ACCGCAAATG AACTTGTTCT CATCGATGTT TTCAAGGAAA AAGCCATAGG CGAGGCTATG GATATTAACC ACGGTCTTCC ATTTATGGGA CAGATGTCAT TGTATGCCGG TGATTATTCC GACGTTAAAG ACTGTGATGT TATCGTAGTC ACGGCCGGAG CCAACAGAAA ACCTGGTGAA ACACGTCTTG ACCTTGCAAA GAAAAACGTT ATGATTGCAA AAGAAGTAAC TCAAAACATC ATGAAGTATT ACAACCATGG TGTAATACTT GTAGTATCCA ATCCTGTTGA CATTATAACT TATATGATCC AAAAATGGTC AGGCCTCCCT GTGGGAAAAG TTATAGGTTC AGGTACCGTA CTTGACAGTA TCAGATTCAG ATACTTGTTA AGCGAAAAAT TGGGCGTTGA CGTAAAGAAT GTACACGGCT ACATAATAGG CGAACACGGT GATTCACAGC TTCCGTTGTG GAGCTGCACA CATATCGCCG GTAAAAATAT CAACGAATAT ATCGATGATC CGAAATGCAA TTTCACAGAA GAAGACAAGA AAAAAATCGC TGAAGATGTT AAAACTGCGG GTGCAACCAT TATCAAGAAC AAAGGTGCAA CATACTATGG TATTGCAGTT TCAATCAACA CAATAGTTGA AACACTCCTT AAGAATCAGA ATACAATAAG AACCGTAGGA ACCGTTATAA ACGGCATGTA TGGAATAGAA GATGTTGCAA TAAGCCTTCC ATCCATCGTA AATTCCGAAG GTGTTCAGGA AGTTCTCCAA TTTAATCTGA CTCCTGAAGA AGAAGAAGCT TTAAGATTCT CAGCGGAGCA GGTTAAAAAA GTATTGAACG AAGTTAAGAA TTTATAA
|
Protein sequence | MEMVKSRSKV AIIGAGFVGA SAAFTMALRQ TANELVLIDV FKEKAIGEAM DINHGLPFMG QMSLYAGDYS DVKDCDVIVV TAGANRKPGE TRLDLAKKNV MIAKEVTQNI MKYYNHGVIL VVSNPVDIIT YMIQKWSGLP VGKVIGSGTV LDSIRFRYLL SEKLGVDVKN VHGYIIGEHG DSQLPLWSCT HIAGKNINEY IDDPKCNFTE EDKKKIAEDV KTAGATIIKN KGATYYGIAV SINTIVETLL KNQNTIRTVG TVINGMYGIE DVAISLPSIV NSEGVQEVLQ FNLTPEEEEA LRFSAEQVKK VLNEVKNL
|
| |
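The gene sequence above begins with ATG and ends with TAA, so it appears to be given 5' to 3' on the coding strand even though the gene lies on the minus strand of the replicon. Under that assumption, the following standard-library sketch shows how the GC content and the protein sequence could be recomputed from the record. The helper names are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs only in its permitted start codons).

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-base groups."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with the record's spacing.
first_codons = "ATGGAAATGG TAAAAAGTAG GTCAAAAGTT"
print(translate(first_codons))  # MEMVKSRSKV
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content rows of the record.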