Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1623 |
Symbol | |
ID | 4601026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1569171 |
End bp | 1570187 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 639774396 |
Product | D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding |
Protein accession | YP_921021 |
Protein GI | 119720526 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1052] Lactate dehydrogenase and related dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.215019 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACAAAA TAGCTGTTGT TAATTCAAGG TCTTTCGGGG TAATTGCGCC CGATGTTCTA CAAAAATTGC AGGAAGTGGC TCAGATCGAT TTTATCGACG TCGAAAAAAC TCTGAGAGGT AAAGCGCTAG CTGAAAAGCT CGAAGGATAC CATTTCATCA TTGCTAGTGT AACTCCTCTC TACGATAGAG AGTTCTTCGA GAACAACCGA AGCTTATTGC TCATAGCAAG GCATGGGATC GGTTACGATA ACGTAGATGT AGATGCGGCG ACAGAGCAGG GCGTAATAGT TACAAGGGTT CCTGGATCAA GAGAGAGAGA CGCTGTGGCG GAACTTGCTG TGGCCTTGTG CCTAAACGTA GCGCGTAAGG TTTGCCAGGC CGCCACGCTG GTTCGAGAGG GAAAATGGGC TGAGAGGGGT AAAATTGTCG GTGTTAATAT CTCAGGAAAG ACTGTTGGCA TAATCGGTTT AGGGAACATT GGTAGTAGAG TTGCAGAAAT TTTCTCTAGA GGTTTCAATG CCAAAGTTGT AGCGTACGAC CCATTTGTAG GAAAAGACTA TGCCGCCCGG TTTGGAGCCG AGCTTGTAGA TCTGGATACG CTCCTAAGGG AGTCCGATAT AATACTGCTA CATGCACCTC TCACGAAGGA AACATACCAC ATGATCGGCG AGAAAGAGAT AGATAAAATG AAGAAAGGAG TCATAGTAGT AAATACTGCT AGAGGCGAGC TGATAGATAC TAATGCTCTT ATAAAGGGCT TAGAGTCGGG AAAAATTGCA GGAGTAGGCC TTGACGTCGT AGAAGGAGAA CCCATAGGAG CAGATCATCC TCTGCTAAAG TATAGAAACG TCGTTATCAC GCCGCACATA GGTGCAAATA CCTACGAGGG ACTCAGGGGG ATGGACGAAG CCAATGCTGA TGCAATACTA AAGGTTATCC GCGGAGAAGC ACCGCTTGAG TACATGGTAA ATCCAGAGGT TCTTAAGAGG GGTACCAGGG CTAATCTAAG AACTTAG
|
Protein sequence | MYKIAVVNSR SFGVIAPDVL QKLQEVAQID FIDVEKTLRG KALAEKLEGY HFIIASVTPL YDREFFENNR SLLLIARHGI GYDNVDVDAA TEQGVIVTRV PGSRERDAVA ELAVALCLNV ARKVCQAATL VREGKWAERG KIVGVNISGK TVGIIGLGNI GSRVAEIFSR GFNAKVVAYD PFVGKDYAAR FGAELVDLDT LLRESDIILL HAPLTKETYH MIGEKEIDKM KKGVIVVNTA RGELIDTNAL IKGLESGKIA GVGLDVVEGE PIGADHPLLK YRNVVITPHI GANTYEGLRG MDEANADAIL KVIRGEAPLE YMVNPEVLKR GTRANLRT
|
| |