Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0290 |
Symbol | |
ID | 4808508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 362395 |
End bp | 363681 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105702 |
Product | homoserine dehydrogenase |
Protein accession | YP_001036722 |
Protein GI | 125972812 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000435351 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAG TGAAAATTGG TTTCCTTGGT TTTGGAAATG TTGGAACCGG AGCTTATAAA ATTATTCGTG ATAACGGTAA TGATATTTTT CTAAGTGAAG GAGTTCGGCT GAAAGTTGCG AAAATTCTTG TAAGAGATAT TAATAAGAAA AGAAATGTGG AAGTTGACAA TTCTATTCTG ACGGATAAAT TTGAAGAAAT AGTAAATGAC CCCGGAATCT CCATTGTTGC TGAATTCATG GGCGGTGTGG AGCCTGCCAG GGATTATGTT GTTTCCTGCC TTAAGAATAA AAAAACCGTT GTGACTGCAA ACAAAGAGCT TATTGCCAAA CATGGATTGG AGCTTCAGGC GCTTGCCAAA GAAAACGGAG TGGGTCTTTA TTATGAAGCA AGTGTGGCAG GAGGTATTCC GGTTATAAAG ATTCTTGCAG AGTCGCTTCA GGCAAACAAA ATTGGCGAAA TTATGGGGAT TATTAATGGT ACCACTAACT ATATACTTAC AAAGATGTCC GAGGAGGGAA GAAGTTTCAG TGATGTGCTT GCAGAAGCAC AAAGACTGGG GTATGCGGAA CCGGATCCTA CTGCTGACAT AGAAGGCTAT GATGCTATGT ATAAAATTTC CATATTATCC TCCATGGCAT TTCATAAAAA GGTGGATGTT GACAAAATCT ACAGGGAGGG TATTACCCAA ATTACCCCGG AAGATATTGA ATACGGCAGG GAGCTGGGGT TTGCAATCCG TCTTCTTGCC ATTGCCAAAA AGCGCAACAA TACCATCGAA GTAAGAGTGC ATCCGACATT TATACCTCTT GACCATCCTT TGGCTGCTGT AAGGCATTCC TTTAATGCAG TGTTCCTTAA AGGGGACGCC GTGGGGAATA TAATGCTGTA CGGCAGAGGA GCCGGGGACC TTCCTACGGG AAGTGCAATT GTGTCGGATA TAATTACTGC ATGCCATCAG AAGGACAAGC ACAGATATAT CAGTTTTTAC AATGACGAAG AAGGCTCAGC TGAGAAGATA AAATTCAATG ATGACTGGGA AAGCGAATTT TTTGTCAGAC TTACGGTAAA GGACAAGCCG GGAGTTCTTG CCAAAATTGC CGGATGCTTT GGCAAACACG GAGTGAGTAT AGCATCGGTA ATTCAGAAGG ACAGGGGAAA GGACGCCGTT CCTTTGATAT TTGTAACCCA TTTGGCCAAG GAGCTTTCAA TGAAAAAAGC CATATCTGAT ATTGCAGAAG TTGAGGATGT GTTAATGGTT GAAAATATCA TACCGGTTGA GCGTTAA
|
Protein sequence | MEEVKIGFLG FGNVGTGAYK IIRDNGNDIF LSEGVRLKVA KILVRDINKK RNVEVDNSIL TDKFEEIVND PGISIVAEFM GGVEPARDYV VSCLKNKKTV VTANKELIAK HGLELQALAK ENGVGLYYEA SVAGGIPVIK ILAESLQANK IGEIMGIING TTNYILTKMS EEGRSFSDVL AEAQRLGYAE PDPTADIEGY DAMYKISILS SMAFHKKVDV DKIYREGITQ ITPEDIEYGR ELGFAIRLLA IAKKRNNTIE VRVHPTFIPL DHPLAAVRHS FNAVFLKGDA VGNIMLYGRG AGDLPTGSAI VSDIITACHQ KDKHRYISFY NDEEGSAEKI KFNDDWESEF FVRLTVKDKP GVLAKIAGCF GKHGVSIASV IQKDRGKDAV PLIFVTHLAK ELSMKKAISD IAEVEDVLMV ENIIPVER
|
| |