Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1376 |
Symbol | |
ID | 4809371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1679668 |
End bp | 1680915 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106800 |
Product | homoserine dehydrogenase |
Protein accession | YP_001037801 |
Protein GI | 125973891 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000372588 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAATA TAGCAGTGAT GGGATACGGA GTGGTCGGCT CCGGAGTTGT TGAAGTTATA AGAAAAAACA GTGTCAGCAT ATCAAAGAAA GCGGGTCAGG AGATTCGCGT AAAAAAAATA CTGGATATCA GGGATTTTCC GGACAGTCCT GAACGGGATT TGTTTACAAA GAATCCTGAT GAAATTTTTG ATGACCCGTC AATAGGAATA GTTGTTGAGA CCATAGGCGG AATAGGTGCT GCGTACGAAT TTACAAAGAA GGCTTTGAGC AAAGGAAAGA ATGTTGTAAC CTCGAACAAG GAGCTGGTTG CAACCCATGG ACCTGAACTT TTGAAGCTTG CAAAGGAAAA TGGAGTAAAC TATCTGTTTG AAGCAAGTGT CGGCGGTGGA ATTCCCATTA TCAGGCCTTT GAACCGCTGC CTTGCCGCAA ATGAAATACA CAGCATCATA GGAATACTCA ACGGAACTAC GAACTACATA TTAACACAGA TGAAAAGGCA GGGAAAAGAT TTTGACGAGG CTTTGAAAGA GGCACAGCAG AAAGGATATG CGGAAGCGGA TCCAACCGCG GACATAGAAG GGCATGATGC ATGCAGAAAA ATTGCAATAC TCTCATCCAT TGCGTACAAT GAATTTGTTG ATTACAAAAA GATACATACG GAAGGCATAA AAAAAATAAG CCTTGCGGAT ATGAAATATG CCGAAAGCAT GGATTCAACC ATCAAGCTTG TCGCCATAAG CGAAAAAATC GGTGACGGTA TTATGGCAAG GGTTGCTCCT GCGATAGTAA GCAGCAAAAG TCCGCTTTAC AGTGTTGAAG ATGTTTTTAA TGCTATTGTT GTGAGAGGAG ATGCAATTGG AGAAGTGATG TTTTACGGCC CGGGAGCGGG CAAGCTCCCC ACGGCAAGCG CCGTTGTGGC GGATGTAATT GAAATTGTGA AGCATTGGGG TACCTGCGGC AGCTATAACT GGGTTGTAAA AGACGGCGGC AACGTCATTG ATTTGAAAGA AACCAGGACA AGGTATTTTG TGAGACTGAA AGTGGAGAAT GAAGCTGAAG CTAAAAAAGC GGTGGAGAAT GCTTTTGGAA ATGTTGAATG GGTAAAAGCG TATGATGCAA GTGTACAGGA TGAATTGGCA TTTGTTACTT CCTGCGTTTT GGAGAAAGAC TATTGCAATT CTCTTCAACA ACTTAAAGGC AGCAAAGCTG TAAAAGATGT GGTAAATGCC ATAAGAGTCC TGGACTAA
|
Protein sequence | MVNIAVMGYG VVGSGVVEVI RKNSVSISKK AGQEIRVKKI LDIRDFPDSP ERDLFTKNPD EIFDDPSIGI VVETIGGIGA AYEFTKKALS KGKNVVTSNK ELVATHGPEL LKLAKENGVN YLFEASVGGG IPIIRPLNRC LAANEIHSII GILNGTTNYI LTQMKRQGKD FDEALKEAQQ KGYAEADPTA DIEGHDACRK IAILSSIAYN EFVDYKKIHT EGIKKISLAD MKYAESMDST IKLVAISEKI GDGIMARVAP AIVSSKSPLY SVEDVFNAIV VRGDAIGEVM FYGPGAGKLP TASAVVADVI EIVKHWGTCG SYNWVVKDGG NVIDLKETRT RYFVRLKVEN EAEAKKAVEN AFGNVEWVKA YDASVQDELA FVTSCVLEKD YCNSLQQLKG SKAVKDVVNA IRVLD
|
| |