Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1083 |
Symbol | |
ID | 4811381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1289870 |
End bp | 1290919 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106505 |
Product | putative homoserine kinase type II (protein kinase fold)-like protein |
Protein accession | YP_001037508 |
Protein GI | 125973598 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR02906] spore coat protein, CotS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTATAA ACCATGAACC CCTTTTTGAT GTGCTTTCAC AATATGATAT AAAGGTCGTC TCGATAAGAA ATGAAAGCTA CAAGGATAAA AAAGGTGTTT GGTGGATACA AACCCCTGAT GAATACAAAA TTCTAAAAAA GATATCAAAT TCGGAAGACA CTTTTAAATA TATATTGAGT GCTGCGGAGC ACCTAAGAAA AAACGGAGTA AATATTCCTG CTGTATACAA AACAAAGGAC GGAAAAGACT ATGTGAATAT TAACGGAACC TGCTACGTTT TATATGAGGC GGTTGAAGGC AAAAATCCTT CATATAATTC ACCTGAAGAC TTCAGGGCGA TTGTCAGAGA ACTTGCCGGA TTTCATGCCG CATCAGTGGG ATTTTCGCCT CCGGACAACA CAAAACCAAA AATTCATCTG GGTAAATGGG TTGAACAATA CACAGAACAA GTGGAAGACA TGAACAGGTT CTATCAAACC GAACTTGAGA AAAGCGAAAA CGACAGAATA GGAAAAGTAA TTATCGAAGA GTTTCCCGCC TTTTATGAAA GGGCAAAACA AGCGATTGAA GGATTGAAGG GAAAAGAATA CCAAGACTGG GTTGAAAAAG TCAAAAGCCG GGGCGGGCTT TGCCATCAGG ATTTTGCAGC TGGAAATCTT TTAAAAAATC CTTCGGGAAA AATTTTTGTT CTCGACACGG ATTCAATTAC CATAGACATT CCGGCACGGG ATATAAGAAA GCTCCTTAAC AAAATCATGA AGAAAAACGG AAAATGGGAT TTGGAAATTC TTCGCAAGTT TATACGAATT TATCAATCAG AAAATCCATT GAGTTTTTCC GAATGGACGG TTGTAAAGTT CGACCTCATG TTCCCTCATC TGTTCCTGGG AGCTATGAAT AAATTTTATT ATAAAAGAGA CAAAGAATGG AGTTTTGAAA AGTATCTGAA AAGAATAAAT GAAATGACCG CTTTGGAAAA GACCATTACA CCTGTTTTGG AAAACTTCGA CTCCATTGTT TATGAAGAGA TTAATCAAAG GAAGGACTGA
|
Protein sequence | MPINHEPLFD VLSQYDIKVV SIRNESYKDK KGVWWIQTPD EYKILKKISN SEDTFKYILS AAEHLRKNGV NIPAVYKTKD GKDYVNINGT CYVLYEAVEG KNPSYNSPED FRAIVRELAG FHAASVGFSP PDNTKPKIHL GKWVEQYTEQ VEDMNRFYQT ELEKSENDRI GKVIIEEFPA FYERAKQAIE GLKGKEYQDW VEKVKSRGGL CHQDFAAGNL LKNPSGKIFV LDTDSITIDI PARDIRKLLN KIMKKNGKWD LEILRKFIRI YQSENPLSFS EWTVVKFDLM FPHLFLGAMN KFYYKRDKEW SFEKYLKRIN EMTALEKTIT PVLENFDSIV YEEINQRKD
|
| |