Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1084 |
Symbol | |
ID | 4811382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1290925 |
End bp | 1291962 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106506 |
Product | spore coat protein |
Protein accession | YP_001037509 |
Protein GI | 125973599 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR02906] spore coat protein, CotS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATGAAG TTGGCAAAAA CCCAAACATG GATTTGAGTA AGCTTGCAAG TTCTGTATTG GAAGAGTATG GAATCGAACC GGAAAACATT AGTGTAGTTC AAAGTGCAAA TATAAAAACC GTATGGAGAA TAAAAACGAA GGACCGTGAA CTGTGTCTCA AAAGATTGAA ACATCCATTA GACAAAGCTC TCTTTTCCGT AAACGCCCAG GATTTTATAT ACAATCATGG CGGAAATGTC GCGGGAATAA TCCGGGATAA AGAAGGAAAT CTTATTCATT CTTTCAACGA CCAGCTGTTC GTTGTATATG AATGGCTTTA CGGAAGGGAT TTGTCCTTTG TCAATGCTGA TGACTTAAAA TCCGCCCTGC ACGGCCTTGC CAAATTTCAT ATTGCGTCAA AGGGTTATGT CGCCCCGGAA GGTGCCAAAG TCTCTTCCAA GCTCGGCAGG TGGCCTGAAC AGTACAAATC CATGGCAGAC AAACTTTCTT CCTGGAAAGA AGCATCCCTG GGAAAACCTG CTTCAGCTTC TGTCAATGCT TATCTCAAAA ATGTTGACGA AATGCTTGAT ATCTGCCATC GGGCCATGGA GCTTTTAAAT GCCTCAAAAT ATGCCGAGTT GGCAGGTGAA AATTCCAAAT CGGCTGTTTT ATGCCATCAG GATTACGGCA AGGGAAATGC ACTTTTTACA GACAATGGTG TTTATGTCAT AGATCTTGAC GGAGTAACCT GGGACCATCC TGGACGGGAT CTTCGAAAAA TAATCGGCAA GCTGTCGGAG AACAGAGGAG CCTGGTCTTT GGATCAAATC GAAAAAATCC TTGACTGGTA CAGCGAAATA AATCCTCTTT CCACCGCAGA CAGGGAACTT ATTTATATTG ACCTTATGTA CCCCCACTGG TTTTTTGGCC TTGTTAAAAA CATTTTCAAG AACAATAAAA GCGAAAGTCC GTCAAAGATT GAAAAAACAG CAAGGCTGGA AACTTCCAAA GTACCATTGC TTGCCGAAAA GCTTCGGGAT ATAAAATCGC AGGGCTAA
|
Protein sequence | MNEVGKNPNM DLSKLASSVL EEYGIEPENI SVVQSANIKT VWRIKTKDRE LCLKRLKHPL DKALFSVNAQ DFIYNHGGNV AGIIRDKEGN LIHSFNDQLF VVYEWLYGRD LSFVNADDLK SALHGLAKFH IASKGYVAPE GAKVSSKLGR WPEQYKSMAD KLSSWKEASL GKPASASVNA YLKNVDEMLD ICHRAMELLN ASKYAELAGE NSKSAVLCHQ DYGKGNALFT DNGVYVIDLD GVTWDHPGRD LRKIIGKLSE NRGAWSLDQI EKILDWYSEI NPLSTADREL IYIDLMYPHW FFGLVKNIFK NNKSESPSKI EKTARLETSK VPLLAEKLRD IKSQG
|
| |