Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1942 |
Symbol | |
ID | 4810725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2317632 |
End bp | 2318531 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107358 |
Product | hypothetical protein |
Protein accession | YP_001038353 |
Protein GI | 125974443 |
COG category | [R] General function prediction only |
COG ID | [COG5401] Spore germination protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.141052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAGAC TTTTGTGTGT TATTACTGTA TGCATCATGA TATTATCCTT TTGTGCAGGT TGTTCAGATA ACGACAGGTT TGGATTGGGA GACAACGAAG AAGCAAGTGT TGCAAGCAGC ATTGTGCTAA GTGAGGCTGA TGCCCGGCAG ATTGCCGACA AATCGCCGAT ACGTCTTTAT TTTGCAAATG AGGATAATTC CAAACTAAAG CTGGAGATAA GATATATACC GGTTAGTGAA ACCACCAAAA GTGTAAACCA CCTTGCCGAA ATAATTGTCA ACGAATTGAT CAAAGGTCCC AAAGTTGCAG GATTAAAGCC TACAATACCG GAAGGTACCA AACTACGCTC AGCGATAAAG ATTGAGGGAG ACGTTGCCAT TGTCGATTTT ACGAAGGAGT TCAGGGATAA TCATCCGGGA GGAAAAGCGG AAGAAAGAAT GACAATTTAC TCGGTTGTAA ACTCGTTGAC GGAGCTGAAG GAGATAAACA AGGTGAAATT TTTGATAGAA GGAAAGTCGT CTCCTGACTT CAAGGGAAAT TTCAGATTTA ATACCGAATT TCCAAGAAGT ACCCAGCTGA TAAGCAATAA GGCGGAACCG GTGGGGACTG TGAACAGCAA GGATGCGGCG GACAAAAAAG ATGATTCGGC TTCCGGAGAC AAAGATACCG CTGATGTCAG CAATGAGACG GAAAACGAGA GTGTCGAAAC GGGAGCCCAA ACATCGGATG ATGAAGAAGT TTTTGTTGAT GATTCCTTAG ACGGCGAAGC ACAGGAAACT TACCAGGAAA CCAACGACGA TGAAAAGTGG CAGGAAACTT ACGACGAGGC TTCCGATGAA GAGGCCCGGC AGACATTTAG TGATGAATTT GAAGAAACAT ACATAGAATA CCTTGAATAA
|
Protein sequence | MRRLLCVITV CIMILSFCAG CSDNDRFGLG DNEEASVASS IVLSEADARQ IADKSPIRLY FANEDNSKLK LEIRYIPVSE TTKSVNHLAE IIVNELIKGP KVAGLKPTIP EGTKLRSAIK IEGDVAIVDF TKEFRDNHPG GKAEERMTIY SVVNSLTELK EINKVKFLIE GKSSPDFKGN FRFNTEFPRS TQLISNKAEP VGTVNSKDAA DKKDDSASGD KDTADVSNET ENESVETGAQ TSDDEEVFVD DSLDGEAQET YQETNDDEKW QETYDEASDE EARQTFSDEF EETYIEYLE
|
| |