Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1852 |
Symbol | |
ID | 4809403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2198560 |
End bp | 2199450 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107271 |
Product | Hsp33 protein |
Protein accession | YP_001038266 |
Protein GI | 125974356 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1281] Disulfide bond chaperones of the HSP33 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000569764 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGATT ATATTGTCAG AGCTACAGCA AAAGAAGGCA CAATAAGGGC TCTTGCAGCA ATTACAACAA ACATGGTTAA AGAGGCGCAA AAAGTCCATG GACTGTCGCC GCTTGCCACA GTCGCTTTAG GAAGGACAAT GACTGCGGCA GCCATGATGT CCACAACCTT GAAGGAGGAA AATGCAGTCA TAACGCTGCA AATCAAGGGG GATGGGCCGA TAGGCGGAAT TGTCGTAGTG GTTGATTCGT CTGCAAATGT AAAAGGGTAT GTTCACAACC CGCTGGTGTA TCTGCCTTTA AACAGCCAGG GGAAATATGA TGTTGCCGGA GCTGTCGGAA ACGGATATTT GAATGTAATA AAAGATTTGG GATTGAGAGA GCCTTACGTG GGACATGTTG ATCTTGTTTC CGGTGAGATT GCCGAGGATA TTACATATTA TTATGCATAT TCTGAACAAG TTCCCACGGC CACTGCTTTG GGAGTGCTGA CCAATGCCAC CGAAATTGTT GTAAGCGCGG GAGGATTTAT TTTGCAGTTG ATGCCGGGAG CTGATGACGA CACAATTTCG TTTATTGAAA ACAAGATAAG TTCAATACCG CCGGTTTCGA CGCTTTTGGC GCAAAACAAA AGTCCTGAGG ATATTCTTGA GATGCTTCTT TCCGAAAAAG ATATGAAAAT AATTGGCAAG TCCCCCTGCA GATACCTGTG CAACTGCTCA AGAGAGCGAA TGGAGAGAAA TATAATGACT TTGGGCAAAG AAGAGATAAT GGGTATGATC AACGAAAACC ACGGGGCGGA GGCGCATTGC CATTTTTGCA ATAAGAAATA CTGGTTTTCG GAAGAGGACC TTTTAAGGCT TGTAAAAATA ATAGAATCAC AAAAAAGTTG A
|
Protein sequence | MEDYIVRATA KEGTIRALAA ITTNMVKEAQ KVHGLSPLAT VALGRTMTAA AMMSTTLKEE NAVITLQIKG DGPIGGIVVV VDSSANVKGY VHNPLVYLPL NSQGKYDVAG AVGNGYLNVI KDLGLREPYV GHVDLVSGEI AEDITYYYAY SEQVPTATAL GVLTNATEIV VSAGGFILQL MPGADDDTIS FIENKISSIP PVSTLLAQNK SPEDILEMLL SEKDMKIIGK SPCRYLCNCS RERMERNIMT LGKEEIMGMI NENHGAEAHC HFCNKKYWFS EEDLLRLVKI IESQKS
|
| |