Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2037 |
Symbol | |
ID | 7083797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2298568 |
End bp | 2299431 |
Gene Length | 864 bp |
Protein Length | 287 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643699064 |
Product | Hsp33 protein |
Protein accession | YP_002355681 |
Protein GI | 217970447 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1281] Disulfide bond chaperones of the HSP33 family |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACGC CCCTCCCCAC CAGCTATGTG CAACGTTTCC TGCTCGAGGA CCTCGACATC CGCGGCGCGG TCGTGCGCCT CACCGACGTC TGGGAAGCCA TGCAGGACGG CCGCGGCTAT GCGCCCGCGG TGGTGCGACT GCTCGGCGAG ATGAGCGCAG TCTCCACCGT GATCGCCGGC AACCTCAAGC AGCCCGGCCG GCTGACCTTC CAGATCACCG GCCACGGCCC CGTCAGCCTG CTGGTGATCG ACTGCGCCGA GACCCTGAAC CTGCGCGGTT ACGCCAAGGC CGAGGGCACG CCCGCCGCGG GCACGCTGGT CGAGCTCGTC GGCGACGGGC GCCTGCAACT CTCCCTCGAC ATCGAAGGCC TCGATCAGCC CTACCAGAGC CTGGTGCCGC TGGAAGGCGA CAGCATCGCC GAGGTCTTCG AGCACTATCT CGTCCAGTCC GAGCAGCAGC CCGCCAGGCT GTGGCTGGCG TGCAGCGCGC AGGCGGCGGT GGCCCTGTTC GTGCAGAAAC TGCCCGGTGC CGACCTCAAG GACATCGACG GTTGGTCGCG CGTCCAGCAG CTGGCCCACA CCGTGCGCGA GGACGAGCTG CTCGGCCTCG ACGCCGAGCA GATCCTGCGC CGCCTGTTCG CCGAGGAGGA CATCCGCCTC TTCGACGCGC GTCCGGTCAC CCACGAATGG CCGGCCGATC CGGACAAGAT CGCCGAGATG TTGCGCGCAC TCGGCGAGGA CGAAGTACGC ACGGTCCTGG ACCAGCATGG CGAGGTGGTG GTGCATGACG ATCTGTCCAA CCACACCTAC CGTTTCGATC GCAGCGACGT GGACGCGCTC TTCCGCCCGC CCACCCTGCA TTGA
|
Protein sequence | MKTPLPTSYV QRFLLEDLDI RGAVVRLTDV WEAMQDGRGY APAVVRLLGE MSAVSTVIAG NLKQPGRLTF QITGHGPVSL LVIDCAETLN LRGYAKAEGT PAAGTLVELV GDGRLQLSLD IEGLDQPYQS LVPLEGDSIA EVFEHYLVQS EQQPARLWLA CSAQAAVALF VQKLPGADLK DIDGWSRVQQ LAHTVREDEL LGLDAEQILR RLFAEEDIRL FDARPVTHEW PADPDKIAEM LRALGEDEVR TVLDQHGEVV VHDDLSNHTY RFDRSDVDAL FRPPTLH
|
| |