Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3920 |
Symbol | |
ID | 7873566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4316322 |
End bp | 4317377 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700857 |
Product | protein of unknown function DUF306 Meta and HslJ |
Protein accession | YP_002890880 |
Protein GI | 237654566 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3187] Heat shock protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0592511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACC GTCTTTCCCT GTCCTCGAGT CTCGCGATGA AAATCTCTTC CCTGTTGGGC CTCGGTCTCC TCGCCGGCCT GGTCGGCGGC TGCAGCCTGT TCCGCCCTGC CGCGCCTGCG CTGGCCGCCG AAGCATCCCC CCAGGCCGTG GCCGAGCGCC CGGCCGACCC CACGCCGGCG CTCGATCCCC AGCCGCCGCT CGAGTTCCGC TGCGGCGAGC GCAGGGTCGA GCTGCGCCAC TTCGGCGAGC TCGCCACCCT CGGCGTCGAC GGGAAGGTGT TCACGCTGCG GCCCGAGCGC AGCGCCTCGG GCATGCGGAT GGTGTCGGTC GACGACGAGC GTACCGGCAT CTGGGTCAAG GGCAACTCCG CGCGGCTGAG CCTCGCCGGC GTCGAGCAGC CGGAGTGCCG GCGGGTGGCG GAGAAGCCGC TGTTCCGCGC GGTGGGGAAC GAGCCGGCTT GGCGGCTCGA CATCGGCGCG CACGGCCTGG AGCTGATCGC CGACGGCGGC GCCTCGCGTG CGTTCGCCGC TGCGCCGCTG ATCGTGGAGG CGGCGGGCAT GCGCAGCTAC CAGGGCACGA GCGTCGGCGG CGAGCTGGAG GCGCTGGTGT TCGAGCGCCT GTGTGTCGAC ACGATGAGCG GGATGACGCA CCCGAACAGC GTCGAGGTGC GCTGGCAGGA CCACGTGCTG CGCGGCTGCG GTGGCGATCC CGCGCAACTG CTGCAGGGTG AGCCCTGGGC GGTGGTGGAG CTCGACGGCC GCGCGGTGGC CGATCCGGCG CGCGTGACGC TCGCCTTCGC GGCCGACGGC AGGCTTGCCG GCCTGGCCGC ATGCAACCGC TACTTCGGCA GCTATGTGCT GTCGGGCGAG GGCCTGCGGC TGTCGCCGCT GGGGGCGACC AGGATGGCCT GCGAGCCGCG TGCGATGGAA GACGAGCAGC GCTTCATGGC GGCCGCCGCG CGGGTGACGG GCTTTGCGAT CGCAGGTGAC GGCGGGCTGG AGCTGCGTGC GGGCGATCGG GTGGTGATGC GCGCGCGGCG CATGGGCGAG CGGTGA
|
Protein sequence | MTDRLSLSSS LAMKISSLLG LGLLAGLVGG CSLFRPAAPA LAAEASPQAV AERPADPTPA LDPQPPLEFR CGERRVELRH FGELATLGVD GKVFTLRPER SASGMRMVSV DDERTGIWVK GNSARLSLAG VEQPECRRVA EKPLFRAVGN EPAWRLDIGA HGLELIADGG ASRAFAAAPL IVEAAGMRSY QGTSVGGELE ALVFERLCVD TMSGMTHPNS VEVRWQDHVL RGCGGDPAQL LQGEPWAVVE LDGRAVADPA RVTLAFAADG RLAGLAACNR YFGSYVLSGE GLRLSPLGAT RMACEPRAME DEQRFMAAAA RVTGFAIAGD GGLELRAGDR VVMRARRMGE R
|
| |