Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3734 |
Symbol | |
ID | 7873733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4103794 |
End bp | 4105092 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700680 |
Product | hypothetical protein |
Protein accession | YP_002890704 |
Protein GI | 237654390 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGCC ATTTCGGCCC GCTGGCGACC GCGAATCCGC CCCCGCGGCC TCGGGGGCGC TGGCGAATCT TGCGCTGGAT CGTGTCATGG AATGAAGGTT TGGTTACAAT CCGACACAAC CCGGTTCCCG GCAGGGGTAT TCCCGTGCGC GAAATCCAGC TCTCCGAAGA CCTCGACGGC CGCTACGATC GGCTCTCCCG CGATCTGGGC GCGCGCGCGC TCGCCGGCCT GCTCGCCGTG TCCGGTTCGA TCCTCGTCCT GCTCGGCGGG GCGGTCGTGG GTGCCGTGGC GACCTGGGAG GCGATCGGCC GGGGCACACC GGTGGAAACG CTGTTCTGGG TGCTCTTCAT GCTCGGCAGC ATCGCCGCGA TCTGGGCCAC GCTGGCCTCC CTGGCGAGCC TCGCGCCGCG GCCCTGCGGG CTGGCGGTCG CGCGCGAACG GGTGGGCGAG CTCTACGCGC TGGTCGATGC GCTCGCGCAC GTGGCGGGGG TGAGCCCGAT CCGGCGCATC TACGTGACGG GCGAGATCAA CGCCTCGATC GTGCAGCGCC CGCGCCACCT GGGTGTCGGC GAGATGGCGA CCGATCTGCT GATCGGCCTG CCGCTGGCCC ATGCGCTCGG CCCCCGCCAG CTCGCGGCGG TGATCGCCCA CGAGATCGGC CACATGGTCG CACGCCAGCG CGGGATGCAC GGCCGGAGCG GTTGGCTGCA GGCGTGGTGG ATGCGCACGC TGGACGAGCT TGCGGCCGCG TTGCCGTCCG GCTTTCCCTG GTTCGACCCC CAGGGCGACG CCCTGTGCAT GAGGATGCTG CTGCTGGCGC AAATGGAGGA GTACGCCGCC GACCGCACCG CGGTGCGCCT GGTAGGGGCC GAGCTGCTTG CCGGGACCCT GGTCGAGCTC TGCTGCAAGG CGGACTTCCT CGCCAACGAC TACCTGCCGC GCGTGCATGC CCTGGCCGAG TGCGACGAGG CGTCTTGCGT GCGGCCCTAC CGCGAGATGG GGCACGGCTT TGCCGCCGGC TTCGCCCAAT CGAGCGCTGC CATCGATCCG CAGCGCGTGC TGCGCGCGGA CGCCGGCGAC CCCTTCCACC CCGCGCTGCA GGATCGCCTC GTGGCGATCG GCGTCGCCAT GCCCGAGCGG CTGCAGGTGT CGGGTCCTTC TGCGGCGCAG CATTTCTTCG GCGACAGCCT GCCTTTCCTG GCCTGGCATT TCGACCGGAT CTGGTTGGAG AGCCTGCGCG ACGCTTGCCC GCCGGCGCTC AGTCCGCGCG CGCCGGAGCG GGTGCTGCAG GGGGCGTGA
|
Protein sequence | MASHFGPLAT ANPPPRPRGR WRILRWIVSW NEGLVTIRHN PVPGRGIPVR EIQLSEDLDG RYDRLSRDLG ARALAGLLAV SGSILVLLGG AVVGAVATWE AIGRGTPVET LFWVLFMLGS IAAIWATLAS LASLAPRPCG LAVARERVGE LYALVDALAH VAGVSPIRRI YVTGEINASI VQRPRHLGVG EMATDLLIGL PLAHALGPRQ LAAVIAHEIG HMVARQRGMH GRSGWLQAWW MRTLDELAAA LPSGFPWFDP QGDALCMRML LLAQMEEYAA DRTAVRLVGA ELLAGTLVEL CCKADFLAND YLPRVHALAE CDEASCVRPY REMGHGFAAG FAQSSAAIDP QRVLRADAGD PFHPALQDRL VAIGVAMPER LQVSGPSAAQ HFFGDSLPFL AWHFDRIWLE SLRDACPPAL SPRAPERVLQ GA
|
| |