Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3765 |
Symbol | |
ID | 7873762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4151113 |
End bp | 4152063 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643700709 |
Product | protein of unknown function DUF1022 |
Protein accession | YP_002890733 |
Protein GI | 237654419 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3660] Predicted nucleoside-diphosphate-sugar epimerase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTGCG GTCAAAGCCT GAAACCGATC ACCGGCATCT GCTCCGACAT GACCATCCTC TGGCTCGTCA CCGACAACAA GCCCGGCCAC CGCAGCCAGC TGCAGGGCCT GGCGCAGGCG CTGGCCGCAC GAACCCCGAT CGACACGCGC TGGATCGACG CCCCATCCGG GCGTGCGGCG CTCTGGCAGT GGGTCGGCGG ACGCTTCCCG CCCGGCGCCG GCCTGCCCGA TCCGGACCTC ATCCTCGTCG CCGGCCATCG CACCCACCTC GCGGGCCTGG CGGCGCGGCG CGCGCGTGGC GGCAAGCTGG TCGCGCTGAT GCGCCCGAGC CTGCCGCTGG CCTGGTTCGA CCTGTGCGTG ATCCCGCAGC ACGACGCCCC GCCCGCGCGC GCCAACGTCA TCGCCACCCG CGGCGTGCTC AACACCGCGC GCCCGAGCGC CGAGCGCGCG GCCGACCGCG GCCTCTTCCT GATCGGCGGG CCGTCGAAGC ACCACGGCTG GGACAGCGCC GGCCTGCTCG CCCAGCTCGA CGCCATCCTC ACCGCCACGC CGGGCATGCG CTGGACGCTG ACCACCTCGC GACGCACGCC CGCCGAGACC GAGTCCGCCC TGCTCGCCCT GCGCGAACGC GGCGTGGACG TGCGCCCGGT GCGCGACACC CCGCCCGGCT GGGCGATGGA GCAGGTCGCG CGCAGCGCGC AGGCCTGGGT ATCGGAAGAC AGCGTGTCGA TGGTGTACGA GAGCCTCACC GCCGGCGCCG CCACCGGCCT GCTCGCCGTG CCACGCCTGG GCGAGACGCG CATCGCCGCA GGCGTGGCCG CGCTGCAGCG CGAAGGCTTC GTCACCCCAT TCGTCGATTG GCAGCGCACC GGCCGCCTGA GCACGCCCCC CGAGCGCCTG GCCGAAGCCG AGCGCGTGGC CGAGGCGGTG CTGGCACGGT TCGGAGCGTG A
|
Protein sequence | MDCGQSLKPI TGICSDMTIL WLVTDNKPGH RSQLQGLAQA LAARTPIDTR WIDAPSGRAA LWQWVGGRFP PGAGLPDPDL ILVAGHRTHL AGLAARRARG GKLVALMRPS LPLAWFDLCV IPQHDAPPAR ANVIATRGVL NTARPSAERA ADRGLFLIGG PSKHHGWDSA GLLAQLDAIL TATPGMRWTL TTSRRTPAET ESALLALRER GVDVRPVRDT PPGWAMEQVA RSAQAWVSED SVSMVYESLT AGAATGLLAV PRLGETRIAA GVAALQREGF VTPFVDWQRT GRLSTPPERL AEAERVAEAV LARFGA
|
| |