Gene Information |
Locus tag | Tmz1t_3771 |
Symbol | |
ID | 7874015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4155488 |
End bp | 4156483 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700715 |
Product | protein of unknown function DUF1022 |
Protein accession | YP_002890739 |
Protein GI | 237654425 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3660] Predicted nucleoside-diphosphate-sugar epimerase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAGAAG TGAGGCCGCT CGCCATCTGG CTGCTCAGCG ACGGCGCCCC CGGGCACCTC AGCCAGAGCC GCGGCATCGT CGATGCGCTC GCCACCCGGC GGATGGTGGA AACGCGCACC ATCGAGCTGA AGATCCGCAG CCCTTTCTGG AAGCGCCTGG GCCGCCTGCT GCTGCCGCGC ATCCGCGACA CCGACACGTG GCTGCGGCGG ATCTACGGCC TCACCCCACC CGCCGGCCAG CCCGCGCTGA TCGTCTCGTC CGGCGCCAAC ACCCTGCTCG CCAACGCCCT GCTCGCGCGC CGGTACGGCG TGCCCAACGT GTATAGCGGC ACCCTCAAGG GCTACGACCC GGCCGCCTTC GCCCGCGTGT TCACCGTGGT GCCGCTCGGT GTCGCGTGCA ATCACGTGCT GCCCCTGCCG CCGGTGCCGG GCGAGCTCGC CCGTCCGCTG CCTCCCCCGA CCGGCGAGCC ACGGATCGCG GTGCTCGTCG GCGGCGATGG CGCCGGCTAT GTCTTCAAGG AAGCCGACTG GCGCGCCTTC GGCGCCGGCC TCGCCGCGCT CGCCCGCCGC GACGGCGCCC GCCTGCTGCT CACCACGTCC CGCCGTACCG GCGCTGTCGC CGAAGCCTGG CTTCGGGACA GCATCCCCGC CGAACTGATC GCCGACGCGG TGTGGTGGTC GCAGGCGCCG CGCAAGGTCA TGCGCGATTT CCTCGGCGCG GCAAGCGCGA TCGTCGTCAC CGAGGACAGC CTCAGCATGG TGGCCGAGAG CCTCTACTCC GGCCGGCCGG TGGTCTCGGT CTCGCCCGCG GTCGCCCAGC CCAACGCCAA CGACGGCGCC GCGCTGCAGG CTTATGCAGA GCGCGGCCTG CTGGTGCGCC AGCCGCTCGC GGGCCTCGCC GACATCGCAT TCCCCGACTG CAGCACGCCC GTGCCCGACG TGCAGGCCGA GATCGCCGAC GTCGTCCTCG CGCTGCTGGA GCCGACAACA CCATGA
|
Protein sequence | MEEVRPLAIW LLSDGAPGHL SQSRGIVDAL ATRRMVETRT IELKIRSPFW KRLGRLLLPR IRDTDTWLRR IYGLTPPAGQ PALIVSSGAN TLLANALLAR RYGVPNVYSG TLKGYDPAAF ARVFTVVPLG VACNHVLPLP PVPGELARPL PPPTGEPRIA VLVGGDGAGY VFKEADWRAF GAGLAALARR DGARLLLTTS RRTGAVAEAW LRDSIPAELI ADAVWWSQAP RKVMRDFLGA ASAIVVTEDS LSMVAESLYS GRPVVSVSPA VAQPNANDGA ALQAYAERGL LVRQPLAGLA DIAFPDCSTP VPDVQAEIAD VVLALLEPTT P
|
| |
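The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only: the function names are hypothetical and not part of the database, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon, both of which are consistent with the values listed above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids: codon count minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces and any other characters (such as the block separators used
    in the Gene sequence field) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length_bp = gene_length(4155488, 4156483)
print(length_bp)                  # 996
print(protein_length(length_bp))  # 331
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the listed GC content; the result depends on the database's rounding convention, which is not stated in the record.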