Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3100 |
Symbol | |
ID | 7874570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3356661 |
End bp | 3357632 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700023 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002890075 |
Protein GI | 237653761 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.605914 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCATCC GCTTCCTCAA CGGCCAGAGC AATGTCTTCG CCGCCGCCAA CCCCTTCGAG GTCTCCGAGT ACGTGCGCGC CAACGTGGGT TCGCACAGCC TGCGCCTGCC GCGCGCCAGC GACGCCAGCG CCTCGCTCAG CCACCGCCGC GCCGGCACGC TGGATCTTTG CCGTCTCAGC TACGGCGCGC AGGCCCGCGT GCTGTCCGAG AGCCTCGGCG ACATCTACCA CGCCCAGTTC ATCCTGCAAG GCTATTGCAG CTACACGCTC GCCAACCGCA CGCTCGACCT GCCCGCCGGC CACGTGCTGG TGCTCAACCC GGACGAGCCG GTGGACCTCA CCTACTCGGA CAACTGCGAG AAGTTCATCG TCCGCATCCC CTCGGCGATG CTCGACGACG CCTGCACCGA GCACCGCTGG TTCAAGCCCA ACGAGCGCAT CAAGTTCAGC CCCGAGCCGC AGCGTTTCGA GGACATCGAC AGCCTGCTGC TGCTGTTGCG CCTGCTCTGC GAGGAGGCCG AATCCGAGCT GGCGACGCCG CAGATGCTGC AGCACTACTG TCGCGTGGTC ACCACCAAGC TGATGGTGAT GCTCAAGCAC AACGTCAGCA TGGTCGCCCC CACCCGGCAC GCGCCCAGCT TCGAGCGCCT GGTGAACTAC ATCGAGCGCA ACATCAAGCT CGATCTCAGC GCCGAGGATC TCGCCCACTA CGCCGGGCTG AGCCTGCGCT CGCTCTACCT GCTGTTCGAG AAGAACGTGA AGACGACGCC GAAGAACTTC GTGCGCCAGA AGAAGCTCGA GAAGGTGCAT TCGATCCTGA GCGACCCGGG CCAGGCCTGT CCGAACGTCA CCGCGGTCGC GCTCGAGTAC GGCTTCTCGC ACCTGGGCCG CTTCTCCGAA CTTTACAAAT CCACCTACGG CGTGCTGCCC TCGCAGTCGA TCCGCTGCCG CCAGCCCCAG GCCGGGCGCT GA
|
Protein sequence | MPIRFLNGQS NVFAAANPFE VSEYVRANVG SHSLRLPRAS DASASLSHRR AGTLDLCRLS YGAQARVLSE SLGDIYHAQF ILQGYCSYTL ANRTLDLPAG HVLVLNPDEP VDLTYSDNCE KFIVRIPSAM LDDACTEHRW FKPNERIKFS PEPQRFEDID SLLLLLRLLC EEAESELATP QMLQHYCRVV TTKLMVMLKH NVSMVAPTRH APSFERLVNY IERNIKLDLS AEDLAHYAGL SLRSLYLLFE KNVKTTPKNF VRQKKLEKVH SILSDPGQAC PNVTAVALEY GFSHLGRFSE LYKSTYGVLP SQSIRCRQPQ AGR
|
| |