Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2976 |
Symbol | |
ID | 7874366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3223418 |
End bp | 3224455 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699897 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002889952 |
Protein GI | 237653638 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACAC CCGCTTGCTC GCCCTTGCCG CCCGCGGCAC CCGATGCGGC CCCGGTATCG GCGGGCCTGC CCGGGCCTGT CCAGGTGGGC GACGTGAGCG ATTTCGGCAG GGGGCTGCTT GCCTGGAATG CGACCTACGA GCAGCTCGCA CCGGGCTGCT TCGAGGGCGG CGCGGAGGAG TTGTGGATCG ACGACGGCCT CGAGGTCCTG TGGGAGACGG GCAACCGCTC GGTGTGGACC GCAGGCAGCA ATCGGGCGGG CATGGTGTCG CTCGGCATTC CGGTGGCAGG CGGCGGCGCC GGCATGTATT GCGGCGTTCC GATCCAGGAC GGGGCGGTGT CGTGGCTTCC GGGCGGCGGA GATTTCGAGA TCTTCTCCCG CGAGCGCATG GATATCGTCT CGGCGACGGT GTCCGAATCC CTGCTGTGCG GATTTGCCGC CAGCGACTCG CCGCGGACCG CGGAGGCATC GCTGCAGCGC CCGTTTCTCC AGCAGCAGCC GAGGCGTGCC GGGATGTTGC GCCGCGCGCT GATCGAGATC GTCGTCGTGG CCCGCCGCCA GCCCGGCCTG CTGGAGATTC CGGCGAGCAG GGCGGCGATG CGCGACTGCG TCCTGTCGCT GATGCTCGAT ACGCTGGATC GGGCCTCGGA GCGCCCGACG GCGGACCTGC GTCCGAGCGT GAAGGCCTGG ATCGTGCGCA AGGTGCGCGA GCTGGCGCTC GAGCGCCCGA GCGAGCCCCT CCAGATCGCC GACATCTGCC GCAGCCTCGC GGTGTCGCGG CGCTCGCTGC AGTACGCGTT CGAGGACCTC GTCGGGATGG GGGCGGTGGA GTTCCTGCGC AACGTGCGCC TGAACGCCGT GCGGCGCGAG CTCCGTGTTG CGACCGGCAG CCCCGATGAA CCGATCGCCG CGATCGCGGC GCGCTGGGGC TTCTGGCACA TGCCGCGGTT CGCGGCGTAC TACCGGGCCC TGTTCGGAGA GCTGCCCTCG GAGACGCGTC GGCGAGAGGG TGGCAGCACG CCGCACGGTC CCGGTTGA
|
Protein sequence | MPTPACSPLP PAAPDAAPVS AGLPGPVQVG DVSDFGRGLL AWNATYEQLA PGCFEGGAEE LWIDDGLEVL WETGNRSVWT AGSNRAGMVS LGIPVAGGGA GMYCGVPIQD GAVSWLPGGG DFEIFSRERM DIVSATVSES LLCGFAASDS PRTAEASLQR PFLQQQPRRA GMLRRALIEI VVVARRQPGL LEIPASRAAM RDCVLSLMLD TLDRASERPT ADLRPSVKAW IVRKVRELAL ERPSEPLQIA DICRSLAVSR RSLQYAFEDL VGMGAVEFLR NVRLNAVRRE LRVATGSPDE PIAAIAARWG FWHMPRFAAY YRALFGELPS ETRRREGGST PHGPG
|
| |