Gene Information |
Locus tag | Tmz1t_3137 |
Symbol | |
ID | 7874279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3394440 |
End bp | 3395372 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700065 |
Product | anaerobic benzoate catabolism transcriptional regulator |
Protein accession | YP_002890111 |
Protein GI | 237653797 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0703] Shikimate kinase |
TIGRFAM ID | |
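The length fields above follow from the coordinates by simple arithmetic. The sketch below reproduces them, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the 933 bp figure) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
START_BP = 3394440
END_BP = 3395372


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 933
print(protein_length(length_bp))  # 310
```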
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.125923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACC AGAATCCGAC CCTCGCAGCC TCCGACGCGC CTGTACCGGT CGCGCACCCG GCGGTGCGAC CCGATCCGCG CAGCGACGGG GAGTTCCTGC TCGCGCTCGG CCGGCGCGTG CGCGAGACGC GCGACCGGCG TGGCCTGACT CGCAAGCAGC TGGCTCGCGA CGCGGGGGTG TCGGAGCGGC ATCTCGCCCA TCTTGAGGCC GGCGAGGGCA ACGTCTCGAT CGTGCTGCTA CGCCATATCG CCAGCGCATT GGGGCTGTCG CTGCCGGAGA TGTTGAACCT GGGCGCCGAG GAGTCGGTCG AGCAGCGCCT GGTGCGGCGC ATCCTCGAAC AACTGCCGCG CCACCGGCTG GAGGACGTGG TCTTCCGGTT GATGCGCGAC TACGGCCAGG AAGAGGCCGC GCGGCGCAAC CGGATCGCCC TCATCGGCCT GCGCGGCGCC GGCAAGACCA CACTGGGCCG CAGGCTGGCG GCCGAACTCG GCTTCCGTTT CGTCGAGCTC GATGCCGAGA TCGAACGCGA GACCGGCATG CCCATGGGCG AGATCTTCGC CCTCTACGGC CAGTCGGGAT ATCGCCGCAT CGAACAGCGT TGCCTGCGGC GCGCACTCGA CAGCGGTGAA CGCACGGTGC TCGCCACGGG CGGCGGAATC GTCTCGCAGA CCGAGACCTA CGACCTGCTG CTGTCGCGCT GTCTGACGAT CTGGCTCCGC ACCTCGCCCG AGGAGCACAT GCAGCGCGTC AGCGCGCAGG GCGACCTGCG TCCGATGGCC GGTAGCGCGG AAGCGATGGA AGACCTGCGC CGCATCCTCG CCGCGCGTGA ACCGCAGTAC CTGCGCGCCG ACCATGTGGT CGATACCTCG GGCCAGGGCG AGGACGAGAG CTTCCGCGCG CTGCGCGCCG CCGTGGAGGA CCGCGGCCGC TGA
|
Protein sequence | MNDQNPTLAA SDAPVPVAHP AVRPDPRSDG EFLLALGRRV RETRDRRGLT RKQLARDAGV SERHLAHLEA GEGNVSIVLL RHIASALGLS LPEMLNLGAE ESVEQRLVRR ILEQLPRHRL EDVVFRLMRD YGQEEAARRN RIALIGLRGA GKTTLGRRLA AELGFRFVEL DAEIERETGM PMGEIFALYG QSGYRRIEQR CLRRALDSGE RTVLATGGGI VSQTETYDLL LSRCLTIWLR TSPEEHMQRV SAQGDLRPMA GSAEAMEDLR RILAAREPQY LRADHVVDTS GQGEDESFRA LRAAVEDRGR
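The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a string could be cleaned, checked for GC content, and translated, using only the Python standard library. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may act as start codons), so a single lookup string is used. All function names here are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; assumed to
# match translation table 11 for codon-to-residue assignments).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in the record above.
prefix = clean_sequence("ATGAACGACC AGAATCCGAC CCTCGCAGCC")
print(translate(prefix))  # MNDQNPTLAA
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would allow the reported GC content to be checked in the same way.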