Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0747 |
Symbol | |
ID | 7083976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 827955 |
End bp | 829001 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697772 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002354414 |
Protein GI | 217969180 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.40849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCC TCGCCCCCTC GCCCGCCGAA CTCACCCGCG GCCGCGCCGC CACGCCGATC GCCTTCATCC GTGCGATCGT TGCCGGGTAT CGGCTGCGGG GCATGGATCC GGCCAGCGCG CTGGCGCAGG CGCAGATCCC GCCCGCGCTG CTGGAAGATC CGGCCGCACA TGTCACCGCC GCGCAGATGG AGCTGCTCTC GGGCGTGGCG ATGCAGGAGC TCGACGACGA GGCGCTGGGC TGGTTCTCGC GCCGCCTGCC CTGGGGCAGC TACGGCATGC TCTGCCGCGC CTCACTGACC TCGCCGACGC TGGAGGTGGC GCTCAAGCGC TGGTGCCGTC ACCACCGCCT GCTGACCGAG GACATCGTCT TCGAGCTCGC GCAACGCGGC GGCATGGCTA CGATCCACGT CGCCGAGCAC GCCGGACTCG GCGAGCTGCG CGAGTTCTGC CTCGTCAGCA CGCTGCGCTA CCTGCTCGGC TACGCCTGCT GGCTGGTCGA TTCGCGCATC GCGCTCGGCG AGGCCGCCTT CCCCTTCCCT GCCCCGGCGC ACGCCGACGC CTACGCCTAC CTCTTCGCCG GCCCGGCGCG TTTCTCCGCC GCGGCCGCCT GCATCCGCTT CGACGCGCGC TACCTCGCCC TGCCGGTGCG CCGCGACGAG AAGGCGCTGC AGGCCATGCT GCAGCGTGCG CTGCCGCTCA CCGTGCTGCA GTATCGCCGC GACCGCCTGC TGGTGCATGG CGTCGCGCAA TTGCTCGCCG CCAACCCCGC CGCCGCCCAC ACCGCCGAGG AGGTTGCCGC ACAGCTCAAC CTGTCGGTGC GCACCCTGCA CCGACAGCTC AAGGAAGAAG GCGTGTCGCT GCAGCGCCTG AAGAACGCGG CGCGGCGGGA GCATGCGGTG AAGCTGTTGC TGCAGAGCGC CAGACCGGTG AAGCAGATCG CAGCCGCCGT CGGCTTCGAC AGCGAGAAGA GCTTCGCGCG CGCGTTCCGG GAGTGGACGG GGGTCGCGCC AAGCGCGTAC CGGTCAAACG CAGAGCCCGC CTGCTAG
|
Protein sequence | MKILAPSPAE LTRGRAATPI AFIRAIVAGY RLRGMDPASA LAQAQIPPAL LEDPAAHVTA AQMELLSGVA MQELDDEALG WFSRRLPWGS YGMLCRASLT SPTLEVALKR WCRHHRLLTE DIVFELAQRG GMATIHVAEH AGLGELREFC LVSTLRYLLG YACWLVDSRI ALGEAAFPFP APAHADAYAY LFAGPARFSA AAACIRFDAR YLALPVRRDE KALQAMLQRA LPLTVLQYRR DRLLVHGVAQ LLAANPAAAH TAEEVAAQLN LSVRTLHRQL KEEGVSLQRL KNAARREHAV KLLLQSARPV KQIAAAVGFD SEKSFARAFR EWTGVAPSAY RSNAEPAC
|
| |