Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0774 |
Symbol | |
ID | 7084165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 860335 |
End bp | 861237 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697798 |
Product | Carboxymethylenebutenolidase |
Protein accession | YP_002354440 |
Protein GI | 217969206 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0412] Dienelactone hydrolase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.12147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCCA CCCCCAGCCG TGCGCAACAC GCCACCGAAG ACGAGCGCGG CCAGTTCGAC AGCCTGCTCC CCGCCGCCCG CCTCGACCGC CGCGGCTTTC TCGCAGGCAT CGCCGCCACC GGCTTCGCGC TCGCCGTGCA GCCTGTCCAT GCCAGCACCG TGATCAGCAC GAGCGGCACC GGCCTCGCCA CGGGCACGGC CGCCATCGCC GTCGAGGGCG GCGAGTTGCC GCTCTACTAC GCGCGGCCCG CCGCAGGCGA CAGGTTGCCG GTCGTGCTCG TGGTGCAGGA GATCTTCGGC GTCCACGAAC ACATCCGCGA CGTCTGCCGC CGCCTCGCAC ACGCGGGCTA CCTCGCGATC GCGCCTGAAC TCTTCTTCCG CCAAGGCGAT CCCACCACCC AGCCCGACAT CCCCGCGATC CTGCAGAACA TCGTCGCCAA GGTGCCCGAC GCCCAGGTCA TGGGCGACCT CGACGCCTGC GCCGCATGGT CGGCGAGCCA GGGCGGCGAC CCCGCCCGCC TCGCGATCAC CGGCTTCTGC TGGGGCGGGC GCATCACCTG GCTGTACGCC GCGCACAAGC CATCGGTAAA AGCCGGCGTG GCGTGGTACG GACGCCTGAG CGGCGCCGTT TCGGAGTTCA CCCCGCAGCA CCCGCTCGAC ATCGTCGGCA GGCTGCACGC GCCGGTGCTG GGGCTCTACG GCGGCCAGGA CCAGGGCATC CCGCTCGCCG ACGTGGAGAA GATGCAGGCG GCGCTCTCCG CTGCGGGCGG CCGCAGCACC ATCCACGTCT ATCCCGACGC ACCCCACGCC TTCCACGCCG ACTACCGGCC GAGCTACCGC AAGGCCGAGG CCGAGGACGG CTGGAAGCGC GCGCTCGCGC ACCTTGGCGC GGCGCTCGGC TGA
|
Protein sequence | MQATPSRAQH ATEDERGQFD SLLPAARLDR RGFLAGIAAT GFALAVQPVH ASTVISTSGT GLATGTAAIA VEGGELPLYY ARPAAGDRLP VVLVVQEIFG VHEHIRDVCR RLAHAGYLAI APELFFRQGD PTTQPDIPAI LQNIVAKVPD AQVMGDLDAC AAWSASQGGD PARLAITGFC WGGRITWLYA AHKPSVKAGV AWYGRLSGAV SEFTPQHPLD IVGRLHAPVL GLYGGQDQGI PLADVEKMQA ALSAAGGRST IHVYPDAPHA FHADYRPSYR KAEAEDGWKR ALAHLGAALG
|
| |