Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0642 |
Symbol | |
ID | 7084580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 727333 |
End bp | 728484 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697668 |
Product | transglutaminase domain protein |
Protein accession | YP_002354310 |
Protein GI | 217969076 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGCC GCAACTTCCT GCGCGCCGGC ACGCTCGCGG CCGGCCTCGC GCTGCCCGCG CTCGCGCGCG CCCAACAATC CGCCGCGACC ACGGCGACCA CGCCGCTCGC CACCCCCGCC CCGCTCGCCG AGCCCCCCGC CTTCGCGCCC ACGCCGGACG CCGGCTGGCG CCGCTTCGAG TTGACGAGCC GCGTCGAACC CCTGCCCGGC GACGGTCCGC TGCGCGTGTG GGTGCCGCTG CCGGCGATGC ACGAGACCGC CTGGCAGCGC CCGATGGGCA GCCTGTGGCA GGGCAACGCC GACCTCATGG AACGTGTGCG CGACCGCTCC GGCGCCGAGA TGGTCTACGC CGAGTGGGCG CCGGGCATCG CCCAGCCCCG GCTCGAGATC CTCAGCCGCT TCGCCACCCG CGACCGCGCC ATCGACTTCA GCCAGCCCAG CGCCGCGCCC CAGGTACTGC CGCGCGCCGA ACGCATGCAC TACCTGCGCG CCACCGCGCT GCTGCCCACC GACGGCATCG TGCGCGATAC CGCACGCGAC ATCGTCCACG GCGCGAAGAC CGACGAGGAC AAGGCGCGCG CCATCTACGA ATGGATCGTG GACAACACCT TCCGCGAGCC CAAGGTGCGC GGCTGCGGCA TCGGCGACAT CCGCACGATG CTGGAGACCG GCAACCTCGC GGGCAAGTGC GCCGACCTCA ACGCCCTCTT CGTCGGCCTG GCGCGCGCAG CCGGGCTGCC GGCACGCGAC GTCTACGGCC TGCGCGTCGC CGACTCGCGC TTCGGCTACA AGAGCCTGGG CAAGAGCGGC AACGTCTCCA AGGCCCAGCA CTGCCGAGCC GAGGTCTTCC TGGAGCGCTT CGGCTGGGTG CCGGTGGACC CCGCGGACGT GCGCAAGGTC GTCCTGGAGG AACCGCCGGG CAAGCTCTCC ATGGTCGATC CCAAGGTCGC CGCGGTGCGC AAGCAGCTCT TCGGCGCCTG GGAGATGAAC TGGCTGGCCT ACAACGACGC CCACGACCTG CGCCTGCCCG GCAGCACCGG CAGCGAGATC CCCTTCCTGA TGTACCCCCA GGGCGAACTC GCCGGCCAGC GCTTCGACAG CCTGGACCCC GACGCCTTCA GCTACACGCT GAGCGCACGG GAAATTGCCT GA
|
Protein sequence | MNRRNFLRAG TLAAGLALPA LARAQQSAAT TATTPLATPA PLAEPPAFAP TPDAGWRRFE LTSRVEPLPG DGPLRVWVPL PAMHETAWQR PMGSLWQGNA DLMERVRDRS GAEMVYAEWA PGIAQPRLEI LSRFATRDRA IDFSQPSAAP QVLPRAERMH YLRATALLPT DGIVRDTARD IVHGAKTDED KARAIYEWIV DNTFREPKVR GCGIGDIRTM LETGNLAGKC ADLNALFVGL ARAAGLPARD VYGLRVADSR FGYKSLGKSG NVSKAQHCRA EVFLERFGWV PVDPADVRKV VLEEPPGKLS MVDPKVAAVR KQLFGAWEMN WLAYNDAHDL RLPGSTGSEI PFLMYPQGEL AGQRFDSLDP DAFSYTLSAR EIA
|
| |