Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3877 |
Symbol | |
ID | 7873528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4274933 |
End bp | 4275895 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643700819 |
Product | transglutaminase domain protein |
Protein accession | YP_002890842 |
Protein GI | 237654528 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTCT ACGAGATCCT CCACGACACC ACCTACCGCT ACGACAGTGC GGTGGCGCTG TCGCAGCAGG TGGCTCACCT GCGCCCGCGC GAATGCGCCG GCCAGCGCAC CCTCGCGCAC CACCTGGAGA TCGCGCCGGC GCCGGCCCGG CGTAGCGAGC GCAGCGACTT CTTCGGCAAC CCGGTGACCG CGTTCGCGCT GCACGCGCCG CACACGGAAC TCGTCGTGCG CGCACGCAGC CGCATCCGCG TCGACCCGCC GGCCTGGCCC GAGCCGGCGG CGAGTCCGCC CTGGGAGACC GCGCGCGACC ACCTGCGCGA CGGCCTCTCC GGCCACGCGC CGCTGCGCGC CGACGCGCGC GACGCCGGGC AGTACCGCTT CGCTTCGCCC TTCGTGGCGC TCGGCGACGA GGCGGCGCAG TACGCCGACT ATGCGCTGGC GAGCTTCACC CCGGGGCGGC CGGTGCTCGA CGCCTTGCTC GCGCTGTCGG CGCGCATCCA CGCCGATTTT CGCTTCGACC CCGCGGCGAC CAGCGTCGCC ACCCCGGTGG CGGAGGTCTT CGCGGCGCGC CGTGGGGTGT GCCAGGACTT CTCCCACCTG ATGATCGCCT GTCTGCGCGC GCTCGGCCTG GCCGCGCGCT ACGTCTCCGG CTACCTGCTC ACCGAGCCTC CGCCGGGCCA GCTGCGCCTG ATCGGCGCCG ACGCCTCGCA CGCCTGGGTG GCGCTGTGGT GCCCGGGCGC GGGCTGGATC GACATCGATC CGACCAACGA CCTGCAGCCT GGCAGCGGTC ACATCACGCT GGCCTGGGGG CGGGACTACG GCGACGTGTG TCCGCTGCGC GGCGTCATTC TCGGCGGGGG CGGGCACGGC GTCGAGGTCG CGGTGACGGT GATGCCGGTG GAGGAGGGCC CGCCCCCGCG GCGGGGCGGG CGCCCGGCGC GTGCCGCCGA CGCACCAGAT TGA
|
Protein sequence | MALYEILHDT TYRYDSAVAL SQQVAHLRPR ECAGQRTLAH HLEIAPAPAR RSERSDFFGN PVTAFALHAP HTELVVRARS RIRVDPPAWP EPAASPPWET ARDHLRDGLS GHAPLRADAR DAGQYRFASP FVALGDEAAQ YADYALASFT PGRPVLDALL ALSARIHADF RFDPAATSVA TPVAEVFAAR RGVCQDFSHL MIACLRALGL AARYVSGYLL TEPPPGQLRL IGADASHAWV ALWCPGAGWI DIDPTNDLQP GSGHITLAWG RDYGDVCPLR GVILGGGGHG VEVAVTVMPV EEGPPPRRGG RPARAADAPD
|
| |