Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0862 |
Symbol | |
ID | 7084719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 953611 |
End bp | 954717 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697884 |
Product | Glucan 1,3-beta-glucosidase |
Protein accession | YP_002354525 |
Protein GI | 217969291 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.168056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACGA CGCGTACCAA GCTGCGCGGG GTAAACCTGG GTAGCTGGCT GCTGCTGGAG AAGTGGATGG TGCCCAGCCT GTTCGAGGGC CTGGAGGCGA CCGACGAGAC CACCTGGTGC GCCGAGCTCG GCCCCGCCGC CACCGAGCGC CTGCGCCGGC ACTGGAACAC TTTCGTCACC CGCGAGGACT TCGCCTGGAT CGCCGCACGC GGCCTCAACG CGGTGCGCAT CCCGATCGGG CACTGGATCT TCGGCCCCGA CTACCCCTAC CACCCCAAGT ACGGCGCCCA CCGCCACCCC TTCGTCACCG GCGGCATCGA GGTGCTCGAC CGCGCCCTCG ACTGGGCGCA GGAGTTCGGC CTGCGCGTGA TCATCGACCT GCACGCCGCG CCCGGCTGCC AGAACGGCTT CGATAACGGC GGCATCAAGG ACGTGGTGGA GTGGCATACG AAGAAGGAAT ACCTCGAGCA CTCGCTCGCC GTGCTCGAGC GCCTGGCCGA GCGCTACCGC GCGCACCCGG CGCTGCACGG CATCGAGCTG CTCAACGAGC CGCGCTGGGA CGTGCCGACC GACTACCTGA AGTCCTACTA CCTCGAGGCC TACGCGCGCA TCCGCAAGCA CTGCGCGCCG GAGACGGTGG CGGTGGTGTT CCACGACGGC TTCCGCAGCT TCCGCGAGTA CCTCGGCTTC ATGCAGGCGC CGGCCTTCCG CAACGTGGTG TTCGACTATC ACCGCTACCA GTGCTTCGAG CGCTGTGACA TCGACATGGA CATCCACGGC CACATCCGCA AGGCGGCGGT GGACTGGCGC GAGGAAGCCG ACGCGATCAA CGCCGAGCTC GGCCTGCCGG CGGTGTGCGG CGAGTGGAGC CTGGGGCTGG ACCTGAAGGT GGTGTCGCTG TGGGCCGAGG GGCCGTTCAA CCACGCCCTC GAGCACATGG ACGACTTCCA GCAGGACGTG GCCAGCCGCG CCTACGGCGA CAGCCAGCTG ATGACCTTCG AACGCCTGGC GGGCTGGTTC TTCTGGAGCT ACAAGACCGA GACCACGCCG GCCTGGTGCT TGCGCGCGTG CGTGGAGCGC GGCTGGCTGC CCTCACGCTT CGGGTAG
|
Protein sequence | METTRTKLRG VNLGSWLLLE KWMVPSLFEG LEATDETTWC AELGPAATER LRRHWNTFVT REDFAWIAAR GLNAVRIPIG HWIFGPDYPY HPKYGAHRHP FVTGGIEVLD RALDWAQEFG LRVIIDLHAA PGCQNGFDNG GIKDVVEWHT KKEYLEHSLA VLERLAERYR AHPALHGIEL LNEPRWDVPT DYLKSYYLEA YARIRKHCAP ETVAVVFHDG FRSFREYLGF MQAPAFRNVV FDYHRYQCFE RCDIDMDIHG HIRKAAVDWR EEADAINAEL GLPAVCGEWS LGLDLKVVSL WAEGPFNHAL EHMDDFQQDV ASRAYGDSQL MTFERLAGWF FWSYKTETTP AWCLRACVER GWLPSRFG
|
| |