Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3654 |
Symbol | |
ID | 7873159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4011269 |
End bp | 4012633 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700595 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_002890624 |
Protein GI | 237654310 |
COG category | [R] General function prediction only |
COG ID | [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) |
TIGRFAM ID | [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.142399 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCCA CCCGCATCCT GCTCGACGCC GCCGACATCC CGACGCACTG GTACAACGTC GCCGCCGATC TGCCGACGCC GCTCGCGCCG CCGCTCGGCC CCGACGGCCA TCCGGCCACG CCGGAGCAGA TGGGGGTCAT CTTCCCGCCG GCGATCCTCG AACAGGAGAT GAGCACCGAG CGCTGGATCG CCATCCCGCA GGAGGTGCGC GAGATCTACC GCCTGTGGCG GCCGAGCCCG CTGTGTCGCG CGGTGCGCCT GGAGCAGGCG CTCGGCACCC CGGCGAAGAT CTTCTTCAAG TACGAGGGCG TCTCGCCCGC CGGCTCGCAC AAGCCCAACT CCGCGGTGCC GCAGGCCTTC TACAACAAGC AGGCCGGCAT CACCCGCCTG ACCACCGAGA CCGGCGCCGG GCAGTGGGGC TCGTCGATCG CCTTCGCCGG GCAGATGTTC GGGCTGGAGG TGCGCATCTA CATGGTCAAG GTCAGCTACG ACCAGAAGCC CTACCGCCGC CTGATGATGC AGACCTGGGG CGGCGAGGTC TTCGCCAGCC CCTCGCCGCA CACCGAGACC GGCCGCCGCC TGCTCGCCGA GAACCCGGAC AACCCCGGCT CGCTCGGCAT CGCCATCTCC GAGGCGGTGG AAGAGGCCGC CGGCCGCGCC GACACCAACT ACACCCTGGG TTCGGTGCTC AACCACGTGG TGCTGCACCA GAGCATCATC GGCCTGGAGG CGAAGAAGCA GCTCGACAAG GTCGGCCTCT ATCCCGACGT CGTCATCGGC CCCTGCGGCG GCGGCTCGAG CTTCGCCGGC ATCGCCTTCC CCTTCCTCGC CGACAAGGCC GCGGGCGACA AGCGCGCCGC CACGCTGCGC TGCGTGGCGG TGGAGCCGAC CTCCTGCCCG ACCCTGACCA AGGGCCAGTA CGCCTACGAC TTCGGCGACG CCTCCGGCTT CACCCCGCTG ATGAAGATGT ATACGCTGGG CCACGATTTC ATGCCGCCCG GAATCCATGC CGGCGGCCTG CGCTACCATG GCGACTCGCC GCTGGTGTCG AACCTGCTGC ACGCCGGCCT CATCGAGGCC GCCGCGGTGC CGCAGCTGGC GACCTTCGAG GCGGGGGTGC AGTTCGCACG CGCCGAGGGC ATCATCCCGG CGCCCGAGTC CTGCCACGCC ATCCGTCAGG CCATCGACGA GGCGCTCGCC TGCAAGGCGA CCGGCGAGGC GAAGACCATT CTGTTCAACC TGACCGGCCA CGGCCACTTC GACATGAGCT CGTACGAGCG CTACTTCTCG GGCAAGCTCG AGGACTTCGA CTACCCGGCC GAGGCGGTGG CAACTTCGCT CGCCCACCTG CCCAAGGTCG GCTGA
|
Protein sequence | MEATRILLDA ADIPTHWYNV AADLPTPLAP PLGPDGHPAT PEQMGVIFPP AILEQEMSTE RWIAIPQEVR EIYRLWRPSP LCRAVRLEQA LGTPAKIFFK YEGVSPAGSH KPNSAVPQAF YNKQAGITRL TTETGAGQWG SSIAFAGQMF GLEVRIYMVK VSYDQKPYRR LMMQTWGGEV FASPSPHTET GRRLLAENPD NPGSLGIAIS EAVEEAAGRA DTNYTLGSVL NHVVLHQSII GLEAKKQLDK VGLYPDVVIG PCGGGSSFAG IAFPFLADKA AGDKRAATLR CVAVEPTSCP TLTKGQYAYD FGDASGFTPL MKMYTLGHDF MPPGIHAGGL RYHGDSPLVS NLLHAGLIEA AAVPQLATFE AGVQFARAEG IIPAPESCHA IRQAIDEALA CKATGEAKTI LFNLTGHGHF DMSSYERYFS GKLEDFDYPA EAVATSLAHL PKVG
|
| |