Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1906 |
Symbol | |
ID | 7085675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2148979 |
End bp | 2150250 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698931 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_002355553 |
Protein GI | 217970319 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTCG AGACCATCGC CGTGCACGGC GGCTACTCGC CCGATCCGAC CACCAAGGCC GTCGCGGTGC CGATCTACCA GACCACCTCG TACGCCTTCG ACGACACCCA GCACGGCGCG GACCTCTTCG ACCTCAAGGT GCAGGGCAAC ATCTACACCC GCATCATGAA CCCGACCACC GCGGTGCTCG AGCAGCGCGT GGCCCAGCTC GAAGGCGGCA TCGGCGCGCT CGCCGTGGCC TCGGGCATGT CGGCGATCAC CTACGCCATC CAGACCATCG CCGAAGCCGG CGACAACATC GTCTCGGCGT CGACGCTGTA CGGTGGCACC TACAACCTGT TCGCGCATAC CTTCCCGCAG TTCGGCATCG AGGTGCGCTT CGCCGACTAC CGCGACCCCG ACAGCTTCGC CGCGCTCATC GACGCGCGTA CCAAGGCGAT CTACTGCGAA TCGGTGGGCA ACCCGCTCGG CAACGTCACC GACATCGGCC GTCTCGCCGA GATCGCGCAC AAGGCCGGCG TGCCGCTGAT CGTGGACAAC ACCGTGCCCT CGCCCTACCT GTGCCGCCCC TTCGAGCACG GCGCCGACAT CGTGGTGCAC GCGCTCACCA AGTACCTCGG CGGCCACGGC AACTCGATCG GCGGCGTGAT CGTCGACTCG GGCAAGTTCC CCTGGGCCGA GCACAAGGCG CGCTTCAAGC GCCTCAACGA GCCCGACGTC TCCTACCACG GCGTGTGCTA CACCGAGGCT CTGGGTGCAG CCGCCTTCAT CGGCCGCGCG CGCGTGGTGC CGCTGCGCAA CACCGGCGCG GCGATCTCGC CCTTCAACAG CTTCCTGATC CTGCAGGGCA TCGAGACCCT GGCGCTGCGC ATGGACCGCA TCTGCACCAA CACGATCAAG GTCGCCGAGT ACCTGAAGAA GCACGCCAAG GTCGAGTGGG TGAACTACGC CGGCCTGCCC GACCACGCCG ACCACGCCCT GGTGCAGAAG TACATGGGCG GGCGCGCCTC GGGCATCCTG TCCTTCGGGG TCAAGGGCGG CTTTGAAGCC GGCGGCCGCT TCCAGGATGC GCTGAAGCTC ATCACCCGCC TGGTGAACAT CGGCGACGCC AAGTCGCTCG CCTGCCACCC GGCCTCCACC ACCCACCGCC AGCTCTCGCC GGCCGAACTC GCCAAGGCCG GCGTGTCGCC CGACATGGTG CGACTGTCGA TCGGCATCGA GCACATCGAC GACATCGTTG CCGATCTGGA GCAGGCGCTG GCGGCGGTCT GA
|
Protein sequence | MKLETIAVHG GYSPDPTTKA VAVPIYQTTS YAFDDTQHGA DLFDLKVQGN IYTRIMNPTT AVLEQRVAQL EGGIGALAVA SGMSAITYAI QTIAEAGDNI VSASTLYGGT YNLFAHTFPQ FGIEVRFADY RDPDSFAALI DARTKAIYCE SVGNPLGNVT DIGRLAEIAH KAGVPLIVDN TVPSPYLCRP FEHGADIVVH ALTKYLGGHG NSIGGVIVDS GKFPWAEHKA RFKRLNEPDV SYHGVCYTEA LGAAAFIGRA RVVPLRNTGA AISPFNSFLI LQGIETLALR MDRICTNTIK VAEYLKKHAK VEWVNYAGLP DHADHALVQK YMGGRASGIL SFGVKGGFEA GGRFQDALKL ITRLVNIGDA KSLACHPAST THRQLSPAEL AKAGVSPDMV RLSIGIEHID DIVADLEQAL AAV
|
| |