Gene Tmz1t_1906 details

Gene Information

Locus tag: Tmz1t_1906
Symbol:
ID: 7085675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2148979
End bp: 2150250
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 68%
IMG OID: 643698931
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_002355553
Protein GI: 217970319
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
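
The start and end coordinates, gene length, and protein length in this record agree with one another, and the arithmetic can be checked with a minimal Python sketch. The function names are hypothetical. The sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp 2148979, End bp 2150250.
length_bp = gene_length(2148979, 2150250)
print(length_bp, protein_length(length_bp))  # prints: 1272 423
```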


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTCG AGACCATCGC CGTGCACGGC GGCTACTCGC CCGATCCGAC CACCAAGGCC 
GTCGCGGTGC CGATCTACCA GACCACCTCG TACGCCTTCG ACGACACCCA GCACGGCGCG
GACCTCTTCG ACCTCAAGGT GCAGGGCAAC ATCTACACCC GCATCATGAA CCCGACCACC
GCGGTGCTCG AGCAGCGCGT GGCCCAGCTC GAAGGCGGCA TCGGCGCGCT CGCCGTGGCC
TCGGGCATGT CGGCGATCAC CTACGCCATC CAGACCATCG CCGAAGCCGG CGACAACATC
GTCTCGGCGT CGACGCTGTA CGGTGGCACC TACAACCTGT TCGCGCATAC CTTCCCGCAG
TTCGGCATCG AGGTGCGCTT CGCCGACTAC CGCGACCCCG ACAGCTTCGC CGCGCTCATC
GACGCGCGTA CCAAGGCGAT CTACTGCGAA TCGGTGGGCA ACCCGCTCGG CAACGTCACC
GACATCGGCC GTCTCGCCGA GATCGCGCAC AAGGCCGGCG TGCCGCTGAT CGTGGACAAC
ACCGTGCCCT CGCCCTACCT GTGCCGCCCC TTCGAGCACG GCGCCGACAT CGTGGTGCAC
GCGCTCACCA AGTACCTCGG CGGCCACGGC AACTCGATCG GCGGCGTGAT CGTCGACTCG
GGCAAGTTCC CCTGGGCCGA GCACAAGGCG CGCTTCAAGC GCCTCAACGA GCCCGACGTC
TCCTACCACG GCGTGTGCTA CACCGAGGCT CTGGGTGCAG CCGCCTTCAT CGGCCGCGCG
CGCGTGGTGC CGCTGCGCAA CACCGGCGCG GCGATCTCGC CCTTCAACAG CTTCCTGATC
CTGCAGGGCA TCGAGACCCT GGCGCTGCGC ATGGACCGCA TCTGCACCAA CACGATCAAG
GTCGCCGAGT ACCTGAAGAA GCACGCCAAG GTCGAGTGGG TGAACTACGC CGGCCTGCCC
GACCACGCCG ACCACGCCCT GGTGCAGAAG TACATGGGCG GGCGCGCCTC GGGCATCCTG
TCCTTCGGGG TCAAGGGCGG CTTTGAAGCC GGCGGCCGCT TCCAGGATGC GCTGAAGCTC
ATCACCCGCC TGGTGAACAT CGGCGACGCC AAGTCGCTCG CCTGCCACCC GGCCTCCACC
ACCCACCGCC AGCTCTCGCC GGCCGAACTC GCCAAGGCCG GCGTGTCGCC CGACATGGTG
CGACTGTCGA TCGGCATCGA GCACATCGAC GACATCGTTG CCGATCTGGA GCAGGCGCTG
GCGGCGGTCT GA
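
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before computing its length or GC content. The following is a minimal Python sketch with hypothetical helper names. It is shown on the first three blocks only, and the full sequence can be pasted in the same way to compare against the listed gene length and GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAAGCTCG AGACCATCGC CGTGCACGGC"
print(len(clean_sequence(first_blocks)))  # prints: 30
print(f"{gc_fraction(first_blocks):.0%}")
```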
 
Protein sequence
MKLETIAVHG GYSPDPTTKA VAVPIYQTTS YAFDDTQHGA DLFDLKVQGN IYTRIMNPTT 
AVLEQRVAQL EGGIGALAVA SGMSAITYAI QTIAEAGDNI VSASTLYGGT YNLFAHTFPQ
FGIEVRFADY RDPDSFAALI DARTKAIYCE SVGNPLGNVT DIGRLAEIAH KAGVPLIVDN
TVPSPYLCRP FEHGADIVVH ALTKYLGGHG NSIGGVIVDS GKFPWAEHKA RFKRLNEPDV
SYHGVCYTEA LGAAAFIGRA RVVPLRNTGA AISPFNSFLI LQGIETLALR MDRICTNTIK
VAEYLKKHAK VEWVNYAGLP DHADHALVQK YMGGRASGIL SFGVKGGFEA GGRFQDALKL
ITRLVNIGDA KSLACHPAST THRQLSPAEL AKAGVSPDMV RLSIGIEHID DIVADLEQAL
AAV
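
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration in Python with hypothetical names, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G), third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a blocked DNA sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence from the record.
print(translate("ATGAAGCTCG AGACCATCGC CGTGCACGGC"))  # prints: MKLETIAVHG
print(translate("GCGGCGGTCT GA"))  # prints: AAV
```

Both outputs match the first block and the last three residues of the protein sequence listed above.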