Gene Tmz1t_3253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3253 
Symbol 
ID7874474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3558304 
End bp3561501 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content71% 
IMG OID643700187 
Productsulfatase 
Protein accessionYP_002890225 
Protein GI237653911 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.189767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTCG CCACCGCCGC CCGCAGCCTC ACCATCCTGA TCTGCACGCA CAACCGCGCG 
GACCTGCTCG CACGCGTCAT CGATTCGCTC GAGTCCGCCC GCCAGCCTGC CGGCTGGAGT
GTCCGCCTGT TCGTCGTGGC CAACGCCTGC ACCGACGGCA CGCACGAGTT CCTGGCCGGG
CGCAGCGACC GCGCGGATCG GCTGGCGCTG TCGTGGATCG CGGAGCCCAC GCCGGGAAAG
TCGCATGCGC TCAACCGCGC CCTGCCCTTG CTGGAGGACG AGCTGGTCGC CTTCGTCGAC
GACGACCACC GCGTGGACGC CGACTACCTG CTCGGGGTCA CCGCCGCCGC GGAGCGCTGG
CCCGAGGCCG ATCTCTTCTG CGGGCGGATC GTGCCCGACT GGGACGGCAG CGAGCCGGCC
TGGGTGCACG ACGAAGGCCC CTATCGCATC TACCCGCTGC CGGTCCCGCG CTACGACCAG
GGCATGGAGG ACTTCCCCGT CGACCTCGAG GGGCCGATCC CGGGAGGCGG CAATCTCGTC
GCCCGCCTGC CGGTGATCGG TGCGACGGGA CCGTTCGCGA TCGAACTCGG GCCGACCGGA
CACGATCTCG GCGGATCGGA GGACGCCGAC TGGATCCTGC GCGCCTTGCG CAAGGGGGCC
CGCCTGCATT ACGCACCGCA GATCCTCCAG TACCACTATG TCGATAGCGA ACGCCTGACC
TTGTCCTACG TCGCACGCAA GGGCTACCAG CGCTCGCAAT CGGTCACGCG GGTGCGCGCG
GAGTTCGACC GGGTGCCGCG CTACATGTGG CGCAAGGCCG CGGGCTACGC CCTCGGGCTG
GCGTTCTCGT GGCGGCTGCA GGCACGGCGC TTCTATCTGG TCCGCCTGGC TTCCTCGATG
GGCGAGATCT CCGGCATCCG CGACCGCCAC CGGCGCAAGC GCAGGCGTGC GGCACTGCCG
ATGCTCGGGG CGGAAGCGGG ATCGGCGGCC TTGCTGGTGG CCGCCGCGCT CACGCTCGCG
CTGGCGACGG TGCTCGCGCG CCACTGGCTG GGCGAAGCCC TGCTCGGCGC CGGCGTGGTG
GGCGCGCTGA CGAGCGCGGT GCTGGTAGCC AAGTCGGTGC GGGACTTCTC GCGCACCGGG
CCGGGCCTGC GCGAAGAGAT CCTCGCGCGC TATCGCGGCT ACGTCGTCTT CGCCATCGCC
CGCCTGGGGC TCGCGGCCCT CGGCCTCGCC GCGTTCTGGG CCTTCCCGGG CACCGCGCTG
TGGATCACTG CGGCGGAAGC GCTCGGTCGG GAACCGCCGC TGTGGACGAC TGCCGCGGGC
GGCGCCCTCA CCCTCGCGCT CGCGACGGTG TATGCCGGCT GCCGAGCACT TTCGCAAAAC
CCGGGCCTGG TGATCGCGTC CTGGCAATAT CGTACCGTCC GCATCCATCG CCTGTGGCGC
GCGCTGTCGC AGCGCGGTCT CGACCTCATC GCGCGCATCG TGCTGGCGAC CGGAATCGGC
CTGGTCGGGG CGATCGCGTT GCTGCGCTAC CACCAGGGCG GCAGCGCGGA CGCGGGGGCC
ATGCTGCTCG TTACCTGCGG CTACATCGCG CTGCTCGCCT GGGCGATCTG GGAGCCCGAC
GGCACACACG CCCCCACCCC GCGACGGCGC GCGCGGCGCA ACGTGCTGAT GATCGGCAGC
GATACGCTCC GGGCCGACCG CATCGGCGCG CAACGCGAAG GCGCGTCGAT CACCCCGAAC
ATCGACGCGC TGGCCGCACG AGGCACCCGC TTCGGCGCCT GCTACGTCCC CTGCGCGCGC
ACCGCACCGA GCCTGATCTC GCTGTTCACC GCGACCTGGC CGCACCACCA CGGGGTGCGC
GACAACTACG TGGCCGGCGC CGAGACGCGG CTCGAAGACA AGACCCTGCC GAACATACTG
CGCGCACTGG GCTATCGCAC CGCGGCGGTT TCCGACTGGT GCGGCGCCGA TCTGGGCAAG
TTCGATTTCG GCTTCGACAT CACCGATCTG CCCGAGGATC AGTGGAACCT GAAGTACCTC
ATCCGCCAGG GCCCGAAGGA CATCCGCCTG TTCCTGTCGC TGTTCCTGCA CAATCGTCTC
GGCCGTCACC TTCTTCCCGA GATTCACTAC CTCGGCGGCG TGCCCCAGAC CGGCATGCTC
GGACGCCGCG CGCGACGCAC GCTGTCGCGC CTGGCTGCGG GTGACGAACC CTTCCTGCTC
AACCTCTTCT ATTCGACCAC GCATCCGCCT TTCGCATCGG AGCACCCCTA CTACACCCGG
TTCTCCGATC CTGCCTATTC CGGCGCGTCC AAGTTCGCCA TGGCCCGGCT GACCGAACCG
TTCGAGATCA TCCGCCGCCA GGGCGAGCCG CGCGAGGAGT TCGACCTCGA CCAGATCCTC
GATCTCTACG ATGGCTGCGT GGCCCAGTTC GACGACGAGG TCGGGCGCCT GCTGCGCCAG
CTCGACGACA GCGGCCTCGC GGAGGACACC ATCGTGGTGC TCTACAGCGA CCACGGCATG
GAGTTCTTCG AGCACGGCAC CTGGGGCCAG GGCAACTCCG CCCTGGGCGA CTTCAGCGCG
CGCGTGCCGC TGATCGTCGT CGACCCGGCC CGACCGGGCG GCCAGCGCGT CGACCAGGTG
GTGCGCAGCG TGGACATCAT GCCGACCCTG CTCGATCTGC TCGGTGCGCC CTCGGTCGGC
TGCGACGGGG TGTCGCTGCG CCCGGCGATC GCCGATCCGG CGACCGACCT GCACCTGCGC
GCGTTCAACG AGACCGGGAT CTGGATCGCA CCGGTCCCCG GTCTACCCGA AGGACACCTG
AGCTATCCCA ACCTGCTGGA ACTGCTCGAC GTTCCGGACA TCGCGGCGGG CAGCCTGTCG
CTGCGCGAGC GCTACCGCCA GACGGTACTG GTCGCCAAGG ACCGCATGGT GCGGGACGGG
CGCTGGAAGC TCGTCTACCA GCCGCTCGAG CACGGCAGGC TTCTCAGCCT GTACGACGTC
GAGAGCGACC CCGGGTGCAC CGCGGACGTC GCGTCCCGGC ATCCCGCAGA GGTCGAGCGC
CTGTGGGCGC AGCTGCGCGC CTGGATGGCG AACGACCCGG CGCTGCGCGG AGATCCCCGC
CTGGACCTGC CGCCGACGCC CGCAGCGACC TCAGCCGCCC GCGCGCCGGA AGCGGATCTC
GCGCCCGAGA TGCGGTAG
 
Protein sequence
MNVATAARSL TILICTHNRA DLLARVIDSL ESARQPAGWS VRLFVVANAC TDGTHEFLAG 
RSDRADRLAL SWIAEPTPGK SHALNRALPL LEDELVAFVD DDHRVDADYL LGVTAAAERW
PEADLFCGRI VPDWDGSEPA WVHDEGPYRI YPLPVPRYDQ GMEDFPVDLE GPIPGGGNLV
ARLPVIGATG PFAIELGPTG HDLGGSEDAD WILRALRKGA RLHYAPQILQ YHYVDSERLT
LSYVARKGYQ RSQSVTRVRA EFDRVPRYMW RKAAGYALGL AFSWRLQARR FYLVRLASSM
GEISGIRDRH RRKRRRAALP MLGAEAGSAA LLVAAALTLA LATVLARHWL GEALLGAGVV
GALTSAVLVA KSVRDFSRTG PGLREEILAR YRGYVVFAIA RLGLAALGLA AFWAFPGTAL
WITAAEALGR EPPLWTTAAG GALTLALATV YAGCRALSQN PGLVIASWQY RTVRIHRLWR
ALSQRGLDLI ARIVLATGIG LVGAIALLRY HQGGSADAGA MLLVTCGYIA LLAWAIWEPD
GTHAPTPRRR ARRNVLMIGS DTLRADRIGA QREGASITPN IDALAARGTR FGACYVPCAR
TAPSLISLFT ATWPHHHGVR DNYVAGAETR LEDKTLPNIL RALGYRTAAV SDWCGADLGK
FDFGFDITDL PEDQWNLKYL IRQGPKDIRL FLSLFLHNRL GRHLLPEIHY LGGVPQTGML
GRRARRTLSR LAAGDEPFLL NLFYSTTHPP FASEHPYYTR FSDPAYSGAS KFAMARLTEP
FEIIRRQGEP REEFDLDQIL DLYDGCVAQF DDEVGRLLRQ LDDSGLAEDT IVVLYSDHGM
EFFEHGTWGQ GNSALGDFSA RVPLIVVDPA RPGGQRVDQV VRSVDIMPTL LDLLGAPSVG
CDGVSLRPAI ADPATDLHLR AFNETGIWIA PVPGLPEGHL SYPNLLELLD VPDIAAGSLS
LRERYRQTVL VAKDRMVRDG RWKLVYQPLE HGRLLSLYDV ESDPGCTADV ASRHPAEVER
LWAQLRAWMA NDPALRGDPR LDLPPTPAAT SAARAPEADL APEMR