Gene Information |
Locus tag | Tmz1t_3253 |
Symbol | |
ID | 7874474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3558304 |
End bp | 3561501 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700187 |
Product | sulfatase |
Protein accession | YP_002890225 |
Protein GI | 237653911 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
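The coordinate, length, and protein fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the numbers in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3558304
end_bp = 3561501
gene_length_bp = 3198
protein_length_aa = 1065

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not
# counted in the reported protein length.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the computed span equals the reported gene length, and the codon count minus one equals the reported protein length.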
Plasmid Coverage Information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.189767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTCG CCACCGCCGC CCGCAGCCTC ACCATCCTGA TCTGCACGCA CAACCGCGCG GACCTGCTCG CACGCGTCAT CGATTCGCTC GAGTCCGCCC GCCAGCCTGC CGGCTGGAGT GTCCGCCTGT TCGTCGTGGC CAACGCCTGC ACCGACGGCA CGCACGAGTT CCTGGCCGGG CGCAGCGACC GCGCGGATCG GCTGGCGCTG TCGTGGATCG CGGAGCCCAC GCCGGGAAAG TCGCATGCGC TCAACCGCGC CCTGCCCTTG CTGGAGGACG AGCTGGTCGC CTTCGTCGAC GACGACCACC GCGTGGACGC CGACTACCTG CTCGGGGTCA CCGCCGCCGC GGAGCGCTGG CCCGAGGCCG ATCTCTTCTG CGGGCGGATC GTGCCCGACT GGGACGGCAG CGAGCCGGCC TGGGTGCACG ACGAAGGCCC CTATCGCATC TACCCGCTGC CGGTCCCGCG CTACGACCAG GGCATGGAGG ACTTCCCCGT CGACCTCGAG GGGCCGATCC CGGGAGGCGG CAATCTCGTC GCCCGCCTGC CGGTGATCGG TGCGACGGGA CCGTTCGCGA TCGAACTCGG GCCGACCGGA CACGATCTCG GCGGATCGGA GGACGCCGAC TGGATCCTGC GCGCCTTGCG CAAGGGGGCC CGCCTGCATT ACGCACCGCA GATCCTCCAG TACCACTATG TCGATAGCGA ACGCCTGACC TTGTCCTACG TCGCACGCAA GGGCTACCAG CGCTCGCAAT CGGTCACGCG GGTGCGCGCG GAGTTCGACC GGGTGCCGCG CTACATGTGG CGCAAGGCCG CGGGCTACGC CCTCGGGCTG GCGTTCTCGT GGCGGCTGCA GGCACGGCGC TTCTATCTGG TCCGCCTGGC TTCCTCGATG GGCGAGATCT CCGGCATCCG CGACCGCCAC CGGCGCAAGC GCAGGCGTGC GGCACTGCCG ATGCTCGGGG CGGAAGCGGG ATCGGCGGCC TTGCTGGTGG CCGCCGCGCT CACGCTCGCG CTGGCGACGG TGCTCGCGCG CCACTGGCTG GGCGAAGCCC TGCTCGGCGC CGGCGTGGTG GGCGCGCTGA CGAGCGCGGT GCTGGTAGCC AAGTCGGTGC GGGACTTCTC GCGCACCGGG CCGGGCCTGC GCGAAGAGAT CCTCGCGCGC TATCGCGGCT ACGTCGTCTT CGCCATCGCC CGCCTGGGGC TCGCGGCCCT CGGCCTCGCC GCGTTCTGGG CCTTCCCGGG CACCGCGCTG TGGATCACTG CGGCGGAAGC GCTCGGTCGG GAACCGCCGC TGTGGACGAC TGCCGCGGGC GGCGCCCTCA CCCTCGCGCT CGCGACGGTG TATGCCGGCT GCCGAGCACT TTCGCAAAAC CCGGGCCTGG TGATCGCGTC CTGGCAATAT CGTACCGTCC GCATCCATCG CCTGTGGCGC GCGCTGTCGC AGCGCGGTCT CGACCTCATC GCGCGCATCG TGCTGGCGAC CGGAATCGGC CTGGTCGGGG CGATCGCGTT GCTGCGCTAC CACCAGGGCG GCAGCGCGGA CGCGGGGGCC ATGCTGCTCG TTACCTGCGG CTACATCGCG CTGCTCGCCT GGGCGATCTG GGAGCCCGAC GGCACACACG CCCCCACCCC GCGACGGCGC GCGCGGCGCA ACGTGCTGAT GATCGGCAGC GATACGCTCC GGGCCGACCG CATCGGCGCG CAACGCGAAG GCGCGTCGAT CACCCCGAAC ATCGACGCGC TGGCCGCACG AGGCACCCGC TTCGGCGCCT GCTACGTCCC CTGCGCGCGC 
ACCGCACCGA GCCTGATCTC GCTGTTCACC GCGACCTGGC CGCACCACCA CGGGGTGCGC GACAACTACG TGGCCGGCGC CGAGACGCGG CTCGAAGACA AGACCCTGCC GAACATACTG CGCGCACTGG GCTATCGCAC CGCGGCGGTT TCCGACTGGT GCGGCGCCGA TCTGGGCAAG TTCGATTTCG GCTTCGACAT CACCGATCTG CCCGAGGATC AGTGGAACCT GAAGTACCTC ATCCGCCAGG GCCCGAAGGA CATCCGCCTG TTCCTGTCGC TGTTCCTGCA CAATCGTCTC GGCCGTCACC TTCTTCCCGA GATTCACTAC CTCGGCGGCG TGCCCCAGAC CGGCATGCTC GGACGCCGCG CGCGACGCAC GCTGTCGCGC CTGGCTGCGG GTGACGAACC CTTCCTGCTC AACCTCTTCT ATTCGACCAC GCATCCGCCT TTCGCATCGG AGCACCCCTA CTACACCCGG TTCTCCGATC CTGCCTATTC CGGCGCGTCC AAGTTCGCCA TGGCCCGGCT GACCGAACCG TTCGAGATCA TCCGCCGCCA GGGCGAGCCG CGCGAGGAGT TCGACCTCGA CCAGATCCTC GATCTCTACG ATGGCTGCGT GGCCCAGTTC GACGACGAGG TCGGGCGCCT GCTGCGCCAG CTCGACGACA GCGGCCTCGC GGAGGACACC ATCGTGGTGC TCTACAGCGA CCACGGCATG GAGTTCTTCG AGCACGGCAC CTGGGGCCAG GGCAACTCCG CCCTGGGCGA CTTCAGCGCG CGCGTGCCGC TGATCGTCGT CGACCCGGCC CGACCGGGCG GCCAGCGCGT CGACCAGGTG GTGCGCAGCG TGGACATCAT GCCGACCCTG CTCGATCTGC TCGGTGCGCC CTCGGTCGGC TGCGACGGGG TGTCGCTGCG CCCGGCGATC GCCGATCCGG CGACCGACCT GCACCTGCGC GCGTTCAACG AGACCGGGAT CTGGATCGCA CCGGTCCCCG GTCTACCCGA AGGACACCTG AGCTATCCCA ACCTGCTGGA ACTGCTCGAC GTTCCGGACA TCGCGGCGGG CAGCCTGTCG CTGCGCGAGC GCTACCGCCA GACGGTACTG GTCGCCAAGG ACCGCATGGT GCGGGACGGG CGCTGGAAGC TCGTCTACCA GCCGCTCGAG CACGGCAGGC TTCTCAGCCT GTACGACGTC GAGAGCGACC CCGGGTGCAC CGCGGACGTC GCGTCCCGGC ATCCCGCAGA GGTCGAGCGC CTGTGGGCGC AGCTGCGCGC CTGGATGGCG AACGACCCGG CGCTGCGCGG AGATCCCCGC CTGGACCTGC CGCCGACGCC CGCAGCGACC TCAGCCGCCC GCGCGCCGGA AGCGGATCTC GCGCCCGAGA TGCGGTAG
|
Protein sequence | MNVATAARSL TILICTHNRA DLLARVIDSL ESARQPAGWS VRLFVVANAC TDGTHEFLAG RSDRADRLAL SWIAEPTPGK SHALNRALPL LEDELVAFVD DDHRVDADYL LGVTAAAERW PEADLFCGRI VPDWDGSEPA WVHDEGPYRI YPLPVPRYDQ GMEDFPVDLE GPIPGGGNLV ARLPVIGATG PFAIELGPTG HDLGGSEDAD WILRALRKGA RLHYAPQILQ YHYVDSERLT LSYVARKGYQ RSQSVTRVRA EFDRVPRYMW RKAAGYALGL AFSWRLQARR FYLVRLASSM GEISGIRDRH RRKRRRAALP MLGAEAGSAA LLVAAALTLA LATVLARHWL GEALLGAGVV GALTSAVLVA KSVRDFSRTG PGLREEILAR YRGYVVFAIA RLGLAALGLA AFWAFPGTAL WITAAEALGR EPPLWTTAAG GALTLALATV YAGCRALSQN PGLVIASWQY RTVRIHRLWR ALSQRGLDLI ARIVLATGIG LVGAIALLRY HQGGSADAGA MLLVTCGYIA LLAWAIWEPD GTHAPTPRRR ARRNVLMIGS DTLRADRIGA QREGASITPN IDALAARGTR FGACYVPCAR TAPSLISLFT ATWPHHHGVR DNYVAGAETR LEDKTLPNIL RALGYRTAAV SDWCGADLGK FDFGFDITDL PEDQWNLKYL IRQGPKDIRL FLSLFLHNRL GRHLLPEIHY LGGVPQTGML GRRARRTLSR LAAGDEPFLL NLFYSTTHPP FASEHPYYTR FSDPAYSGAS KFAMARLTEP FEIIRRQGEP REEFDLDQIL DLYDGCVAQF DDEVGRLLRQ LDDSGLAEDT IVVLYSDHGM EFFEHGTWGQ GNSALGDFSA RVPLIVVDPA RPGGQRVDQV VRSVDIMPTL LDLLGAPSVG CDGVSLRPAI ADPATDLHLR AFNETGIWIA PVPGLPEGHL SYPNLLELLD VPDIAAGSLS LRERYRQTVL VAKDRMVRDG RWKLVYQPLE HGRLLSLYDV ESDPGCTADV ASRHPAEVER LWAQLRAWMA NDPALRGDPR LDLPPTPAAT SAARAPEADL APEMR
|
| |
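The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). Only the first 30 and last 18 bases of the gene sequence are used, copied from the record above.

```python
# Codon lookup in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted as-is. Any character other than T, C, A, G raises
    ValueError, and trailing bases that do not fill a codon are skipped.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        b1, b2, b3 = dna[i], dna[i + 1], dna[i + 2]
        index = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First three 10-base blocks and the final 18 bases of the gene sequence.
first_30 = "ATGAACGTCG CCACCGCCGC CCGCAGCCTC"
last_18 = "GCGCCCGAGA TGCGGTAG"

print(translate(first_30))  # MNVATAARSL
print(translate(last_18))   # APEMR*
```

The first result matches the first ten residues of the protein sequence, and the second matches its last five residues followed by the stop codon.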