Gene Information |
Locus tag | Tmz1t_1478 |
Symbol | |
ID | 7083561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1646875 |
End bp | 1648551 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643698496 |
Product | sulfatase |
Protein accession | YP_002355133 |
Protein GI | 217969899 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
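The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the stated Gene Length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 1646875
END_BP = 1648551


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1677 558
```

Both results agree with the Gene Length (1677 bp) and Protein Length (558 aa) fields.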
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCG TCGAGCAAGG ACAAGGACAA GGCGCACCGT ACAACATCGT GTTCATCCTT ACCGATCAGG AGCGTTACTT CCGCCCTGAC GAACTGCCCG CCGGCTATAC GCTTCCGGCA CGCGAGCGCC TGGCCAAGAA TGGTGTGGTG TTCGAGAACC ACCGCATCAA CTCCTGCGTG TGCACCCCCT CGCGCTCGGT GATCTACACC GGCCGCCACA TCCAGCAGAC CAGGATGTTC GACAACACGA ACTTCCCCTG GATCAGCAGC ATGTCCACCG ACATCAAGAC GCTCGGGCAC ATGATGCGCG AGGCCGGCTA CTACACCGCC TACAAGGGCA AGTGGCACCT GACCCGCGAG TTCGAGACCG ACAACACGCT CGCGGCGCCG CAGAAGATCT TCACCAAGGA GATGGAGGCC TACGGCTTTT CCGACTACCT CGGCGTCGGC GACATCATCG CGCACACTCA GGGCGGCTAC CTGCACGACG GCTTGATCGC CGCCGCCGCG GCGAGCTGGT TGCGCAGCAA GGCCGCGGAG CTCGCCGAGC AGCAGAAGCC GTGGTTCCTC GCGGTGAACC TGGTCAACCC GCACGACGTG ATGTTCTACA ACACCGACGA GCCCGGCCAG CCCGTGCAGG GCAAGCACCA TCTGACCCAC CTCGCCGGCG ATCCGGAGCA CGCGATGTAC AAGAAGCAGT GGGACATCGA CCTTCCCGCC AGCTTCAAGC AGCCGATCGA CGCCCCCGGA CGCCCCGCCG CGCACATCGA CCACACCATC GGCAACGACG TCATGACCGG CGTGATCCCG ACGAACGAGG AATGGCGCTG GCGCAAGCGC CACAACTTCT ACCTGAACGC CTTGCAGGAC GTCGACCGTC ACATCATGAC GCTGCTCGAC GAACTGGAGG ACCGCGGGCT GGCTTCGAAC ACCATCGTCA TCCTGACCTC GGACCACGGC GAACTCGGTG GCGCACACCA GATGACCGGC AAGGGTGCCA CCTCCTATCG CGAGCAGAAC AACGTGCCCT TGATCGTGGC GCATCCGGCC TTTGCGGGGG GCAAGCGCTG CAAGGCCGTG ACCACCCACC TCGACCTCGC GCCGACCCTG ATCGCGCTCA CCAACGCGAG CCCGGAGACA AAGGCGGCAA TCGCCCAGAC GCTGCCGGGC AAGGACTTCT CGCCCGTGCT CGCCGCGCCG GAACAGGCGA ACGTCGACAC CGTGCGCGAC GGGCAGTTGT ACTGCTTCAA CATGTTCGCC TCGCTGGACG GCAGCTTCCT GCAGAAGGCC AGCGCGCTCC TTGCACAGCC GGGTGGCGCG GCGAAGATCA AGGAATCCGG CCTGCGCCCC GACCTGAGCA AGCGCGGCGC GATCCGCAGC GTGTTCGACG GCCGCTACCA GTTCACCCGC TACTTTTCGC CCAAGCAGCA CAACCGGCCG ACGTCGATCG ACGAGCTGTT CGCACTCAAC GACGTCGAGC TGTTCGACCT CCAGAACGAC CCGGACGAAG TCGACAACCT TGCCCAGGAC CCGGTCAAGA ACGCCGCGCT GCTGCTCATG ATGAACGACA AGCTCAACCG GCTGATCGAC GAGGAAGTCG GCGAGGACGT CGGCCAGATG CTGCCGGGCG GCGTGGATGG CGGCTGGGTG GCGACCCCCG CGGTGCACGA CCTCTGA
|
Protein sequence | MSIVEQGQGQ GAPYNIVFIL TDQERYFRPD ELPAGYTLPA RERLAKNGVV FENHRINSCV CTPSRSVIYT GRHIQQTRMF DNTNFPWISS MSTDIKTLGH MMREAGYYTA YKGKWHLTRE FETDNTLAAP QKIFTKEMEA YGFSDYLGVG DIIAHTQGGY LHDGLIAAAA ASWLRSKAAE LAEQQKPWFL AVNLVNPHDV MFYNTDEPGQ PVQGKHHLTH LAGDPEHAMY KKQWDIDLPA SFKQPIDAPG RPAAHIDHTI GNDVMTGVIP TNEEWRWRKR HNFYLNALQD VDRHIMTLLD ELEDRGLASN TIVILTSDHG ELGGAHQMTG KGATSYREQN NVPLIVAHPA FAGGKRCKAV TTHLDLAPTL IALTNASPET KAAIAQTLPG KDFSPVLAAP EQANVDTVRD GQLYCFNMFA SLDGSFLQKA SALLAQPGGA AKIKESGLRP DLSKRGAIRS VFDGRYQFTR YFSPKQHNRP TSIDELFALN DVELFDLQND PDEVDNLAQD PVKNAALLLM MNDKLNRLID EEVGEDVGQM LPGGVDGGWV ATPAVHDL
|
| |
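The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is given on the coding strand (it begins with ATG and ends with TGA, even though the gene lies on the minus strand of the replicon). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with the matching
# amino acid string of the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string; spaces between blocks are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the last four codons of the gene sequence above.
print(translate("ATGAGCATCG TCGAGCAAGG ACAAGGACAA"))  # prints: MSIVEQGQGQ
print(translate("CACGACCTCTGA"))                      # prints: HDL*
```

The first output matches the first block of the protein sequence, and the second matches its final three residues followed by the stop codon.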