Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1480 |
Symbol | |
ID | 7083563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1650141 |
End bp | 1652510 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643698498 |
Product | sulfatase |
Protein accession | YP_002355135 |
Protein GI | 217969901 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGC AGGACAACAA ACCCCGTACC CACCTGCCGA TGCGCAATAC CGCGCCGGCG CGCTTCGTTA ACTACGATGC CAAGGATCCG GACACGGTCA CGACCCCGAT CGAGCAGTTG CGTCCGCCCG AAGGCGCGCC GAACGTGCTG ATCGTGCTCA TCGACGACTG CGGCTTCGGC GCGCCCAGCA CCTTCGGCGG CCCCTGCGCC ACGCCCTTTG CCGACAAGCT GGCCGCGGGC GGCCTGCGCT ACAACCGCTT CCACACCACC GCGATCTGCT CGGCCACGCG CCAGTCCCTG CTCACCGGCC GCAACCACCA CAACGCCGGC ATGGCCGGCA TCACCGAGAT CGCCACCGGC CAGCCGGGCT ACTCCTCGGT GCTGCCCAAC TCCATGGCGC CGCTGGCCAA GACGCTCAAG CTCAACGGCT ACAACACCGC ACAGTTCGGC AAATGCCACG AGGTGCCGGT GTGGCAGTCG AGCCCGGTCG GCCCCTTCGA CCAGTGGCCC ACGGGCGGGG GCGGCTTCGA GTACTTCTAC GGCTTCATCG GTGGCGAGGC GCACCAGTGG TATCCCGCGC TCGTCGAGGG CACCACCCCG GTCGCGCCGC CGAAGACGCC CGAGGAGGGT TACCACCTGA TGGAGGACAT GACCGACAAG GCCATCGGCT GGATCCAGCA GCAGAAGGCC CTCTCGCCCG ACAAGCCCTT CTTCGCCTAT TTCGCGCCCG GCGCGACCCA CGCGCCGCAC CATGTGCCCA AGGAATGGGC CGACAAGTAC AAGGGCAAGT TCGACGCCGG CTGGGATGTG CTGCGCGAGG AGACCTTCGC CCGCCAGAAG AAGCTCGGCG TCATTCCTGC CGACTGCGAA CTCACTGCCC GGCCCAAGGA CGTTCCCGGC TGGGACGACA TGCCCGAGGC GTTCAAGCCG GTGCTGCGTC GGCAGATGGA GGTCTATGCG GGCTTCCTCG AGTTCACCGA CCATCACGTG GGCCGCCTGT TCGAGGCGAT CGAGAATCTC GGCATCGCCG ACGACACGCT GGTGTATTAC ATCATCGGCG ACAACGGCGC CTCGGCCGAG GGCTCGTTCA ACGGCTGCTT CAACGAGATG AGCTACTTCA ACGGCCTGCA GTCCCTCGAG ACCGCCGAGT ACCTGACGGA GCGGATCGAC AAGCTCGGCG GGCCGGAGTC GTACAACCAC TTCGCGGTGG GTTGGGCGCA TGCGCTGAAC ACGCCCTACC AGTGGACCAA GCAGGTCGCC TCGCACTTCG GCGGCACGCG CAACGGCACC ATCGTGCACT GGCCGAAGGG CATCCAGGCC AAGGGCGAGT TGCGCACGCA TTTTGCCCAC GTGATCGACG TGGCGCCGAC CATCCTCGAG GCGGCGGGCA TTCCCGAGCC GGTCTCGGTC GATGGCATCC AGCAGGACCC GATCGATGGG GTCAGCATGC TGAAGACGTT CAACGACGCC AAGGCACCGG AAGCCCACGA GACGCAGTAC TTCGAGGTCA TGGGCAACCG GGCCATCTAT CACAAGGGCT GGACCGCGGT GACCAAGCAC TACACGCCGT GGATTCCGAA CGTGAAGCCC GCGCTCGACG ACGACGTCTG GGAGCTCTAC GACACCACCA CGGACTGGGC GCAGTCCAAG GACCTGTCGA AGGAGATGCC GGACAAGCTG CGCGAACTGC AGCGCCTGTG GATCATCGAG GCGACGCGCA ACAAGGTGCT GCCGATCGAT GACCGGATGT TCGAGAAGAT AAATCCGGAC ACCGCCGGCC GGCCGACGCT GGTGAAGGGC AAGACCCAGC TCCTGGCCGG TGGCATGAGC CACCTGGGCG AGAACTGTGT CCTCAACATC AAGAACAAGT CGCACTCCGT CACCGCAGCG ATCGTGGTCC CGCAAGGCGG GGCAGAGGGC GTCATCATCG CGCAGGGCGC CGATATCGGC GGCTGGAGCC TGTATGCCAA GGGCGGCAAG CTCAAGTACT GCTACAACTG GGGCGGCTTC AAGCACTTCA TGATCGAAGG CGCTTCGGTC ATGGCTCCGG GCGAGCATCA GGTGCGCATG GAGTTCGCCT ACGCCGGTGG CGGCCTCGGC AAGGGCGGCA AGGTGACGCT GTACGTCGAT GGCCAGCAGG ACGGCGAGGG CGAAGTCGGC GCAACGCTGG CGATGATCTT CTCGGCCGAC GACGGTCTGG ATGTCGGCAA GGACGGCGGC TCGGCGGTAT CGCCGGACTA CAAGCCTGGC AACAATTCCT TCAACGGCAA GGTGAAGGGC GTGCAGCTCG CGATCGACGA GGCCGCGGAA GACCTGGATC ACCTGCTCGA TCCGGCGGAC GTCATCCGCA TGGCGATGGC CCGCCAGTAA
|
Protein sequence | MSKQDNKPRT HLPMRNTAPA RFVNYDAKDP DTVTTPIEQL RPPEGAPNVL IVLIDDCGFG APSTFGGPCA TPFADKLAAG GLRYNRFHTT AICSATRQSL LTGRNHHNAG MAGITEIATG QPGYSSVLPN SMAPLAKTLK LNGYNTAQFG KCHEVPVWQS SPVGPFDQWP TGGGGFEYFY GFIGGEAHQW YPALVEGTTP VAPPKTPEEG YHLMEDMTDK AIGWIQQQKA LSPDKPFFAY FAPGATHAPH HVPKEWADKY KGKFDAGWDV LREETFARQK KLGVIPADCE LTARPKDVPG WDDMPEAFKP VLRRQMEVYA GFLEFTDHHV GRLFEAIENL GIADDTLVYY IIGDNGASAE GSFNGCFNEM SYFNGLQSLE TAEYLTERID KLGGPESYNH FAVGWAHALN TPYQWTKQVA SHFGGTRNGT IVHWPKGIQA KGELRTHFAH VIDVAPTILE AAGIPEPVSV DGIQQDPIDG VSMLKTFNDA KAPEAHETQY FEVMGNRAIY HKGWTAVTKH YTPWIPNVKP ALDDDVWELY DTTTDWAQSK DLSKEMPDKL RELQRLWIIE ATRNKVLPID DRMFEKINPD TAGRPTLVKG KTQLLAGGMS HLGENCVLNI KNKSHSVTAA IVVPQGGAEG VIIAQGADIG GWSLYAKGGK LKYCYNWGGF KHFMIEGASV MAPGEHQVRM EFAYAGGGLG KGGKVTLYVD GQQDGEGEVG ATLAMIFSAD DGLDVGKDGG SAVSPDYKPG NNSFNGKVKG VQLAIDEAAE DLDHLLDPAD VIRMAMARQ
|
| |