Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1931 |
Symbol | |
ID | 7084399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2174163 |
End bp | 2175911 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698956 |
Product | sulfatase |
Protein accession | YP_002355578 |
Protein GI | 217970344 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.514301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAGT TACGTCTTTC CCACGTCCCG CCCTTCGCTC CCTCGGCCGG CGCATCGACG CCCGTCGGGC GTCGCGCTCT GGGCGCTTCG GCGTTCGCCG CCGCCTGGTC CGGCGCGCGC AGCTGGCGCC CCGAGGTGTC GGGCGAGGGC TTGCTGGTCG GGCTCGGCCT CTATTTCGCG CTCGCCTGCA ATACGCCCTT CTGGCGTGCG CTGCTCGCGA GCCGTGGCGG CGAGGGCGGC GGCCTCTTCT ATGTGCTCGC GCTCGGCGTG GCGCTCGCCG CGCTCAACGT CGCGCTGCTC GCGCCGCTGC TCAACCGCTG GACGACCAAG CCGCTGCTGG GCGGCCTGAT CCTCGTGGCC GCGGTGTCGA GCTACTACGC CGGCCACTTC GGCGTGTACT TCGACCCGAG CATGCTGCGC AACGTGCTGC GCACCGACCT CGCCGAGGCG CGCGAGCTCC TCACGCCCGG CTTCTTCCTG CAGGTCTCCG CGCTGGCCCT GCCGCCCCTC GTCTTTCTTG CGCGAGCGCG CGTGCGCCGG CGTCCGCTGC GGCGCGCGCT GGCGATCCGG GCCGCCGCCG CCGTGCTCGC GCTGCTCGTG GCCGTCGCGG CGCTCGGTAG CGTGTTCAAG GACTTCTCCG GGCAGATGCG CAACCACAAG GAGCTGCGCT ACCTGATCAC CCCCGCCGCG CCGCTGTGGT CGCTCGCGCG CGTGCTGAGC CGCGACGCGC AGGCGGCCAA TCAGCCGCGC CGGCCGGTCG GCGCCGATGC CCGCCTGGGA GCGAGCTGGG CGGCGGCGAA GAAGCCGACC CTGTTCGTGA TCGTGGTCGG CGAGACCGCG CGCGCCGCCA ACTGGGGCCT GGACCGTGGG GCGGGGCAGT CGCCCGCCCA CGACACCACG CCCGAGCTCG CCCGCCGCGC GGTCATCAAC TTCCCGGACG TGACGAGCTG CGGCACCAAC ACCGAGGTGT CGGTGCCCTG CATGTTCTCG CTGCAGGGGC GGCGCAACTA CGACGAGGAT GCCATCCGCG GCAGCGAGTC CTTGCTCGAT GTGCTGCGTC ACGCCGGCCT GCGGGTGGTG TGGAACGACA ACCAGTCCGG CTGCAAGGGC GTGTGCGCCG GGGTCGAGAG CCTGCGCCCC GACCCCGCCG CCTTGCCCGC GCTGTGCGAC GGCGAGCGCT GCCTCGACGA GGCCTTGCTG GAGAGCAGTC GGGCGCTGCT GCGTGATCCG CAAGGCAACC TCGTGCTGGT CCTGCATCAG CTTGGCAACC ACGGTCCGGC CTACTTCCGC CGCTATCCGG AAGCCTTCCG CCGCTTCACG CCGACCTGCG ACGACGAGGA CCTGTCGAAG TGCACCCGCG AGCAGGTGGT CAACAGCTAT GACAACGCGC TGAGCTACAC CGACCACGTG CTCGCCCGCG GCATCGACCT GCTGAAGGAG CTGGAGCCGC GCTACGACGC CGCGCTGCTG TATGTCTCGG ACCATGGCGA GTCGCTCGGC GAGAACGGCC TCTACCTGCA CGGGCTGCCG TACTCGATCG CGCCCGCGGA GCAGACCCGC GTGCCGATGC TGATGTGGCT GTCGTCCGGC TTCGCCGCGC GCAACCGGGT CGATGCGGCG TGCCTGCGCG GGCAGGCCGC CCGGCCCGCC AGCCATGACA ACCTCTTCCA TACCGTCCTG GGCCTGCTCG ACGTGCGCAC CGCGGTCCGC GACGACGCAC TCGACCTGAC CGCCCCCTGC CGGAGCTGA
|
Protein sequence | MFKLRLSHVP PFAPSAGAST PVGRRALGAS AFAAAWSGAR SWRPEVSGEG LLVGLGLYFA LACNTPFWRA LLASRGGEGG GLFYVLALGV ALAALNVALL APLLNRWTTK PLLGGLILVA AVSSYYAGHF GVYFDPSMLR NVLRTDLAEA RELLTPGFFL QVSALALPPL VFLARARVRR RPLRRALAIR AAAAVLALLV AVAALGSVFK DFSGQMRNHK ELRYLITPAA PLWSLARVLS RDAQAANQPR RPVGADARLG ASWAAAKKPT LFVIVVGETA RAANWGLDRG AGQSPAHDTT PELARRAVIN FPDVTSCGTN TEVSVPCMFS LQGRRNYDED AIRGSESLLD VLRHAGLRVV WNDNQSGCKG VCAGVESLRP DPAALPALCD GERCLDEALL ESSRALLRDP QGNLVLVLHQ LGNHGPAYFR RYPEAFRRFT PTCDDEDLSK CTREQVVNSY DNALSYTDHV LARGIDLLKE LEPRYDAALL YVSDHGESLG ENGLYLHGLP YSIAPAEQTR VPMLMWLSSG FAARNRVDAA CLRGQAARPA SHDNLFHTVL GLLDVRTAVR DDALDLTAPC RS
|
| |