Gene Tmz1t_1931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1931 
Symbol 
ID7084399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2174163 
End bp2175911 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content72% 
IMG OID643698956 
Productsulfatase 
Protein accessionYP_002355578 
Protein GI217970344 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.514301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAGT TACGTCTTTC CCACGTCCCG CCCTTCGCTC CCTCGGCCGG CGCATCGACG 
CCCGTCGGGC GTCGCGCTCT GGGCGCTTCG GCGTTCGCCG CCGCCTGGTC CGGCGCGCGC
AGCTGGCGCC CCGAGGTGTC GGGCGAGGGC TTGCTGGTCG GGCTCGGCCT CTATTTCGCG
CTCGCCTGCA ATACGCCCTT CTGGCGTGCG CTGCTCGCGA GCCGTGGCGG CGAGGGCGGC
GGCCTCTTCT ATGTGCTCGC GCTCGGCGTG GCGCTCGCCG CGCTCAACGT CGCGCTGCTC
GCGCCGCTGC TCAACCGCTG GACGACCAAG CCGCTGCTGG GCGGCCTGAT CCTCGTGGCC
GCGGTGTCGA GCTACTACGC CGGCCACTTC GGCGTGTACT TCGACCCGAG CATGCTGCGC
AACGTGCTGC GCACCGACCT CGCCGAGGCG CGCGAGCTCC TCACGCCCGG CTTCTTCCTG
CAGGTCTCCG CGCTGGCCCT GCCGCCCCTC GTCTTTCTTG CGCGAGCGCG CGTGCGCCGG
CGTCCGCTGC GGCGCGCGCT GGCGATCCGG GCCGCCGCCG CCGTGCTCGC GCTGCTCGTG
GCCGTCGCGG CGCTCGGTAG CGTGTTCAAG GACTTCTCCG GGCAGATGCG CAACCACAAG
GAGCTGCGCT ACCTGATCAC CCCCGCCGCG CCGCTGTGGT CGCTCGCGCG CGTGCTGAGC
CGCGACGCGC AGGCGGCCAA TCAGCCGCGC CGGCCGGTCG GCGCCGATGC CCGCCTGGGA
GCGAGCTGGG CGGCGGCGAA GAAGCCGACC CTGTTCGTGA TCGTGGTCGG CGAGACCGCG
CGCGCCGCCA ACTGGGGCCT GGACCGTGGG GCGGGGCAGT CGCCCGCCCA CGACACCACG
CCCGAGCTCG CCCGCCGCGC GGTCATCAAC TTCCCGGACG TGACGAGCTG CGGCACCAAC
ACCGAGGTGT CGGTGCCCTG CATGTTCTCG CTGCAGGGGC GGCGCAACTA CGACGAGGAT
GCCATCCGCG GCAGCGAGTC CTTGCTCGAT GTGCTGCGTC ACGCCGGCCT GCGGGTGGTG
TGGAACGACA ACCAGTCCGG CTGCAAGGGC GTGTGCGCCG GGGTCGAGAG CCTGCGCCCC
GACCCCGCCG CCTTGCCCGC GCTGTGCGAC GGCGAGCGCT GCCTCGACGA GGCCTTGCTG
GAGAGCAGTC GGGCGCTGCT GCGTGATCCG CAAGGCAACC TCGTGCTGGT CCTGCATCAG
CTTGGCAACC ACGGTCCGGC CTACTTCCGC CGCTATCCGG AAGCCTTCCG CCGCTTCACG
CCGACCTGCG ACGACGAGGA CCTGTCGAAG TGCACCCGCG AGCAGGTGGT CAACAGCTAT
GACAACGCGC TGAGCTACAC CGACCACGTG CTCGCCCGCG GCATCGACCT GCTGAAGGAG
CTGGAGCCGC GCTACGACGC CGCGCTGCTG TATGTCTCGG ACCATGGCGA GTCGCTCGGC
GAGAACGGCC TCTACCTGCA CGGGCTGCCG TACTCGATCG CGCCCGCGGA GCAGACCCGC
GTGCCGATGC TGATGTGGCT GTCGTCCGGC TTCGCCGCGC GCAACCGGGT CGATGCGGCG
TGCCTGCGCG GGCAGGCCGC CCGGCCCGCC AGCCATGACA ACCTCTTCCA TACCGTCCTG
GGCCTGCTCG ACGTGCGCAC CGCGGTCCGC GACGACGCAC TCGACCTGAC CGCCCCCTGC
CGGAGCTGA
 
Protein sequence
MFKLRLSHVP PFAPSAGAST PVGRRALGAS AFAAAWSGAR SWRPEVSGEG LLVGLGLYFA 
LACNTPFWRA LLASRGGEGG GLFYVLALGV ALAALNVALL APLLNRWTTK PLLGGLILVA
AVSSYYAGHF GVYFDPSMLR NVLRTDLAEA RELLTPGFFL QVSALALPPL VFLARARVRR
RPLRRALAIR AAAAVLALLV AVAALGSVFK DFSGQMRNHK ELRYLITPAA PLWSLARVLS
RDAQAANQPR RPVGADARLG ASWAAAKKPT LFVIVVGETA RAANWGLDRG AGQSPAHDTT
PELARRAVIN FPDVTSCGTN TEVSVPCMFS LQGRRNYDED AIRGSESLLD VLRHAGLRVV
WNDNQSGCKG VCAGVESLRP DPAALPALCD GERCLDEALL ESSRALLRDP QGNLVLVLHQ
LGNHGPAYFR RYPEAFRRFT PTCDDEDLSK CTREQVVNSY DNALSYTDHV LARGIDLLKE
LEPRYDAALL YVSDHGESLG ENGLYLHGLP YSIAPAEQTR VPMLMWLSSG FAARNRVDAA
CLRGQAARPA SHDNLFHTVL GLLDVRTAVR DDALDLTAPC RS