Gene Tmz1t_1478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1478 
Symbol 
ID7083561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1646875 
End bp1648551 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content65% 
IMG OID643698496 
Productsulfatase 
Protein accessionYP_002355133 
Protein GI217969899 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCG TCGAGCAAGG ACAAGGACAA GGCGCACCGT ACAACATCGT GTTCATCCTT 
ACCGATCAGG AGCGTTACTT CCGCCCTGAC GAACTGCCCG CCGGCTATAC GCTTCCGGCA
CGCGAGCGCC TGGCCAAGAA TGGTGTGGTG TTCGAGAACC ACCGCATCAA CTCCTGCGTG
TGCACCCCCT CGCGCTCGGT GATCTACACC GGCCGCCACA TCCAGCAGAC CAGGATGTTC
GACAACACGA ACTTCCCCTG GATCAGCAGC ATGTCCACCG ACATCAAGAC GCTCGGGCAC
ATGATGCGCG AGGCCGGCTA CTACACCGCC TACAAGGGCA AGTGGCACCT GACCCGCGAG
TTCGAGACCG ACAACACGCT CGCGGCGCCG CAGAAGATCT TCACCAAGGA GATGGAGGCC
TACGGCTTTT CCGACTACCT CGGCGTCGGC GACATCATCG CGCACACTCA GGGCGGCTAC
CTGCACGACG GCTTGATCGC CGCCGCCGCG GCGAGCTGGT TGCGCAGCAA GGCCGCGGAG
CTCGCCGAGC AGCAGAAGCC GTGGTTCCTC GCGGTGAACC TGGTCAACCC GCACGACGTG
ATGTTCTACA ACACCGACGA GCCCGGCCAG CCCGTGCAGG GCAAGCACCA TCTGACCCAC
CTCGCCGGCG ATCCGGAGCA CGCGATGTAC AAGAAGCAGT GGGACATCGA CCTTCCCGCC
AGCTTCAAGC AGCCGATCGA CGCCCCCGGA CGCCCCGCCG CGCACATCGA CCACACCATC
GGCAACGACG TCATGACCGG CGTGATCCCG ACGAACGAGG AATGGCGCTG GCGCAAGCGC
CACAACTTCT ACCTGAACGC CTTGCAGGAC GTCGACCGTC ACATCATGAC GCTGCTCGAC
GAACTGGAGG ACCGCGGGCT GGCTTCGAAC ACCATCGTCA TCCTGACCTC GGACCACGGC
GAACTCGGTG GCGCACACCA GATGACCGGC AAGGGTGCCA CCTCCTATCG CGAGCAGAAC
AACGTGCCCT TGATCGTGGC GCATCCGGCC TTTGCGGGGG GCAAGCGCTG CAAGGCCGTG
ACCACCCACC TCGACCTCGC GCCGACCCTG ATCGCGCTCA CCAACGCGAG CCCGGAGACA
AAGGCGGCAA TCGCCCAGAC GCTGCCGGGC AAGGACTTCT CGCCCGTGCT CGCCGCGCCG
GAACAGGCGA ACGTCGACAC CGTGCGCGAC GGGCAGTTGT ACTGCTTCAA CATGTTCGCC
TCGCTGGACG GCAGCTTCCT GCAGAAGGCC AGCGCGCTCC TTGCACAGCC GGGTGGCGCG
GCGAAGATCA AGGAATCCGG CCTGCGCCCC GACCTGAGCA AGCGCGGCGC GATCCGCAGC
GTGTTCGACG GCCGCTACCA GTTCACCCGC TACTTTTCGC CCAAGCAGCA CAACCGGCCG
ACGTCGATCG ACGAGCTGTT CGCACTCAAC GACGTCGAGC TGTTCGACCT CCAGAACGAC
CCGGACGAAG TCGACAACCT TGCCCAGGAC CCGGTCAAGA ACGCCGCGCT GCTGCTCATG
ATGAACGACA AGCTCAACCG GCTGATCGAC GAGGAAGTCG GCGAGGACGT CGGCCAGATG
CTGCCGGGCG GCGTGGATGG CGGCTGGGTG GCGACCCCCG CGGTGCACGA CCTCTGA
 
Protein sequence
MSIVEQGQGQ GAPYNIVFIL TDQERYFRPD ELPAGYTLPA RERLAKNGVV FENHRINSCV 
CTPSRSVIYT GRHIQQTRMF DNTNFPWISS MSTDIKTLGH MMREAGYYTA YKGKWHLTRE
FETDNTLAAP QKIFTKEMEA YGFSDYLGVG DIIAHTQGGY LHDGLIAAAA ASWLRSKAAE
LAEQQKPWFL AVNLVNPHDV MFYNTDEPGQ PVQGKHHLTH LAGDPEHAMY KKQWDIDLPA
SFKQPIDAPG RPAAHIDHTI GNDVMTGVIP TNEEWRWRKR HNFYLNALQD VDRHIMTLLD
ELEDRGLASN TIVILTSDHG ELGGAHQMTG KGATSYREQN NVPLIVAHPA FAGGKRCKAV
TTHLDLAPTL IALTNASPET KAAIAQTLPG KDFSPVLAAP EQANVDTVRD GQLYCFNMFA
SLDGSFLQKA SALLAQPGGA AKIKESGLRP DLSKRGAIRS VFDGRYQFTR YFSPKQHNRP
TSIDELFALN DVELFDLQND PDEVDNLAQD PVKNAALLLM MNDKLNRLID EEVGEDVGQM
LPGGVDGGWV ATPAVHDL