Gene Tmz1t_2462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2462 
Symbol 
ID7874145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2655086 
End bp2656810 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content74% 
IMG OID643699384 
Productsulfatase 
Protein accessionYP_002889441 
Protein GI237653127 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCAC GGGCGCGTGA TCTTGTGCTC GGCACGCTCC TGCCGCTGCT CGCGCTCGAC 
GCCTTGCTGG TGTTCGACAA CGCCTGGCCG ACGCTGTGGC CGCGGCCGAC GCCGGCCGTC
TCGATCGAGC TCGCCTTCGC CGTCGCCGCA CTGGTGCTCT TCGCCGCCTG GCGGAAACAC
GCAGCGCACG GCCTCGTGCG CGTGCTCGCG GGGCTGGCCA CGCTGTGGAT CGTGCTGCGC
TACGTGCAGG TGACCGTCCC CGCGCTGTTC GGCCGCCCGC TCAACCTCTA CTGGGACCTG
CCCCACCTCG GCGCCGTGCT CGACATGGGC GGCGACGGTC CGGGCGCGAA GGTCCTGCTC
GCGCTCGCGC TCGGTATCCT CGTCGTGCTG GTACTGCATC GCGTGGTGTC GGCCTGCGTG
CGTGCCCTCG CACGCAGCGC GGCGGCACCG GGCGCGCGTG CGCTGCTCGG CGGTGTCGCG
GGCGCGGCAA TCCTGCTGTG GGCGCTGGCG CCGTGGCCCG GCCCGCTCGC CGCGTCGGCC
TTCGCGCGGC CGGTCGCCGC GCTGGTCTCC GATCAGATCC GCTTCCTGCA CTCGGCGCTC
GCCGCCGACG GCGGCGAGCG GCTCGGCCCC GGCCCCGACT TCCGCGGCGA CCTCGCGGCG
CTGCGGGGCG CAGACGTGCT GATCGTGTTC GCCGAGGCCT ACGGCGCGGT CAGCTTCGAT
CGACCGCCGA TCACCGCGGC GCTCGCCGAC GCACGCACCG AGCTGGGTGC CGCGATCGCG
GCCAGCGGCC GCGAGGCCGT CTCCGCCCGC GTGGTCTCGC CGACCTTCGG CGGCGCCTCC
TGGCTGGCGC ACGCCGCCGT GCTCGCGGGG GTCGATACCC GCGACCCGGC CGACCACGCA
CTCTTGCTGA CCACCGATCG CCCCACCCTG GTCCGCCACT TCGCCACCCA CGGCTACCGT
ACCGTGGGCT GGATGCCAGG GCTGCAGCGC CCCTGGCCGG AGGGCCGCTT CTACGGCTTC
GACCGCATCG CCGATGCCGA CAGCGCGGGC TACGCCGGCC TGCCCTTCGG CTTCTGGCGC
ATCCCCGACC AGGCCTCGAT GGCGCGCATC CACGTGGACG AGCTGGGCGG CAGCTTCGGC
GAGGCAGTCG GTACGCAGCA GGCCCGCTCC AGCCCGGGCT CCTCCCGGGC TTCTGCGGGC
GACACCTCCG CCAGCGCGCC CACTTCCCGT CCGGGCGCAC GCGCGGAGAC CGCTGCCGCG
CGTCGCGCGC CACGGCTGGT CGTTTTCGCC ACCGTCTCCA CGCACGCCCC CTTCGCAGCG
ATCCCACCCT TGCGCGAGGA CTGGTCGCGG CTGCTGCGCG CGGACGCCTT CAGCCAGGAG
GAGGTCGAAG CCGCGGCGGC CGTGCGGGTG TCCTGGACCG AGCCGCTCCC CGCTTATCTG
GCCTCGATGC GTTACCAGCT CGGCTGGCTC GCCGACTACC TCGCCCATCA CGCCGCGCGG
GAGCTGGTCC TGATCGTGAT CGGCGACCAC CAGCCGATCG GCACGGTGAG CGGGCCGGAC
CAGCCGCACG ACGTGCCGGT GCATGTGATC GCCTCCGACC CCGCGCTGCT CACCCGCTTC
GCGGCCGCCG GCTTCGTCGC CGGCCTGACG CCGCCACAGC AACCCCTCGG CCCGATGCAT
GAGCTTGCCC AGGTGCTGGT GGATGCCTTC TCGGGCCCGC GGTGA
 
Protein sequence
MSARARDLVL GTLLPLLALD ALLVFDNAWP TLWPRPTPAV SIELAFAVAA LVLFAAWRKH 
AAHGLVRVLA GLATLWIVLR YVQVTVPALF GRPLNLYWDL PHLGAVLDMG GDGPGAKVLL
ALALGILVVL VLHRVVSACV RALARSAAAP GARALLGGVA GAAILLWALA PWPGPLAASA
FARPVAALVS DQIRFLHSAL AADGGERLGP GPDFRGDLAA LRGADVLIVF AEAYGAVSFD
RPPITAALAD ARTELGAAIA ASGREAVSAR VVSPTFGGAS WLAHAAVLAG VDTRDPADHA
LLLTTDRPTL VRHFATHGYR TVGWMPGLQR PWPEGRFYGF DRIADADSAG YAGLPFGFWR
IPDQASMARI HVDELGGSFG EAVGTQQARS SPGSSRASAG DTSASAPTSR PGARAETAAA
RRAPRLVVFA TVSTHAPFAA IPPLREDWSR LLRADAFSQE EVEAAAAVRV SWTEPLPAYL
ASMRYQLGWL ADYLAHHAAR ELVLIVIGDH QPIGTVSGPD QPHDVPVHVI ASDPALLTRF
AAAGFVAGLT PPQQPLGPMH ELAQVLVDAF SGPR