Gene Tmz1t_3870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3870 
Symbol 
ID7874111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4265597 
End bp4267333 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content66% 
IMG OID643700812 
Productsulfatase 
Protein accessionYP_002890835 
Protein GI237654521 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTC CCCGCAACAC CCCCAACCCG TCCCCGGGCA GTCGCTGGCG GAAACTCGCC 
GCCTGCGCCG GGATGCTGAT GGCAGCCGGC CTCGGCTTGT TGCATGGCGC CCCCCAGGCC
AGTGCTGCCG AGCCGGCCAA GAAACCCAAT ATCCTGCTGA TCGTTGCCGA CGACATGGGC
TATTCCGATG TCGGCGCCTT CGGCGGCGAG ATCGAGACCC CGAACATCGA CGCGCTGGCC
CAGCGCGGCC TGAGCGCGAC CAATTTCTAC GTCGCCCCGA CCTGCTCGCC GACCCGCTCG
ATGCTGCTCA CCGGCACCGA CAACCACGTC GCCGGCTTTG GCGTCATGTC CGAGTACACG
GGGCCGCAGC AGAAGGGCAA GCCCGGCTAC GAAGGCCATC TGAACCAGCG CATGACCAGC
ATCGCCACCC TGTTGCGCGA TTCCGGCTAC CACACCTACA TGGCCGGCAA ATGGCACCTC
GGCGAGGAGA AGGGACAGTG GCCGGCCGAC CAGGGCTTCG AGCGCGACTT CACCCTGATG
CAGGGTGGCG GCAGCCACTG GTCGGATATG GGTTATCCCA ACCCCCAGCA TCCGAACCTG
ACCTTCACGC GCAACGGCAA GCTGCTCGAC AAGCTGCCCG ACGACCACTT CTCGACGGCG
GCCTTCAGCG ATTTCATCAC GCAGTCGATC GACGAGAACA AGGCCGACGG CAAGCCCTTC
TTCGCCTATC TCTCCTACCA GGCGGTGCAT AGCCCCTTCG CCCTGCCGGA CGACTGGATC
GACAAGTACA AGGGACGCTA CGACCAGGGC TACGATGCAC TGCGCGCCGA GCGCCTGGCG
CGGATGAAGG CGATGGGACT GGTCGGCGCC GACGTCAGCC TGGCACCGCG CATGCCCAAC
GTGCCCGCGT GGGACAGCCT GACGCCCGAG CAGAAGAAGA TTTCCGCCCG CAAGATGGAA
GTCTATGCGG CGATGGTGGC CAACATGGAC CACCACATCG GCCGCGTCCT CGGCCACCTG
AAGGCCAACG GCCAGCTCGA CAACACGCTG GTGCTGTTCT TCTCCGACAA CGGCGCCGAA
CCGGTCGAAC TGCTCGAGCT CGCGGCCTCG GTCGATCCGG CGATGAAGGT CTGGCTGGAG
AAGAACTGGG ACACCAGGCC GGAAAACTGC GGCCGCAAGA TGTCCGTCTG TGACTACGGC
GCCGCCTGGG CCCAGGTCGG CTCGACCCCC TTCAACTACT TCAAGCACTA CACCGCCGAG
GGCGGCATCC GCTCGCCGCT GATCGCCGCC GGTCCCGGCG TCGTCTCCGA CGGCCAGACC
ACCCGGGCCG TCCTGCACGT GACCGATGTC GTACCGACAC TCCTCGAACT TGCCGGCGTC
AGCCACCCGT CGCAGCGCGG AGGAAGCGAT CAGGCACCGC TGACCGGCAA GTCGATGCAG
CCTGTCCTCG TCGGCAAGGC GCAGGACATC CGCAGCGCCG ACGAATGGAT CGGCTGGGAG
CTGTTCGGCA ACCGCGCCCT GCGCCAGGGC GACTGGAAGG CCCTGTCCCT GCTGAAGGCA
GCCGGCGGCA CCGGCGAATG GCAGCTGTAC AACCTGAAGG ACGACCCGAC GGAGTCACGC
GACCTGGCGT CCAGCCAGCC GGCCAAGCTC GCGGAGCTGA CCCGGCTGTG GGACCTCTAT
GCCAGCCAGA ACGGCGTCAT TCTCACCGGT GACGGTCCGT TCAAGGGCAG GAAATAA
 
Protein sequence
MTIPRNTPNP SPGSRWRKLA ACAGMLMAAG LGLLHGAPQA SAAEPAKKPN ILLIVADDMG 
YSDVGAFGGE IETPNIDALA QRGLSATNFY VAPTCSPTRS MLLTGTDNHV AGFGVMSEYT
GPQQKGKPGY EGHLNQRMTS IATLLRDSGY HTYMAGKWHL GEEKGQWPAD QGFERDFTLM
QGGGSHWSDM GYPNPQHPNL TFTRNGKLLD KLPDDHFSTA AFSDFITQSI DENKADGKPF
FAYLSYQAVH SPFALPDDWI DKYKGRYDQG YDALRAERLA RMKAMGLVGA DVSLAPRMPN
VPAWDSLTPE QKKISARKME VYAAMVANMD HHIGRVLGHL KANGQLDNTL VLFFSDNGAE
PVELLELAAS VDPAMKVWLE KNWDTRPENC GRKMSVCDYG AAWAQVGSTP FNYFKHYTAE
GGIRSPLIAA GPGVVSDGQT TRAVLHVTDV VPTLLELAGV SHPSQRGGSD QAPLTGKSMQ
PVLVGKAQDI RSADEWIGWE LFGNRALRQG DWKALSLLKA AGGTGEWQLY NLKDDPTESR
DLASSQPAKL AELTRLWDLY ASQNGVILTG DGPFKGRK