Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3870 |
Symbol | |
ID | 7874111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4265597 |
End bp | 4267333 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700812 |
Product | sulfatase |
Protein accession | YP_002890835 |
Protein GI | 237654521 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATTC CCCGCAACAC CCCCAACCCG TCCCCGGGCA GTCGCTGGCG GAAACTCGCC GCCTGCGCCG GGATGCTGAT GGCAGCCGGC CTCGGCTTGT TGCATGGCGC CCCCCAGGCC AGTGCTGCCG AGCCGGCCAA GAAACCCAAT ATCCTGCTGA TCGTTGCCGA CGACATGGGC TATTCCGATG TCGGCGCCTT CGGCGGCGAG ATCGAGACCC CGAACATCGA CGCGCTGGCC CAGCGCGGCC TGAGCGCGAC CAATTTCTAC GTCGCCCCGA CCTGCTCGCC GACCCGCTCG ATGCTGCTCA CCGGCACCGA CAACCACGTC GCCGGCTTTG GCGTCATGTC CGAGTACACG GGGCCGCAGC AGAAGGGCAA GCCCGGCTAC GAAGGCCATC TGAACCAGCG CATGACCAGC ATCGCCACCC TGTTGCGCGA TTCCGGCTAC CACACCTACA TGGCCGGCAA ATGGCACCTC GGCGAGGAGA AGGGACAGTG GCCGGCCGAC CAGGGCTTCG AGCGCGACTT CACCCTGATG CAGGGTGGCG GCAGCCACTG GTCGGATATG GGTTATCCCA ACCCCCAGCA TCCGAACCTG ACCTTCACGC GCAACGGCAA GCTGCTCGAC AAGCTGCCCG ACGACCACTT CTCGACGGCG GCCTTCAGCG ATTTCATCAC GCAGTCGATC GACGAGAACA AGGCCGACGG CAAGCCCTTC TTCGCCTATC TCTCCTACCA GGCGGTGCAT AGCCCCTTCG CCCTGCCGGA CGACTGGATC GACAAGTACA AGGGACGCTA CGACCAGGGC TACGATGCAC TGCGCGCCGA GCGCCTGGCG CGGATGAAGG CGATGGGACT GGTCGGCGCC GACGTCAGCC TGGCACCGCG CATGCCCAAC GTGCCCGCGT GGGACAGCCT GACGCCCGAG CAGAAGAAGA TTTCCGCCCG CAAGATGGAA GTCTATGCGG CGATGGTGGC CAACATGGAC CACCACATCG GCCGCGTCCT CGGCCACCTG AAGGCCAACG GCCAGCTCGA CAACACGCTG GTGCTGTTCT TCTCCGACAA CGGCGCCGAA CCGGTCGAAC TGCTCGAGCT CGCGGCCTCG GTCGATCCGG CGATGAAGGT CTGGCTGGAG AAGAACTGGG ACACCAGGCC GGAAAACTGC GGCCGCAAGA TGTCCGTCTG TGACTACGGC GCCGCCTGGG CCCAGGTCGG CTCGACCCCC TTCAACTACT TCAAGCACTA CACCGCCGAG GGCGGCATCC GCTCGCCGCT GATCGCCGCC GGTCCCGGCG TCGTCTCCGA CGGCCAGACC ACCCGGGCCG TCCTGCACGT GACCGATGTC GTACCGACAC TCCTCGAACT TGCCGGCGTC AGCCACCCGT CGCAGCGCGG AGGAAGCGAT CAGGCACCGC TGACCGGCAA GTCGATGCAG CCTGTCCTCG TCGGCAAGGC GCAGGACATC CGCAGCGCCG ACGAATGGAT CGGCTGGGAG CTGTTCGGCA ACCGCGCCCT GCGCCAGGGC GACTGGAAGG CCCTGTCCCT GCTGAAGGCA GCCGGCGGCA CCGGCGAATG GCAGCTGTAC AACCTGAAGG ACGACCCGAC GGAGTCACGC GACCTGGCGT CCAGCCAGCC GGCCAAGCTC GCGGAGCTGA CCCGGCTGTG GGACCTCTAT GCCAGCCAGA ACGGCGTCAT TCTCACCGGT GACGGTCCGT TCAAGGGCAG GAAATAA
|
Protein sequence | MTIPRNTPNP SPGSRWRKLA ACAGMLMAAG LGLLHGAPQA SAAEPAKKPN ILLIVADDMG YSDVGAFGGE IETPNIDALA QRGLSATNFY VAPTCSPTRS MLLTGTDNHV AGFGVMSEYT GPQQKGKPGY EGHLNQRMTS IATLLRDSGY HTYMAGKWHL GEEKGQWPAD QGFERDFTLM QGGGSHWSDM GYPNPQHPNL TFTRNGKLLD KLPDDHFSTA AFSDFITQSI DENKADGKPF FAYLSYQAVH SPFALPDDWI DKYKGRYDQG YDALRAERLA RMKAMGLVGA DVSLAPRMPN VPAWDSLTPE QKKISARKME VYAAMVANMD HHIGRVLGHL KANGQLDNTL VLFFSDNGAE PVELLELAAS VDPAMKVWLE KNWDTRPENC GRKMSVCDYG AAWAQVGSTP FNYFKHYTAE GGIRSPLIAA GPGVVSDGQT TRAVLHVTDV VPTLLELAGV SHPSQRGGSD QAPLTGKSMQ PVLVGKAQDI RSADEWIGWE LFGNRALRQG DWKALSLLKA AGGTGEWQLY NLKDDPTESR DLASSQPAKL AELTRLWDLY ASQNGVILTG DGPFKGRK
|
| |