Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2462 |
Symbol | |
ID | 7874145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2655086 |
End bp | 2656810 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643699384 |
Product | sulfatase |
Protein accession | YP_002889441 |
Protein GI | 237653127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCAC GGGCGCGTGA TCTTGTGCTC GGCACGCTCC TGCCGCTGCT CGCGCTCGAC GCCTTGCTGG TGTTCGACAA CGCCTGGCCG ACGCTGTGGC CGCGGCCGAC GCCGGCCGTC TCGATCGAGC TCGCCTTCGC CGTCGCCGCA CTGGTGCTCT TCGCCGCCTG GCGGAAACAC GCAGCGCACG GCCTCGTGCG CGTGCTCGCG GGGCTGGCCA CGCTGTGGAT CGTGCTGCGC TACGTGCAGG TGACCGTCCC CGCGCTGTTC GGCCGCCCGC TCAACCTCTA CTGGGACCTG CCCCACCTCG GCGCCGTGCT CGACATGGGC GGCGACGGTC CGGGCGCGAA GGTCCTGCTC GCGCTCGCGC TCGGTATCCT CGTCGTGCTG GTACTGCATC GCGTGGTGTC GGCCTGCGTG CGTGCCCTCG CACGCAGCGC GGCGGCACCG GGCGCGCGTG CGCTGCTCGG CGGTGTCGCG GGCGCGGCAA TCCTGCTGTG GGCGCTGGCG CCGTGGCCCG GCCCGCTCGC CGCGTCGGCC TTCGCGCGGC CGGTCGCCGC GCTGGTCTCC GATCAGATCC GCTTCCTGCA CTCGGCGCTC GCCGCCGACG GCGGCGAGCG GCTCGGCCCC GGCCCCGACT TCCGCGGCGA CCTCGCGGCG CTGCGGGGCG CAGACGTGCT GATCGTGTTC GCCGAGGCCT ACGGCGCGGT CAGCTTCGAT CGACCGCCGA TCACCGCGGC GCTCGCCGAC GCACGCACCG AGCTGGGTGC CGCGATCGCG GCCAGCGGCC GCGAGGCCGT CTCCGCCCGC GTGGTCTCGC CGACCTTCGG CGGCGCCTCC TGGCTGGCGC ACGCCGCCGT GCTCGCGGGG GTCGATACCC GCGACCCGGC CGACCACGCA CTCTTGCTGA CCACCGATCG CCCCACCCTG GTCCGCCACT TCGCCACCCA CGGCTACCGT ACCGTGGGCT GGATGCCAGG GCTGCAGCGC CCCTGGCCGG AGGGCCGCTT CTACGGCTTC GACCGCATCG CCGATGCCGA CAGCGCGGGC TACGCCGGCC TGCCCTTCGG CTTCTGGCGC ATCCCCGACC AGGCCTCGAT GGCGCGCATC CACGTGGACG AGCTGGGCGG CAGCTTCGGC GAGGCAGTCG GTACGCAGCA GGCCCGCTCC AGCCCGGGCT CCTCCCGGGC TTCTGCGGGC GACACCTCCG CCAGCGCGCC CACTTCCCGT CCGGGCGCAC GCGCGGAGAC CGCTGCCGCG CGTCGCGCGC CACGGCTGGT CGTTTTCGCC ACCGTCTCCA CGCACGCCCC CTTCGCAGCG ATCCCACCCT TGCGCGAGGA CTGGTCGCGG CTGCTGCGCG CGGACGCCTT CAGCCAGGAG GAGGTCGAAG CCGCGGCGGC CGTGCGGGTG TCCTGGACCG AGCCGCTCCC CGCTTATCTG GCCTCGATGC GTTACCAGCT CGGCTGGCTC GCCGACTACC TCGCCCATCA CGCCGCGCGG GAGCTGGTCC TGATCGTGAT CGGCGACCAC CAGCCGATCG GCACGGTGAG CGGGCCGGAC CAGCCGCACG ACGTGCCGGT GCATGTGATC GCCTCCGACC CCGCGCTGCT CACCCGCTTC GCGGCCGCCG GCTTCGTCGC CGGCCTGACG CCGCCACAGC AACCCCTCGG CCCGATGCAT GAGCTTGCCC AGGTGCTGGT GGATGCCTTC TCGGGCCCGC GGTGA
|
Protein sequence | MSARARDLVL GTLLPLLALD ALLVFDNAWP TLWPRPTPAV SIELAFAVAA LVLFAAWRKH AAHGLVRVLA GLATLWIVLR YVQVTVPALF GRPLNLYWDL PHLGAVLDMG GDGPGAKVLL ALALGILVVL VLHRVVSACV RALARSAAAP GARALLGGVA GAAILLWALA PWPGPLAASA FARPVAALVS DQIRFLHSAL AADGGERLGP GPDFRGDLAA LRGADVLIVF AEAYGAVSFD RPPITAALAD ARTELGAAIA ASGREAVSAR VVSPTFGGAS WLAHAAVLAG VDTRDPADHA LLLTTDRPTL VRHFATHGYR TVGWMPGLQR PWPEGRFYGF DRIADADSAG YAGLPFGFWR IPDQASMARI HVDELGGSFG EAVGTQQARS SPGSSRASAG DTSASAPTSR PGARAETAAA RRAPRLVVFA TVSTHAPFAA IPPLREDWSR LLRADAFSQE EVEAAAAVRV SWTEPLPAYL ASMRYQLGWL ADYLAHHAAR ELVLIVIGDH QPIGTVSGPD QPHDVPVHVI ASDPALLTRF AAAGFVAGLT PPQQPLGPMH ELAQVLVDAF SGPR
|
| |