Gene Information |
Locus tag | Cthe_0174 |
Symbol | |
ID | 4808662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 209822 |
End bp | 211714 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640105585 |
Product | sulfatase |
Protein accession | YP_001036608 |
Protein GI | 125972698 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
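The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 209822
END_BP = 211714


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count, assuming the CDS ends in one uncounted stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1893 630, matching Gene Length and Protein Length
```

Under these assumptions the computed values agree with the Gene Length (1893 bp) and Protein Length (630 aa) fields of the record.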
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.326693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCA GTATAGAAAG AGCAAGAAGC AATATTTTTA CCAATACAAG ACCGCAGCTG GATGTGTTTG GAATAATTTA TACTGTGCTT TTTGTATGTT CCATTGTATT TAAAGGGGTG TTTCTCCAAT TTCAGAACCA AATTAATTTC AAACCTCTTT TTTCAACCAC AAATATTTTC ATGTTTGTTG CTTCAATGTC TTTTACATTG GTACTGGCGG CTTTGTTGAC CGTTTTTCAC ACAAAGAGAA GAGTGTTGTT TTTCATATCC AACATTTTAA TGTCGGTTTT GCTTCTTTCT GATGCTCTGT ATTTGCGCTA TTACAACACG ATAATAACAA TACCGGTAAT TTATAATGCC CGATATTTGG GGCCGGTCAG AGAGAGTATC ATGAGTCTTT TCAGGTTTAG CGACATATTC TATTTTTTGG ATATTCCTGT TTTTGCAGTA ATGTCGTTTA TATTTTCCAA ACGGGCTGAA CAGAACAAGC TTCCGTTGCT GAAAAGATGC GTAGTGGCCG CAGTGCTGAT GGTAGTAGCT TTTGGCTCTT TTAAAATAGC ATACAGCAAA AATGACATGT CCGAGTACGA CAACAATTAT ATTGTGAAGA ACTTTGGCAT AGGTTATTTC CACTATTATG ATGTGAAAAA ATATTTAAAG GAAAATTATC TTAGGGATAA AAAACTTAGA ACTGAGGAGA AAAATGAACT GACATCCTTC TTTGAAGAAA AAAACAAGGA AAAAGCCGCA CTTTCCAATA GATTTAAGGG AATAGCAAAA GGGAAAAACC TTATTATTGT TCAGATGGAG GCTCTTCAGC ATTTTGTTAT CAACAGCAAA ATGAACGGCA GGGAAATAAC TCCCAACTTA AACAAGCTTG TAAAGGAAAG TCTGTATTTT GACAATATCT ATGTGCAGGT GGCAGGGGGT AATACGTCGG ATGCCGAGTT TATGACCAAT ACTTCATTGT ACCCTGCAAA AGAAGGTGCT GCCTATTTTA GATTTGCAAC AAACGAGTAT AACACCATTC CCAAGGAATT AAAGAAAGAA GGCTATAATT CCTACGCTTT GCATGCATAC GGACCTGCAT TCTGGAACAG AACCGAAATG TACAAAGCTA TAGGATTTGA TACTTTTATA AGCTCTAACG ACTATGTTAT GGACGAATAT ATAGGCTGGG GAGGCTGGGC GTTAAGTGAC GATTCGTTTT TCAGACAGTC TCTGGAGAAA ATTGATGTCA CCAAACCGTT CTATTCATTT TTCATAACTC TTTCCGGTCA TCATCCTTAT TCCTATTTTG AGGATAAACA AACCTTTGAT GTCGGAAAAT ATGACAGGAC TTATTTCGGC AACTATATTA AGGCTCAGAA CTATGCTGAT GCCGCTCTTG GCCGTTTTAT AGAAAGGCTT AAAGAAATGG GTCTTTATGA GAACAGCCTT ATTGTCATCT ACGGCGACCA TACAGGTCTT CCCAAGACTC AGGCAAAAGA ACTTCTGGAA TTTTTGGGAG TGGACGACAA CAAGGTTGAC TGGATAAAGC TTCAAAAGAT ACCTTTGCTG ATACATTGTC CGGGGGTGAA AGGAGAAACC ATTAGCACCA CCGGCGGACA GGTGGATATA TTCCCGATGA TTGCCAATAT GATGGGATTT GAAAACTATT ATGCGTTGGG CAAAGACCTG CTGAACACCG AAAAAGGTTA TGCGGTGCTG AGAAACGGTT CAGTGCTCAC GGATGACTAT TACTACTGCA GTGAGGATGA TACCGTTTAT GATTTGAGAA GCGGTGAGGT TCTTGACAAG 
AAGGACTATG AAGATGAGAT ACAAAAATAT CAAAAAGAAC TTCAAATATC CGACATAATT CTGGAAAAAG ATGCACTGCG GAAGTTGAAA TAA
|
Protein sequence | MKISIERARS NIFTNTRPQL DVFGIIYTVL FVCSIVFKGV FLQFQNQINF KPLFSTTNIF MFVASMSFTL VLAALLTVFH TKRRVLFFIS NILMSVLLLS DALYLRYYNT IITIPVIYNA RYLGPVRESI MSLFRFSDIF YFLDIPVFAV MSFIFSKRAE QNKLPLLKRC VVAAVLMVVA FGSFKIAYSK NDMSEYDNNY IVKNFGIGYF HYYDVKKYLK ENYLRDKKLR TEEKNELTSF FEEKNKEKAA LSNRFKGIAK GKNLIIVQME ALQHFVINSK MNGREITPNL NKLVKESLYF DNIYVQVAGG NTSDAEFMTN TSLYPAKEGA AYFRFATNEY NTIPKELKKE GYNSYALHAY GPAFWNRTEM YKAIGFDTFI SSNDYVMDEY IGWGGWALSD DSFFRQSLEK IDVTKPFYSF FITLSGHHPY SYFEDKQTFD VGKYDRTYFG NYIKAQNYAD AALGRFIERL KEMGLYENSL IVIYGDHTGL PKTQAKELLE FLGVDDNKVD WIKLQKIPLL IHCPGVKGET ISTTGGQVDI FPMIANMMGF ENYYALGKDL LNTEKGYAVL RNGSVLTDDY YYCSEDDTVY DLRSGEVLDK KDYEDEIQKY QKELQISDII LEKDALRKLK
|
| |
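The GC content field can in principle be recomputed from the gene sequence shown above. The sketch below assumes the usual definition, (G + C) / total bases, and removes the spaces that separate the 10-base blocks; how the database rounds the reported percentage is not stated in the record, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used only as a short demo.
example = "ATGAAGATCA GTATAGAAAG"
print(f"{gc_content(example):.0%}")
```

Passing the full gene sequence string to `gc_content` would give the value to compare against the GC content field.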