Gene Information |
Locus tag | Cthe_0062 |
Symbol | |
ID | 4808757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 93965 |
End bp | 95854 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640105471 |
Product | sulfatase |
Protein accession | YP_001036496 |
Protein GI | 125972586 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
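The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Neither convention is stated in the record, although the listed values are consistent with both. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(93965, 95854)
print(gene_len, protein_len)  # 1890 629
```

Under these assumptions the result matches the Gene Length (1890 bp) and Protein Length (629 aa) fields.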
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000880147 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCGGC TAAAAAATTT CATAAATCAA CGCTTTAACG TTGCAGAAAC CATTTATTGG ATAATGTTTA TTATATTCCT TATGCTGAAG TGCATTTATT TTCAATTTAC AACAAAACTG AATACGGTTC CTTATCTGTC TTCCGTGAAT ATAACCATGT TCCTTTCGTC TTTCGCGGTT TTACTGATAA TATCAGCCTT TATAGCTCTT GTTTTCAACA AGTTCAGATT CATTGCCCTT TTTACAATAA ATTTTCTTCT GACAATTTTG CTGATAGCCG ATACAAACTT TTTCAGATAT TACTACAATT TAATCACCAT ACCTGTATTC TTTCAACTGA ATATCAAGCT TGTGAGTTCC GTCAACGAGA GCATACTGAG TCAATTCATG CTTAAGGATT TAATATATCT TGTTGATTTG CCGTTTATGC TGATCGGAGT TCTGCTTTTG AACAAAAACA CCCGAAAGCT GCATATTTCC CGGAGAGTAT ACCGCTTCGC AGCTTTTCTG GTAGTGGGCA TGGTTACATT TTTATCGGTG TTCCATACTT CGAACTTAAA TTCTTTTGCA TACAGCAACA ACTACTCGGC CAAAAGCCTG GGAGTACTCT TTTCCCATTA CTACAACACA AAACTGTTAA TTGAAGAGAA TCTCCTGGAA GACGACAGCT TCACCCAGGA GGATAAAAAC TCCATTATGG CCCTGTATGA AACAAAGAAG AATGAAAAAG ACAGCCTGGA CAGCAGGCTG AAGGGAATTG CCAAAGATAA AAATCTGATT GTCGTTCAGA TGGAGGCTTT GCAGCAATTT GTAATAAACT TAAAAATTAA CGGTAAAGAA GTAACTCCTA ATTTAAACAG GCTTATCTGC GAAAGCCTTT ATTTTGACAA CGTATTCTAC CAGGTGTCAG GCGGAAATAC TTCCGATGCG GAATTTGTAA CCAACAACTC ACTGTATCCT GCAAAAGAAG GTGCAGTTTA TCATTTATAT CCTGAAAACA CTTATCATTC CCTGGCTAAA ATACTGAAGG AAAAAGGATA CAATACTTAC TCGCTGCACG GTTTTGACAA AACCTTCTGG AACAGGGATG AAATGCACAT GTCCCTCGGC TTTGACAGAT TTTTTAACGA AGAGGATTTT GTACTGGACG ATTTTGCCGG ATGGGATGGT CAGGCTCTCA GCGATTCATC CTTCTTCAGG CAGTCTTTTG ATAAAATAGA CACTACAAAG CCTTTTTACA GCTTTTATAT AACTCTGTCA AGCCATCATC CGTTTACTTA TTTCGAAGAC TATGATTTTG ACGTCGGAGA ATTTGAGGGA ACGTACATAG GCAATTACCT TAAGGCCGCA AATTATCTTG ACAAATGCAT AGGTGAATTT ATAGCTGAAC TTAAGAAACG CGGGCTTTAC GACAACAGCC TGCTGGTATT CTACGGAGAC CACGCTGCTG TAAAAAAGAT TGAGGCCGAC GGACTTATGA AGCTTCTTGA CATGGAGTAC AGTGAGCCGG AATGGATGAA GCTTCAAAAA GTACCCCTTA TAATTCATTA TCCCGATCAG TCAAAACCTG AAGTTATAAG CACCATAGGC GGTCAAATCG ACATTCTTCC GACGATTGCA AATCTTATGG ATTTTGATGC GCCGTATGCT TTGGGAAAAG ATCTTTTGAA CTATGACGAA AATAAAGGTT ACGTTGTCTT AAGAGACGGT TCGGTAGTAA CAAAGGACTT CATATATTTC AATGATTTAA GAGAAGTATA TGATTATGAC ACCGGAAAAT TACTGGATTT AAATCTGTAT 
GACGACAAGA TTACTTCATA CATCAATGAA CTCAATGTAT CGGATATAAT TATAACCAAA GATGCTTTCA AATACGGTTT CGAAAACTAA
|
Protein sequence | MERLKNFINQ RFNVAETIYW IMFIIFLMLK CIYFQFTTKL NTVPYLSSVN ITMFLSSFAV LLIISAFIAL VFNKFRFIAL FTINFLLTIL LIADTNFFRY YYNLITIPVF FQLNIKLVSS VNESILSQFM LKDLIYLVDL PFMLIGVLLL NKNTRKLHIS RRVYRFAAFL VVGMVTFLSV FHTSNLNSFA YSNNYSAKSL GVLFSHYYNT KLLIEENLLE DDSFTQEDKN SIMALYETKK NEKDSLDSRL KGIAKDKNLI VVQMEALQQF VINLKINGKE VTPNLNRLIC ESLYFDNVFY QVSGGNTSDA EFVTNNSLYP AKEGAVYHLY PENTYHSLAK ILKEKGYNTY SLHGFDKTFW NRDEMHMSLG FDRFFNEEDF VLDDFAGWDG QALSDSSFFR QSFDKIDTTK PFYSFYITLS SHHPFTYFED YDFDVGEFEG TYIGNYLKAA NYLDKCIGEF IAELKKRGLY DNSLLVFYGD HAAVKKIEAD GLMKLLDMEY SEPEWMKLQK VPLIIHYPDQ SKPEVISTIG GQIDILPTIA NLMDFDAPYA LGKDLLNYDE NKGYVVLRDG SVVTKDFIYF NDLREVYDYD TGKLLDLNLY DDKITSYINE LNVSDIIITK DAFKYGFEN