Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2760 |
Symbol | |
ID | 4810263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3255101 |
End bp | 3257986 |
Gene Length | 2886 bp |
Protein Length | 961 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108180 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001039152 |
Protein GI | 125975242 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.26535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATT CACCACAAAA GAGGATTTTG AAACGAAAAA ACAGGTGTAA AGGGCTTATA ACGGCTGTCA TCATTGCAGC TCAGTTACTA ACGTCGAGCA TATTATTTGC AGAAGCACCA CCTGCGACTT TTACACCTGC TGAAAACTGG GAGGATTATG ATTACTTCAA CTTTGCCGAA GCCCTGCAGA AGTCGCTGTA TTTTTATGAC GCTCAGAAAT GCGGAGTTGA GGCAGGCTAC GACCACGGTG GAAGGCTTGA GTGGAGAGGG GCGTGTCATG AAGTCGATGA AAGAATTCCT ATTTCCAACA CATCTCTTTC TGAAGCTTTT CTGGCAAAAT ACAGACATAT AATCGACCCT GACGGAGACG GTACTGTGGA TGTACACGGA GGCTTCCATG ATGCAGGTGA CCATGTCCGT TTCGGTCTTC CCCAAAGCTA TACGGCAGGT ACTTTGGGCT GGGGATTCTA TGAATTCAGA GAATCCTTCA GGGCAATCGG GGAAGAAGAG CACATGATTG AGATTTTAAG ATACTTTACG GATACTTTTT TACGCTGTTC ATTTATGGAT GAGGAAGGCA ATATTGTTGC TTTTTGCTAC ATGGTTGGTG AAGGAGACGA AGACCATTGC TATTGGGGAC CTCCGGAGTT ATACCCTGAA GAATACTTAA GAAGCAGACC GGCGGATTTT GCAACTTTTG ATGATCCCGG AAGCGATGTT TGTGCAAGCA CGGCTGCAGC ACTCTGCACA TCATATTTGA ATTTTAAGGA TGAAGACCCT GAATACGCTG AAAAATGCCT TACCGTTGCC AAGGCACTGT ATGATTTTGC AGTTAAGTAC AGAGGATTGC ATAAAGGCGA CGGTTACTAC ACCTCGGATT ATGACGAAGA TGAATTGGCA TGGGCTGCTG TCTGGCTTTA TGAGTGCACC GGAGACATGA AATACATAAA TGACATAGTA GCTGTTGATG AAACCGGCAA TTATGTGGGA TATATGAAAA GGATAATACC CGACACCTTT AAACAGAACG TTTGGTATAA TTCATGGGTT CACTGTTGGG ATGCCGTATG GGGAGGAACT TTCATAAAAC TGAACGAGCT GTTCCCTGAA AATGAATTGT TTGACTTTAT TGCAAGATGG AATGTTGAGT ATTTGTCAGG CGGAAAATGC CCACACGAGG ATCCTAATGA TCATAATTAT TGTAAACCAT CTCCTGCCGG CTACACAATG ATAAACGGCT GGGGTTCTGC GCGTTACAAT GCCGCTGCGC AATTATGCGC TCTTGTATAC ATGAAAAACA ATCCGGACAG GACAGATTTC GGCGAGTGGG CAAAAAGTCA AATGGAATAC CTTATGGGAA GAAATCCTAT GGGTTATTCG TACATAGTCG GTTACGGATA TGAAAAAGGC TTGCCTTTTG CAAAGCATCC GCACCACAGA GCGGCTCACG GCTCAAAAAC AAACAGCATG AACGATCCTG AGGAGCATCG CCATATATTG TGGGGGGCTC TTGTGGGAGG ACCCGATTTG AATGATTATC ACATTGACTC AACTACCGAG TACGCTTATA ATGAGGTGGC AGTTGACTAT AACGCTGCCT TTGTAGGCGC GCTGGCAGGA CTGTATAAAT ATTATGGACA GGGACATGAA CCTATTCCGA ATTTCCCGCC GCTAGAGCCG GAAACCGACG ATTATTTCTG CGAAGCAAAA ATTGTCCGTG AAACTAAAGA CAGTACACAA GTTCTTTTAA GAATTCATAA TGAATCGACC CGGCCTCCTC ATTATGAAAC AGGAATGATG GCAAGATACT TCTTCAATAT AAGCGAGCTT ATTGAAAACG GTCAAAGTAT AGATGATGTA ATATTTACTA TTGAATATGA TGAACAGATT TCCATGCAGC AGGAACCGGT TGTATACAGA GGACCTTTTA AATGGGATGA TGCAGGAACA TACTATTTTG AATTTGACTG GAGCGGAAGA AAAATTTACG GAGACAGGGA GCTTCAGATT TCCTTCAGAG TTAAACAGGA TTCGAATTAC ATGACCCATT GGGACTCCAG TAATGACTAT AGCAGGCAAG GCCTTACAAA TGAATATGCA ATATCCAAGA ATGTGCCCGT ATATCTGAAT GGTGTAAAAG TTTACGGTGA AGAACCGCCA AAGCTTTCTC CGACTCCGAC TCCAACAATT GATCCCAGCC AAACTCCGGA TGCTAATGCT TCAATCAGTG TATCATACAA GTGCGGAGTT AAGGATGGTA CGAAAAATAC TATAAGAGCT ACAATAAATA TTAAGAATAC CGGAACCACT CCTGTGAATT TATCGGATAT CAAGGTTCGA TACTGGTTTA CAAGTGACGG AAACGAACAG AATAACTTTG TGTGCGATTA TGCGGCTTTT GGAACGGACA AAGTAAAAGG TATTGTGAAA AAGATAGAAA ACTCTGTCCC TGGTGCTGAT ACGTATTGTG AAATCTCATT TACTGAGGAT GCGGGTAGGC TTGCACCCGG AGGAAGCACA GGAACAATAC CTTTCAGAAT TGAGGGTGCG GCAGAGTATG ACCAGACAGA TGATTATTCC TATAATTCTG AAATGTCAGA TGATTTTGGG GATAACACCA AGATTACTGC TTATATAAAA GATAAACTCA AATATGGAGT TGAGCCTGTT ACAATAATTG ATATTACATT GGGTGACCTG AACTATGACG GTAAAGTTAA CTCTACAGAC TATTTAGTTT TGAAAAGGTA TTTGCTTGGA ACAATTGACA AAGAATCAGA TCCTAACTTC CTGAAAGCCG CAGATCTTAA CAGGGATGGA CGTGTTAATT CGACAGACAT GTCGTTAATG AAACGTTATC TTCTTGGCAT AATAACGTCT TTTTAG
|
Protein sequence | MTDSPQKRIL KRKNRCKGLI TAVIIAAQLL TSSILFAEAP PATFTPAENW EDYDYFNFAE ALQKSLYFYD AQKCGVEAGY DHGGRLEWRG ACHEVDERIP ISNTSLSEAF LAKYRHIIDP DGDGTVDVHG GFHDAGDHVR FGLPQSYTAG TLGWGFYEFR ESFRAIGEEE HMIEILRYFT DTFLRCSFMD EEGNIVAFCY MVGEGDEDHC YWGPPELYPE EYLRSRPADF ATFDDPGSDV CASTAAALCT SYLNFKDEDP EYAEKCLTVA KALYDFAVKY RGLHKGDGYY TSDYDEDELA WAAVWLYECT GDMKYINDIV AVDETGNYVG YMKRIIPDTF KQNVWYNSWV HCWDAVWGGT FIKLNELFPE NELFDFIARW NVEYLSGGKC PHEDPNDHNY CKPSPAGYTM INGWGSARYN AAAQLCALVY MKNNPDRTDF GEWAKSQMEY LMGRNPMGYS YIVGYGYEKG LPFAKHPHHR AAHGSKTNSM NDPEEHRHIL WGALVGGPDL NDYHIDSTTE YAYNEVAVDY NAAFVGALAG LYKYYGQGHE PIPNFPPLEP ETDDYFCEAK IVRETKDSTQ VLLRIHNEST RPPHYETGMM ARYFFNISEL IENGQSIDDV IFTIEYDEQI SMQQEPVVYR GPFKWDDAGT YYFEFDWSGR KIYGDRELQI SFRVKQDSNY MTHWDSSNDY SRQGLTNEYA ISKNVPVYLN GVKVYGEEPP KLSPTPTPTI DPSQTPDANA SISVSYKCGV KDGTKNTIRA TINIKNTGTT PVNLSDIKVR YWFTSDGNEQ NNFVCDYAAF GTDKVKGIVK KIENSVPGAD TYCEISFTED AGRLAPGGST GTIPFRIEGA AEYDQTDDYS YNSEMSDDFG DNTKITAYIK DKLKYGVEPV TIIDITLGDL NYDGKVNSTD YLVLKRYLLG TIDKESDPNF LKAADLNRDG RVNSTDMSLM KRYLLGIITS F
|
| |