Gene Information |
Locus tag | Cthe_0578 |
Symbol | |
ID | 4808253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 706080 |
End bp | 708290 |
Gene Length | 2211 bp |
Protein Length | 736 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105992 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037007 |
Protein GI | 125973097 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
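The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 706080
END_BP = 708290


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2211 736
```

Both values agree with the Gene Length and Protein Length rows of the record.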
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAC TCATTATCAC TGTTATAGTA TCTGCTGTCC TTTTAACTGC TCTTATACCG CAGTTGCCTG TTTTTGCAGC AGACTATAAC TATGGAGAAG CACTCCAAAA AGCAATTATG TTCTATGAAT TTCAAATGTC CGGAAAGCTT CCCGACAACA TCCGTAACAA CTGGCGCGGT GATTCATGTC TCGGAGACGG AAGCGATGTA GGTCTTGACC TCACAGGAGG TTGGTTTGAC GCCGGTGACC ATGTAAAATT CAATCTGCCT ATGGCTTACA CAGCCACTAT GCTTGCATGG GCTGTGTATG AGTACAAGGA CGCGTTACAA AAAAGCGGTC AATTGGGCTA TTTAATGGAT CAGATTAAAT GGGCATCGGA CTACTTCATA AGATGCCATC CCGAAAAATA TGTATATTAT TATCAAGTGG GTAACGGTGA CATGGACCAC AGATGGTGGG TGCCGGCAGA ATGTATAGAT GTTCAGGCAC CAAGACCGTC TTACAAAGTA GATCTGTCAA ATCCCGGTTC CACAGTTACT GCGGGTACAG CTGCCGCACT TGCTGCAACT GCCTTGGTAT TCAAAGACAC TGATCCGGCA TATGCCGCTC TGTGCATACG TCATGCAAAA GAACTCTTTG ATTTTGCTGA AACCACTATG AGTGATAAAG GATATACCGC AGCATTGAAT TTCTACACAT CTCACAGTGG ATGGTATGAC GAGCTTTCCT GGGCAGGTGC ATGGATTTAT CTTGCAGACG GTGACGAAAC TTATCTTGAA AAAGCTGAAA AGTATGTGGA TAAATGGCCA ATCGAAAGCC AGACAACTTA CATTGCTTAT TCATGGGGTC ACTGCTGGGA CGACGTTCAC TACGGAGCAG CACTTCTTTT GGCAAAGATT ACAAACAAAT CCTTATACAA AGAAGCGATA GAAAGACACC TGGACTATTG GACAGTTGGA TTTAATGGTC AGAGAGTCAG ATATACACCA AAGGGTCTTG CTCACCTCAC TGACTGGGGT GTATTAAGAC ATGCCACTAC TACTGCATTC CTTGCATGTG TTTATTCCGA CTGGTCAGAA TGTCCAAGGG AAAAAGCCAA TATTTACATA GATTTTGCCA AGAAACAGGC TGACTATGCC TTAGGCAGCA GCGGCAGAAG TTATGTAGTC GGATTTGGTG TAAATCCTCC GCAGCATCCG CACCACAGAA CTGCCCACAG CTCATGGTGT GACAGTCAAA AAGTTCCTGA ATACCACAGA CACGTTCTTT ACGGAGCACT CGTAGGCGGA CCTGATGCCA GCGATGCTTA TGTTGATGAT ATAGGAAACT ATGTAACAAA TGAGGTTGCC TGCGACTACA ATGCCGGTTT TGTAGGATTG CTCGCCAAGA TGTATGAAAA ATATGGCGGA AACCCCATAC CAAACTTCAT GGCTATAGAA GAAAAAACAA ATGAAGAAAT TTATGTTGAA GCTACCGCCA ATTCAAATAA CGGTGTCGAA TTGAAAACAT ACCTTTACAA TAAATCCGGA TGGCCGGCAA GAGTTTGCGA CAAGCTTTCC TTCAGATATT TCATGGACCT TACGGAATAT GTATCCGCCG GATACAATCC TAATGATATA ACTGTTTCTA TAATTTACAG TGCAGCACCA ACTGCAAAAA TTTCAAAACC AATACTTTAT GACGCATCCA AAAACATATA TTATTGCGAA ATCGATCTCT CCGGTACCAA GATATTCCCC GGAAGCAACT CAGACCACCA GAAAGAAACC CAATTTAGAA TACAGCCTCC TGCAGGCGCA 
CCTTGGGACA ACACCAACGA CTTCTCCTAT CAGGGAATCA AGAAAAACGG TGAAGTTGTA AAAGAAATGC CTGTTTATGA AGACGGAATT CTCATATTCG GTGTAGAACC CAATGGTACC GGTCCTGCAA CACCAACGCC GAAACCGTCC GTAAATCCTT CACCTTCACC TACGCCAACA TCGGATATTC TTTACGGTGA CATCAATCTG GACGGAAAAA TTAACTCTTC AGATGTTACA CTGTTAAAAA GATATATTGT GAAGTCCATA GATGTTTTCC CAACCGCTGA TCCGGAACGG AGCTTAATAG CATCAGATGT AAACGGAGAC GGAAGGGTAA ACTCTACAGA CTATTCATAC CTTAAACGTT ATGTCTTGAA AATCATACCA ACCATACCCG GAAATTCATG A
|
Protein sequence | MKKLIITVIV SAVLLTALIP QLPVFAADYN YGEALQKAIM FYEFQMSGKL PDNIRNNWRG DSCLGDGSDV GLDLTGGWFD AGDHVKFNLP MAYTATMLAW AVYEYKDALQ KSGQLGYLMD QIKWASDYFI RCHPEKYVYY YQVGNGDMDH RWWVPAECID VQAPRPSYKV DLSNPGSTVT AGTAAALAAT ALVFKDTDPA YAALCIRHAK ELFDFAETTM SDKGYTAALN FYTSHSGWYD ELSWAGAWIY LADGDETYLE KAEKYVDKWP IESQTTYIAY SWGHCWDDVH YGAALLLAKI TNKSLYKEAI ERHLDYWTVG FNGQRVRYTP KGLAHLTDWG VLRHATTTAF LACVYSDWSE CPREKANIYI DFAKKQADYA LGSSGRSYVV GFGVNPPQHP HHRTAHSSWC DSQKVPEYHR HVLYGALVGG PDASDAYVDD IGNYVTNEVA CDYNAGFVGL LAKMYEKYGG NPIPNFMAIE EKTNEEIYVE ATANSNNGVE LKTYLYNKSG WPARVCDKLS FRYFMDLTEY VSAGYNPNDI TVSIIYSAAP TAKISKPILY DASKNIYYCE IDLSGTKIFP GSNSDHQKET QFRIQPPAGA PWDNTNDFSY QGIKKNGEVV KEMPVYEDGI LIFGVEPNGT GPATPTPKPS VNPSPSPTPT SDILYGDINL DGKINSSDVT LLKRYIVKSI DVFPTADPER SLIASDVNGD GRVNSTDYSY LKRYVLKIIP TIPGNS
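The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11, where GTG can act as a start codon. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a minimal example, not the database's own pipeline: the set of start codons is a deliberately reduced assumption (ATG, GTG, TTG), the helper names are hypothetical, and the sequence is assumed to be given in coding orientation, as it appears in the record.

```python
# Codon-to-amino-acid mapping shared by the standard code and table 11,
# built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a reduced set of bacterial start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, reading a start codon as M."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
prefix = "GTGAAAAAAC TCATTATCAC TGTTATAGTA"
print(translate(prefix))  # MKKLIITVIV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` would give the value to compare against the GC content row.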