Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1428 |
Symbol | |
ID | 4810578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1748095 |
End bp | 1749423 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106851 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037852 |
Protein GI | 125973942 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.574986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACTT TTAAGCTTAA TGATGAATTC ATGTTTGGAA CCGCTACCGC AAGTACTCAG ATTGAGGGTG GGGATACGGG AAATACATGG TATAAGTGGT GCCAGGAAGG ACGTATCAAG GACTCCAGCA GCTGTATCAC TGCATGTGAC CATTGGAACA GGGTGGAGGA GGATACGGAG CTGTTGAAGA ACTTGGGAGT TCAAACCCAC AGAATGAGTC TTGAGTGGAG CAGAATAGAG CCTTCCAGGG GCAAATTTTC CGATGACGCA ATGAAACATT ACAGAGATGA GATTAAGCTT TTGGTGGAGA ACAACATAAA GCCTCTGGTT ACGCTTCATC ACTTTTCCGA GCCCATTTGG TTTCATGAAA TGGGGGGATG GAAAAAAACG GGCAATGCAG ATATTTTTAT AGAATATGTG AAGTATGTGG TTGAAAATTT GGGTGACCTT GTAAGCGACT GGGTAACCTT TAACGAGCCC AATGTCTATG TTGATTTTGG TTATGTAATC GGCATTTTCC CTCCGGGGGA AAGAAGCCTG TCTGAAGGGT TAAAGGTTAC GGCAGAGCTT ATAAACACCC ATGTAAAACT ATACCGGCTG ATACATAGGA TAAGAAGAGA GCGCAAATTT GCAGGCAGGA CAATGGTAGG AACGGCAATG CACCTTCGCA TCTTTGACGG GATAAGTTCT ACCGGAAAAA TGATAGCCAA AGTTGTAGAT TATCTGTTTA ACGAAATGTT TATGGAAGGC ATGACGACAG GGCACATGAT GTTTCCTCTT TCCAAAAAGG GTTCAAGCCA TAAAAAAGGC AGGTATGCGG ATTTTTTGGG AATTAATTAT TATACAAGAA ATATTGTTGA GTTCGTATTT GACCCGTCCC TTTATTTTCA CGAGCTTGTA TGTGACAAGG ATTTGACCAA ATCGGACCTC GGGTGGGACA TATATCCGGA AGGCATATAC AAAGTATGCA AGAGGTACTA TAAGAAATAT AAACTTCCCA TTTATATAAC CGAAAACGGA ATAAGCGATA AAAATGACAC CAAACGGCCG AGCTTTATTG CCAGCCATCT TGCTTATATT GCAAAAGCCA TAAAAGAAGG GATTCCGATA GAACGGTATT ATTACTGGAC GCTGATGGAT AACTTCGAAT GGCTTGAAGG TGAGTCAACG GATTTCGGCC TTTACGACTG CAATTTCCGC ACGCAGGAGA GGATACCGAG AAAAAGCGTC CGGCTTTATG AGCAAATATG CAGAAGAAAA GAATTAACCG CGGAGATGAT TGAGGATTTT AAGAAGTACA GCGGGATTAC TATAGAAACA ATCCGGTGA
|
Protein sequence | MVTFKLNDEF MFGTATASTQ IEGGDTGNTW YKWCQEGRIK DSSSCITACD HWNRVEEDTE LLKNLGVQTH RMSLEWSRIE PSRGKFSDDA MKHYRDEIKL LVENNIKPLV TLHHFSEPIW FHEMGGWKKT GNADIFIEYV KYVVENLGDL VSDWVTFNEP NVYVDFGYVI GIFPPGERSL SEGLKVTAEL INTHVKLYRL IHRIRRERKF AGRTMVGTAM HLRIFDGISS TGKMIAKVVD YLFNEMFMEG MTTGHMMFPL SKKGSSHKKG RYADFLGINY YTRNIVEFVF DPSLYFHELV CDKDLTKSDL GWDIYPEGIY KVCKRYYKKY KLPIYITENG ISDKNDTKRP SFIASHLAYI AKAIKEGIPI ERYYYWTLMD NFEWLEGEST DFGLYDCNFR TQERIPRKSV RLYEQICRRK ELTAEMIEDF KKYSGITIET IR
|
| |