Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0405 |
Symbol | |
ID | 4808408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 505876 |
End bp | 507456 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105819 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036836 |
Protein GI | 125972926 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000478388 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAAAGG TTAAGGCCTT GTTGTTGGGA TTGATTGTAT TGGCTGTAGC TTTGTTACCT ACAGTGTCCT TTAAGTCACC GACTGTTGCG GCCGATCCGA ACAATGACGA CTGGCTGCAT GTTGAAGGTA ACAAAATAGT GGACATGTAC GGTAATCAGG TCTGGCTGAC CGGCTGCAAC TGGTTTGGAT TCAATACCGG TACCAATGTG TTTGACGGAG TATGGAGCTG CAATATGAGA GAAGCCCTCA AGGGTATGGC GGACAGAGGA ATAAATTTTT TGAGAATACC TATTTCAACA GAATTGCTGT ATCAATGGTC TCAAGGAATA TATCCCAAAG CAAATGTTAA TGATTTTGTA AATCCGGAGC TGAAAGGAAA GAACAGCCTT GAGCTTTTTG ACTTTGCCGT TCAGTGCTGC AAAGAATTCG GAATAAAGAT AATGGTGGAT ATACACAGTC CGGCAACAGA TGCCATGGGG CATATGTATC CTTTATGGTA TGACGGTCAA TTTACAACAG AGATATGGAT TTCAACTTTG GAGTGGTTGA CGGAAAGATA TAAAAATGAT GACACAATTC TTGCACTGGA CCTTAAAAAT GAGCCTCACG GCACCCCGGG CAGCGAATTA ATGGCCAAAT GGGATGGTTC CACGGATTTG AACAACTGGA AGCATGCTGC TGAAACATGC GCAAAGAGAA TCCTTGCAAT AAATCCGAAT ATTCTTATTG TGGTAGAAGG AGTGGAAGTT TATCCAAAGC CTGGCTATGA TTATACCGCA GTGGACGAAT GGGGAAAAGA GAGTAAATAT TTCTATAACT GGTGGGGAGG AAATTTAAGA GGAGTCAGGG ATTATCCCAT TGACCTTGGC AAGCATCAGA AGCAGCTTGT ATACTCACCT CACGATTACG GTCCCCTCGT ACATAAACAA CCTTGGTTCT ATGAAGGCTT TAACAAAGAA ACTTTGTATA ATGATTGCTG GAGAGATAAC TGGGCATACA TACACGAGGA AAACATCGCT CCTCTGATAG TGGGTGAATG GGGAGGTTTC ATGGACCGCG GAGACAACGA GAAATGGATG AAAGCGCTGA GAGATTATAT GATTGAGAAT AAAATATCCC ACACTTTTTG GTGCTATAAT GCAAATTCCG GTGATACCGG AGGACTTGTA TACTATGATT TTATTACCTG GGACGAAGAA AAATATGCTC TTCTGAAGCC TGCATTATGG CAGACAGAGG ACGGAAAGTT TATAGGCCTT GACCATCAGA TACCTCTTGG TTCAAATGGA ATTACCGTAA CTGAATATTA TGGCGGCTAT ATTCCGGAAC CGTCACCGAC TGCTACTGTT CCAGACGTAC CGACACCGTC GCATTCTTTC GAAATAGAGA AGGGGGATGT AAACGGTGAC GGTAATGTTA ATTCAACAGA TGTTGTATGG CTTAGGAGAT TTTTGCTAAA ATTGGTCGAG GATTTTCCTG TACCTTCCGG AAAACAGGCG GCGGATATGA ATGATGACGG GAATATCAAT TCTACCGATA TGATAGCCTT AAAGAGGAAA GTGCTTAAAA TACCAATATA A
|
Protein sequence | MRKVKALLLG LIVLAVALLP TVSFKSPTVA ADPNNDDWLH VEGNKIVDMY GNQVWLTGCN WFGFNTGTNV FDGVWSCNMR EALKGMADRG INFLRIPIST ELLYQWSQGI YPKANVNDFV NPELKGKNSL ELFDFAVQCC KEFGIKIMVD IHSPATDAMG HMYPLWYDGQ FTTEIWISTL EWLTERYKND DTILALDLKN EPHGTPGSEL MAKWDGSTDL NNWKHAAETC AKRILAINPN ILIVVEGVEV YPKPGYDYTA VDEWGKESKY FYNWWGGNLR GVRDYPIDLG KHQKQLVYSP HDYGPLVHKQ PWFYEGFNKE TLYNDCWRDN WAYIHEENIA PLIVGEWGGF MDRGDNEKWM KALRDYMIEN KISHTFWCYN ANSGDTGGLV YYDFITWDEE KYALLKPALW QTEDGKFIGL DHQIPLGSNG ITVTEYYGGY IPEPSPTATV PDVPTPSHSF EIEKGDVNGD GNVNSTDVVW LRRFLLKLVE DFPVPSGKQA ADMNDDGNIN STDMIALKRK VLKIPI
|
| |