Gene Information |
Locus tag | Cthe_0745 |
Symbol | |
ID | 4810363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 910112 |
End bp | 912304 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640106162 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037173 |
Protein GI | 125973263 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
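The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 910112
end_bp = 912304
gene_length_bp = 2193
protein_length_aa = 730

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of n bases holds n / 3 codons; the final codon is assumed to be the stop codon.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with each other: the span is 2193 bp, which is 731 codons, or 730 residues plus one stop codon.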
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTTTAAGCCT CATTTTAGTC GTTTCTTTTC TGATTATGCT GCTGACTCCT TCGACAAAGA TATCTGCAGC AACTACATTC AACTACGGAG AAGCCCTTCA GAAAGCCATA ATGTTCTATG AATTCCAGCG TTCCGGAAAA CTGCCTTCCA CCATCAGGAA CAACTGGAGA GCGGATTCCT GTCTGGACGA CGGCAAAGAT GTGGGGCTTG ACCTTACAGG CGGTTGGTTT GACGCCGGCG ACCACGTCAA GTTCAACCTT CCAATGGCAT ATACCGCCGC AATGCTCGCG TGGGCAGTAT ATGAAGAAAA GGACGCTTTT GTGAAAAGCG GGCAGCTTAA ATACATACTG GATGAAATCA AATGGGCAAC GGATTACTTC ATCAAATGTC ATCCCGAACC CGATGTGTAT TACTATCAGG TGGGAGACGG AGATATTGAC CACATGTGGT GGGGACCTGC GGAAGTCGTA CACTTGAGAA CAAAAAGACC TTCATACAAA GTAGATATTA CTTCACCGGG TTCTACCGTG TCGGCAGAAA CCGCGGCAGC CCTGGCTGCG GCATCAATAG TGTTTAAAGA TACTGATCCC CAATACTCAA ATCTGTGCTT AAAACACGCA AAGGAACTTT TCAACTTTGC CGACAAAACA AGAAGTGATG CCGGTTATAC AGCGGCAACA AACTTTTACA CATCCCACAG CGGATTCTAT GACGAACTTA CATGGGCGGC TACATGGATT TATCTTGCAA CAGGGGACAC AAGCTATCTT GACAAAGCCG AATCCTATGT TGAATTTTGG AGTACCGAAC CTCAGACAGA TATAATGTCC TACAAGTGGG GACACTGTTG GGATGACGTG CGCTACGGAG CACAGCTTCT TTTGGCAAGG ATTACAAACA AGCCGATATA CAAGGAAAGC ATGGAAAGAC ATTTGGATTA TTGGACAGTC GGAGTTGACA ATTCAAGGAT AAAATATACG CCCAAAGGTT TGGCATGGCT TAATAACTGG GGTTCTTTGA GATATGCCAC CACCACTGCG TTTCTTGCGG CTGTTTATGC CGATTGGGAA GGATGCAGCC CGCAAAAGGC GAAAATATAC AACGATTTTG CAAAGGCTCA GGTTGACTAC GCACTGGGCA GCACAGGAAG AAGTTTCGTA GTAGGATTTG GTGAAAATTG GCCACAGCAT CCTCACCACA GAACTGCCCA TGGTTCATGG TATGACAGCA TGAATGTGCC TGATTACCAT AGACATGTGC TGTATGGTGC ATTGGTCGGC GGACCCGGTG AAAGTGACAA CTACAGGGAC GATATTTCCG ACTATCAGTG CAACGAGGTT GCCTGTGACT ATAACGCAGG TTTTGTAGGT GCACTTGCCA AAATGTACAA CAGATATGAC GGAAGACCGG TACCGGAATT TAAAGCTATT GAAGTGCCGG AAGATGAATT TATGGTTGAA GCTTATGTAA GCAGCAGCGA CAAAAACTAT GTTGAAATAA AAACAAGACT TAACAACAGG ACGGCATGGC CTGCAAGAGT GTCCGAAGGA CTCTCCTTTA GATACTTTAT TGACCTTACT GAAGTAATTG AAGCAGGATA CGGTCCAAAC GATTTAATAA TATCAGGCGG TCAAGGTTCA AGCGGAAAAG TGTCGGGCCC GCACCTGTGG AACAAAGAAA AGAATATATA CTATATAGAA GTTGATTACA CCGGAGATCG TCTCTTCCCC GGCGGACAGG ACCACTATAG AAGAGATTCA TCTTTGCGAA TTGCCGTACC CGGCAACAGT 
GGATGCTGGA ACAGTGAAAA CGATCCGTCC TTCAAAGGAC TCTCCAAAAC AAGCGAATTT AAGAAAGCGG AATACATACC CGTTTACGAA TACGGTGTAA AAGTTGCAGG CATAGAACCT GAGGGAACAG TTGTTCAGCC TTCCCCAAGT CCTACTCCTA CTCCAACACC GCCTCATTCG GACGATGTCC TGTATGGAGA CATAAACAAT GATAAGACAG TAAACTCAAC AGATGTCACA TACTTAAAAA GGTTTTTACT GAAACAAATA AACAGTCTTC CCAATCAAAA AGCAGCGGAT GTAAACCTGG ACGGCAATAT AAATTCTACA GACCTTGTTA TTTTGAAAAG ATACGTTCTG CGTGGAATTA GCAAACTGCC TTACGCACCT TAA
|
Protein sequence | MKKFLSLILV VSFLIMLLTP STKISAATTF NYGEALQKAI MFYEFQRSGK LPSTIRNNWR ADSCLDDGKD VGLDLTGGWF DAGDHVKFNL PMAYTAAMLA WAVYEEKDAF VKSGQLKYIL DEIKWATDYF IKCHPEPDVY YYQVGDGDID HMWWGPAEVV HLRTKRPSYK VDITSPGSTV SAETAAALAA ASIVFKDTDP QYSNLCLKHA KELFNFADKT RSDAGYTAAT NFYTSHSGFY DELTWAATWI YLATGDTSYL DKAESYVEFW STEPQTDIMS YKWGHCWDDV RYGAQLLLAR ITNKPIYKES MERHLDYWTV GVDNSRIKYT PKGLAWLNNW GSLRYATTTA FLAAVYADWE GCSPQKAKIY NDFAKAQVDY ALGSTGRSFV VGFGENWPQH PHHRTAHGSW YDSMNVPDYH RHVLYGALVG GPGESDNYRD DISDYQCNEV ACDYNAGFVG ALAKMYNRYD GRPVPEFKAI EVPEDEFMVE AYVSSSDKNY VEIKTRLNNR TAWPARVSEG LSFRYFIDLT EVIEAGYGPN DLIISGGQGS SGKVSGPHLW NKEKNIYYIE VDYTGDRLFP GGQDHYRRDS SLRIAVPGNS GCWNSENDPS FKGLSKTSEF KKAEYIPVYE YGVKVAGIEP EGTVVQPSPS PTPTPTPPHS DDVLYGDINN DKTVNSTDVT YLKRFLLKQI NSLPNQKAAD VNLDGNINST DLVILKRYVL RGISKLPYAP
|
| |
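As a rough consistency check between the gene and protein sequences, the sketch below translates the first 30 bases of the gene sequence and compares the result with the first 10 residues of the protein sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons; this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and written only for this illustration; no value is claimed here for the GC content of the full sequence.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence and first block of the protein sequence.
gene_prefix = "ATGAAAAAAT TTTTAAGCCT CATTTTAGTC"
protein_prefix = "MKKFLSLILV"

print("prefix translation matches:", translate(gene_prefix) == protein_prefix)
```

To check the whole record, paste the full gene sequence into `translate` and compare the result with the full protein sequence; `gc_fraction` on the same string can be compared with the listed GC content.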