Gene Information |
Locus tag | Cthe_0040 |
Symbol | |
ID | 4808805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 49460 |
End bp | 52123 |
Gene Length | 2664 bp |
Protein Length | 887 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105449 |
Product | cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_001036474 |
Protein GI | 125972564 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
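The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of three")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 49460, End bp 52123.
length = gene_length_bp(49460, 52123)
print(length, protein_length_aa(length))  # prints "2664 887"
```

Under these assumptions the computed values match the Gene Length (2664 bp) and Protein Length (887 aa) fields of the record.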
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCTGG TTAACAGTTT GGGAAGAAGA AAAATTCTTT TGATACTTGC TGTTATTGTA GCTTTCAGCA CTGTTCTGTT GTTTGCAAAG CTATGGGGGC GAAAGACTTC AAGTACTTTG GATGAGGTTG GTTCAAAAAC TCATGGTGAT TTGACGGCAG AAAATAAAAA CGGCGGATAT TTACCAGAGG AAGAGATTCC AGATCAGCCT CCGGCAACCG GGGCCTTCAA CTACGGTGAA GCGTTGCAAA AAGCAATTTT TTTCTATGAG TGTCAAAGAT CCGGAAAGCT CGATCCCTCA ACTCTTCGCC TAAATTGGCG GGGAGATTCG GGACTGGATG ACGGAAAAGA TGCAGGAATT GATCTTACCG GCGGATGGTA TGATGCGGGA GATCACGTAA AATTTAATTT GCCCATGTCT TATTCGGCGG CTATGCTGGG GTGGGCGGTG TATGAATATG AAGATGCGTT TAAACAGAGT GGACAGTATA ACCACATATT GAATAACATA AAATGGGCTT GTGATTATTT TATAAAATGT CATCCGGAAA AGGATGTGTA CTATTACCAG GTGGGCGACG GTCATGCTGA CCATGCGTGG TGGGGTCCTG CCGAAGTAAT GCCTATGGAA AGGCCGTCGT ACAAAGTTGA CAGGTCATCG CCGGGTTCCA CGGTTGTGGC AGAGACGTCG GCAGCTTTAG CTATAGCATC GATAATATTT AAGAAAGTGG ATGGTGAATA CTCGAAAGAA TGTTTAAAGC ATGCAAAAGA ACTGTTTGAA TTTGCGGACA CCACAAAAAG CGATGATGGG TACACTGCAG CCAATGGTTT TTATAATTCA TGGAGCGGAT TTTATGATGA GCTTTCCTGG GCAGCTGTAT GGCTTTATCT TGCTACCAAT GATTCTTCAT ATTTGGATAA AGCGGAAAGT TATTCTGACA AATGGGGTTA TGAGCCACAG ACGAACATAC CGAAGTATAA GTGGGCTCAA TGCTGGGATG ATGTGACTTA TGGCACTTAT CTTCTTTTGG CCAGGATTAA AAATGACAAC GGAAAATATA AAGAAGCGAT AGAAAGGCAT CTTGATTGGT GGACAACCGG ATACAACGGT GAAAGAATTA CATATACTCC GAAGGGACTT GCATGGCTCG ACCAGTGGGG ATCATTGAGG TATGCAACCA CAACGGCATT TCTGGCATGT GTTTATTCCG ATTGGGAGAA CGGTGATAAG GAAAAAGCAA AAACTTATCT GGAGTTTGCA AGAAGCCAGG CGGATTATGC TTTGGGAAGC ACGGGAAGAA GCTTTGTTGT GGGTTTTGGA GAAAATCCAC CGAAAAGGCC CCATCACAGA ACTGCTCACG GTTCATGGGC GGACAGTCAG ATGGAGCCTC CCGAACACAG GCATGTTCTT TATGGTGCCC TTGTGGGAGG ACCTGACAGC ACGGACAACT ACACCGACGA CATCAGTAAT TACACCTGCA ATGAAGTTGC CTGTGACTAT AATGCAGGTT TTGTGGGACT GCTTGCAAAA ATGTACAAGC TTTATGGCGG AAGTCCCGAT CCCAAATTTA ACGGTATAGA AGAAGTTCCG GAGGATGAAA TATTCGTTGA AGCCGGTGTG AATGCATCGG GAAACAATTT CATTGAAATA AAAGCGATAG TTAATAATAA ATCGGGCTGG CCTGCAAGAG TATGTGAGAA TTTATCCTTT AGATATTTTA TCAACATTGA AGAGATTGTG AATGCGGGAA AAAGTGCAAG CGACCTGCAA GTGAGCTCCA GCTACAATCA GGGGGCAAAA 
CTGTCCGATG TAAAGCACTA CAAGGACAAT ATTTATTATG TGGAAGTGGA TTTGTCGGGG ACAAAAATAT ATCCCGGGGG ACAATCGGCA TACAAGAAGG AAGTGCAGTT TAGAATTTCC GCGCCGGAGG GCACGGTGTT TAATCCGGAA AACGACTATT CCTATCAGGG ACTTTCGGCA GGTACGGTTG TAAAGTCTGA GTATATTCCG GTATATGATG CCGGGGTGCT GGTATTTGGA AGGGAACCGG GCTCAGCATC GAAAAGCACG TCTAAAGACA ATGGTTTGTC CAAGGCAACT CCCACGGTGA AAACTGAATC TCAGCCGACA GCAAAACACA CTCAAAATCC TGCCTCAGAC TTTAAAACTC CAGCCAATCA GAACAGTGTA AAAAAAGACC AAGGCATAAA AGGAGAAGTG GTATTACAGT ACGCAAACGG GAATGCAGGT GCTACGTCAA ACAGTATTAA TCCGAGGTTT AAAATAATTA ACAACGGTAC AAAAGCCATA AATTTGTCCG ATGTCAAGAT TAGATATTAT TACACAAAAG AAGGGGGCGC ATCTCAAAAC TTCTGGTGTG ATTGGAGCAG TGCCGGCAAT TCAAATGTTA CAGGAAACTT CTTTAATCTT TCTTCACCGA AAGAAGGAGC GGACACCTGT CTTGAAGTTG GTTTCGGAAG TGGGGCCGGA ACCCTTGATC CTGGTGGAAG CGTTGAAGTA CAGATAAGGT TTTCAAAGGA AGACTGGTCA AACTATAACC AGTCAAACGA TTATTCTTTC AATCCGTCTG CTTCCGATTA TACGGATTGG AACAGGGTGA CGTTGTATAT TTCAAACAAG CTTGTTTACG GCAAAGAACC TTGA
|
Protein sequence | MRLVNSLGRR KILLILAVIV AFSTVLLFAK LWGRKTSSTL DEVGSKTHGD LTAENKNGGY LPEEEIPDQP PATGAFNYGE ALQKAIFFYE CQRSGKLDPS TLRLNWRGDS GLDDGKDAGI DLTGGWYDAG DHVKFNLPMS YSAAMLGWAV YEYEDAFKQS GQYNHILNNI KWACDYFIKC HPEKDVYYYQ VGDGHADHAW WGPAEVMPME RPSYKVDRSS PGSTVVAETS AALAIASIIF KKVDGEYSKE CLKHAKELFE FADTTKSDDG YTAANGFYNS WSGFYDELSW AAVWLYLATN DSSYLDKAES YSDKWGYEPQ TNIPKYKWAQ CWDDVTYGTY LLLARIKNDN GKYKEAIERH LDWWTTGYNG ERITYTPKGL AWLDQWGSLR YATTTAFLAC VYSDWENGDK EKAKTYLEFA RSQADYALGS TGRSFVVGFG ENPPKRPHHR TAHGSWADSQ MEPPEHRHVL YGALVGGPDS TDNYTDDISN YTCNEVACDY NAGFVGLLAK MYKLYGGSPD PKFNGIEEVP EDEIFVEAGV NASGNNFIEI KAIVNNKSGW PARVCENLSF RYFINIEEIV NAGKSASDLQ VSSSYNQGAK LSDVKHYKDN IYYVEVDLSG TKIYPGGQSA YKKEVQFRIS APEGTVFNPE NDYSYQGLSA GTVVKSEYIP VYDAGVLVFG REPGSASKST SKDNGLSKAT PTVKTESQPT AKHTQNPASD FKTPANQNSV KKDQGIKGEV VLQYANGNAG ATSNSINPRF KIINNGTKAI NLSDVKIRYY YTKEGGASQN FWCDWSSAGN SNVTGNFFNL SSPKEGADTC LEVGFGSGAG TLDPGGSVEV QIRFSKEDWS NYNQSNDYSF NPSASDYTDW NRVTLYISNK LVYGKEP
|
| |
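The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below shows one way to strip the grouping, compute a GC fraction, and run a simple completeness check on a coding sequence. The helper names are hypothetical. The stop codons used are those of translation table 11 (TAA, TAG, TGA). Requiring an ATG start is a simplifying assumption, since table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at end.

    Assumption: only ATG is accepted as a start codon here.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# The first two blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGAGGCTGG TTAACAGTTT")
print(len(fragment), gc_fraction(fragment))  # prints "20 0.4"
```

Applied to the full gene sequence from the record, `gc_fraction` could be compared against the reported GC content of 44%, and `looks_like_complete_cds` against the fact that the sequence begins with ATG and ends with TGA.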