Gene Cthe_0040 details

Gene Information

Locus tag: Cthe_0040
Symbol:
ID: 4808805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 49460
End bp: 52123
Gene Length: 2664 bp
Protein Length: 887 aa
Translation table: 11
GC content: 44%
IMG OID: 640105449
Product: cellulose 1,4-beta-cellobiosidase
Protein accession: YP_001036474
Protein GI: 125972564
COG category:
COG ID:
TIGRFAM ID:
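
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 49460
end_bp = 52123

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # matches the "Gene Length" field
print(protein_length, "aa")   # matches the "Protein Length" field
```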


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGCTGG TTAACAGTTT GGGAAGAAGA AAAATTCTTT TGATACTTGC TGTTATTGTA 
GCTTTCAGCA CTGTTCTGTT GTTTGCAAAG CTATGGGGGC GAAAGACTTC AAGTACTTTG
GATGAGGTTG GTTCAAAAAC TCATGGTGAT TTGACGGCAG AAAATAAAAA CGGCGGATAT
TTACCAGAGG AAGAGATTCC AGATCAGCCT CCGGCAACCG GGGCCTTCAA CTACGGTGAA
GCGTTGCAAA AAGCAATTTT TTTCTATGAG TGTCAAAGAT CCGGAAAGCT CGATCCCTCA
ACTCTTCGCC TAAATTGGCG GGGAGATTCG GGACTGGATG ACGGAAAAGA TGCAGGAATT
GATCTTACCG GCGGATGGTA TGATGCGGGA GATCACGTAA AATTTAATTT GCCCATGTCT
TATTCGGCGG CTATGCTGGG GTGGGCGGTG TATGAATATG AAGATGCGTT TAAACAGAGT
GGACAGTATA ACCACATATT GAATAACATA AAATGGGCTT GTGATTATTT TATAAAATGT
CATCCGGAAA AGGATGTGTA CTATTACCAG GTGGGCGACG GTCATGCTGA CCATGCGTGG
TGGGGTCCTG CCGAAGTAAT GCCTATGGAA AGGCCGTCGT ACAAAGTTGA CAGGTCATCG
CCGGGTTCCA CGGTTGTGGC AGAGACGTCG GCAGCTTTAG CTATAGCATC GATAATATTT
AAGAAAGTGG ATGGTGAATA CTCGAAAGAA TGTTTAAAGC ATGCAAAAGA ACTGTTTGAA
TTTGCGGACA CCACAAAAAG CGATGATGGG TACACTGCAG CCAATGGTTT TTATAATTCA
TGGAGCGGAT TTTATGATGA GCTTTCCTGG GCAGCTGTAT GGCTTTATCT TGCTACCAAT
GATTCTTCAT ATTTGGATAA AGCGGAAAGT TATTCTGACA AATGGGGTTA TGAGCCACAG
ACGAACATAC CGAAGTATAA GTGGGCTCAA TGCTGGGATG ATGTGACTTA TGGCACTTAT
CTTCTTTTGG CCAGGATTAA AAATGACAAC GGAAAATATA AAGAAGCGAT AGAAAGGCAT
CTTGATTGGT GGACAACCGG ATACAACGGT GAAAGAATTA CATATACTCC GAAGGGACTT
GCATGGCTCG ACCAGTGGGG ATCATTGAGG TATGCAACCA CAACGGCATT TCTGGCATGT
GTTTATTCCG ATTGGGAGAA CGGTGATAAG GAAAAAGCAA AAACTTATCT GGAGTTTGCA
AGAAGCCAGG CGGATTATGC TTTGGGAAGC ACGGGAAGAA GCTTTGTTGT GGGTTTTGGA
GAAAATCCAC CGAAAAGGCC CCATCACAGA ACTGCTCACG GTTCATGGGC GGACAGTCAG
ATGGAGCCTC CCGAACACAG GCATGTTCTT TATGGTGCCC TTGTGGGAGG ACCTGACAGC
ACGGACAACT ACACCGACGA CATCAGTAAT TACACCTGCA ATGAAGTTGC CTGTGACTAT
AATGCAGGTT TTGTGGGACT GCTTGCAAAA ATGTACAAGC TTTATGGCGG AAGTCCCGAT
CCCAAATTTA ACGGTATAGA AGAAGTTCCG GAGGATGAAA TATTCGTTGA AGCCGGTGTG
AATGCATCGG GAAACAATTT CATTGAAATA AAAGCGATAG TTAATAATAA ATCGGGCTGG
CCTGCAAGAG TATGTGAGAA TTTATCCTTT AGATATTTTA TCAACATTGA AGAGATTGTG
AATGCGGGAA AAAGTGCAAG CGACCTGCAA GTGAGCTCCA GCTACAATCA GGGGGCAAAA
CTGTCCGATG TAAAGCACTA CAAGGACAAT ATTTATTATG TGGAAGTGGA TTTGTCGGGG
ACAAAAATAT ATCCCGGGGG ACAATCGGCA TACAAGAAGG AAGTGCAGTT TAGAATTTCC
GCGCCGGAGG GCACGGTGTT TAATCCGGAA AACGACTATT CCTATCAGGG ACTTTCGGCA
GGTACGGTTG TAAAGTCTGA GTATATTCCG GTATATGATG CCGGGGTGCT GGTATTTGGA
AGGGAACCGG GCTCAGCATC GAAAAGCACG TCTAAAGACA ATGGTTTGTC CAAGGCAACT
CCCACGGTGA AAACTGAATC TCAGCCGACA GCAAAACACA CTCAAAATCC TGCCTCAGAC
TTTAAAACTC CAGCCAATCA GAACAGTGTA AAAAAAGACC AAGGCATAAA AGGAGAAGTG
GTATTACAGT ACGCAAACGG GAATGCAGGT GCTACGTCAA ACAGTATTAA TCCGAGGTTT
AAAATAATTA ACAACGGTAC AAAAGCCATA AATTTGTCCG ATGTCAAGAT TAGATATTAT
TACACAAAAG AAGGGGGCGC ATCTCAAAAC TTCTGGTGTG ATTGGAGCAG TGCCGGCAAT
TCAAATGTTA CAGGAAACTT CTTTAATCTT TCTTCACCGA AAGAAGGAGC GGACACCTGT
CTTGAAGTTG GTTTCGGAAG TGGGGCCGGA ACCCTTGATC CTGGTGGAAG CGTTGAAGTA
CAGATAAGGT TTTCAAAGGA AGACTGGTCA AACTATAACC AGTCAAACGA TTATTCTTTC
AATCCGTCTG CTTCCGATTA TACGGATTGG AACAGGGTGA CGTTGTATAT TTCAAACAAG
CTTGTTTACG GCAAAGAACC TTGA
 
Protein sequence
MRLVNSLGRR KILLILAVIV AFSTVLLFAK LWGRKTSSTL DEVGSKTHGD LTAENKNGGY 
LPEEEIPDQP PATGAFNYGE ALQKAIFFYE CQRSGKLDPS TLRLNWRGDS GLDDGKDAGI
DLTGGWYDAG DHVKFNLPMS YSAAMLGWAV YEYEDAFKQS GQYNHILNNI KWACDYFIKC
HPEKDVYYYQ VGDGHADHAW WGPAEVMPME RPSYKVDRSS PGSTVVAETS AALAIASIIF
KKVDGEYSKE CLKHAKELFE FADTTKSDDG YTAANGFYNS WSGFYDELSW AAVWLYLATN
DSSYLDKAES YSDKWGYEPQ TNIPKYKWAQ CWDDVTYGTY LLLARIKNDN GKYKEAIERH
LDWWTTGYNG ERITYTPKGL AWLDQWGSLR YATTTAFLAC VYSDWENGDK EKAKTYLEFA
RSQADYALGS TGRSFVVGFG ENPPKRPHHR TAHGSWADSQ MEPPEHRHVL YGALVGGPDS
TDNYTDDISN YTCNEVACDY NAGFVGLLAK MYKLYGGSPD PKFNGIEEVP EDEIFVEAGV
NASGNNFIEI KAIVNNKSGW PARVCENLSF RYFINIEEIV NAGKSASDLQ VSSSYNQGAK
LSDVKHYKDN IYYVEVDLSG TKIYPGGQSA YKKEVQFRIS APEGTVFNPE NDYSYQGLSA
GTVVKSEYIP VYDAGVLVFG REPGSASKST SKDNGLSKAT PTVKTESQPT AKHTQNPASD
FKTPANQNSV KKDQGIKGEV VLQYANGNAG ATSNSINPRF KIINNGTKAI NLSDVKIRYY
YTKEGGASQN FWCDWSSAGN SNVTGNFFNL SSPKEGADTC LEVGFGSGAG TLDPGGSVEV
QIRFSKEDWS NYNQSNDYSF NPSASDYTDW NRVTLYISNK LVYGKEP
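
Both sequences are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below is one possible way to do that and to translate the gene sequence into protein. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical and do not come from the source database.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# amino acid assignments (it differs only in its permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG or TTG) would be read here
    as its usual amino acid rather than Met; this gene starts with ATG.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
first_dna_line = "ATGAGGCTGG TTAACAGTTT GGGAAGAAGA AAAATTCTTT TGATACTTGC TGTTATTGTA"
first_protein_blocks = "MRLVNSLGRR KILLILAVIV"

print(translate(first_dna_line) == clean(first_protein_blocks))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check on the record.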