Gene Cthe_0745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0745 
Symbol 
ID4810363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp910112 
End bp912304 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content45% 
IMG OID640106162 
Productglycoside hydrolase family protein 
Protein accessionYP_001037173 
Protein GI125973263 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TTTTAAGCCT CATTTTAGTC GTTTCTTTTC TGATTATGCT GCTGACTCCT 
TCGACAAAGA TATCTGCAGC AACTACATTC AACTACGGAG AAGCCCTTCA GAAAGCCATA
ATGTTCTATG AATTCCAGCG TTCCGGAAAA CTGCCTTCCA CCATCAGGAA CAACTGGAGA
GCGGATTCCT GTCTGGACGA CGGCAAAGAT GTGGGGCTTG ACCTTACAGG CGGTTGGTTT
GACGCCGGCG ACCACGTCAA GTTCAACCTT CCAATGGCAT ATACCGCCGC AATGCTCGCG
TGGGCAGTAT ATGAAGAAAA GGACGCTTTT GTGAAAAGCG GGCAGCTTAA ATACATACTG
GATGAAATCA AATGGGCAAC GGATTACTTC ATCAAATGTC ATCCCGAACC CGATGTGTAT
TACTATCAGG TGGGAGACGG AGATATTGAC CACATGTGGT GGGGACCTGC GGAAGTCGTA
CACTTGAGAA CAAAAAGACC TTCATACAAA GTAGATATTA CTTCACCGGG TTCTACCGTG
TCGGCAGAAA CCGCGGCAGC CCTGGCTGCG GCATCAATAG TGTTTAAAGA TACTGATCCC
CAATACTCAA ATCTGTGCTT AAAACACGCA AAGGAACTTT TCAACTTTGC CGACAAAACA
AGAAGTGATG CCGGTTATAC AGCGGCAACA AACTTTTACA CATCCCACAG CGGATTCTAT
GACGAACTTA CATGGGCGGC TACATGGATT TATCTTGCAA CAGGGGACAC AAGCTATCTT
GACAAAGCCG AATCCTATGT TGAATTTTGG AGTACCGAAC CTCAGACAGA TATAATGTCC
TACAAGTGGG GACACTGTTG GGATGACGTG CGCTACGGAG CACAGCTTCT TTTGGCAAGG
ATTACAAACA AGCCGATATA CAAGGAAAGC ATGGAAAGAC ATTTGGATTA TTGGACAGTC
GGAGTTGACA ATTCAAGGAT AAAATATACG CCCAAAGGTT TGGCATGGCT TAATAACTGG
GGTTCTTTGA GATATGCCAC CACCACTGCG TTTCTTGCGG CTGTTTATGC CGATTGGGAA
GGATGCAGCC CGCAAAAGGC GAAAATATAC AACGATTTTG CAAAGGCTCA GGTTGACTAC
GCACTGGGCA GCACAGGAAG AAGTTTCGTA GTAGGATTTG GTGAAAATTG GCCACAGCAT
CCTCACCACA GAACTGCCCA TGGTTCATGG TATGACAGCA TGAATGTGCC TGATTACCAT
AGACATGTGC TGTATGGTGC ATTGGTCGGC GGACCCGGTG AAAGTGACAA CTACAGGGAC
GATATTTCCG ACTATCAGTG CAACGAGGTT GCCTGTGACT ATAACGCAGG TTTTGTAGGT
GCACTTGCCA AAATGTACAA CAGATATGAC GGAAGACCGG TACCGGAATT TAAAGCTATT
GAAGTGCCGG AAGATGAATT TATGGTTGAA GCTTATGTAA GCAGCAGCGA CAAAAACTAT
GTTGAAATAA AAACAAGACT TAACAACAGG ACGGCATGGC CTGCAAGAGT GTCCGAAGGA
CTCTCCTTTA GATACTTTAT TGACCTTACT GAAGTAATTG AAGCAGGATA CGGTCCAAAC
GATTTAATAA TATCAGGCGG TCAAGGTTCA AGCGGAAAAG TGTCGGGCCC GCACCTGTGG
AACAAAGAAA AGAATATATA CTATATAGAA GTTGATTACA CCGGAGATCG TCTCTTCCCC
GGCGGACAGG ACCACTATAG AAGAGATTCA TCTTTGCGAA TTGCCGTACC CGGCAACAGT
GGATGCTGGA ACAGTGAAAA CGATCCGTCC TTCAAAGGAC TCTCCAAAAC AAGCGAATTT
AAGAAAGCGG AATACATACC CGTTTACGAA TACGGTGTAA AAGTTGCAGG CATAGAACCT
GAGGGAACAG TTGTTCAGCC TTCCCCAAGT CCTACTCCTA CTCCAACACC GCCTCATTCG
GACGATGTCC TGTATGGAGA CATAAACAAT GATAAGACAG TAAACTCAAC AGATGTCACA
TACTTAAAAA GGTTTTTACT GAAACAAATA AACAGTCTTC CCAATCAAAA AGCAGCGGAT
GTAAACCTGG ACGGCAATAT AAATTCTACA GACCTTGTTA TTTTGAAAAG ATACGTTCTG
CGTGGAATTA GCAAACTGCC TTACGCACCT TAA
 
Protein sequence
MKKFLSLILV VSFLIMLLTP STKISAATTF NYGEALQKAI MFYEFQRSGK LPSTIRNNWR 
ADSCLDDGKD VGLDLTGGWF DAGDHVKFNL PMAYTAAMLA WAVYEEKDAF VKSGQLKYIL
DEIKWATDYF IKCHPEPDVY YYQVGDGDID HMWWGPAEVV HLRTKRPSYK VDITSPGSTV
SAETAAALAA ASIVFKDTDP QYSNLCLKHA KELFNFADKT RSDAGYTAAT NFYTSHSGFY
DELTWAATWI YLATGDTSYL DKAESYVEFW STEPQTDIMS YKWGHCWDDV RYGAQLLLAR
ITNKPIYKES MERHLDYWTV GVDNSRIKYT PKGLAWLNNW GSLRYATTTA FLAAVYADWE
GCSPQKAKIY NDFAKAQVDY ALGSTGRSFV VGFGENWPQH PHHRTAHGSW YDSMNVPDYH
RHVLYGALVG GPGESDNYRD DISDYQCNEV ACDYNAGFVG ALAKMYNRYD GRPVPEFKAI
EVPEDEFMVE AYVSSSDKNY VEIKTRLNNR TAWPARVSEG LSFRYFIDLT EVIEAGYGPN
DLIISGGQGS SGKVSGPHLW NKEKNIYYIE VDYTGDRLFP GGQDHYRRDS SLRIAVPGNS
GCWNSENDPS FKGLSKTSEF KKAEYIPVYE YGVKVAGIEP EGTVVQPSPS PTPTPTPPHS
DDVLYGDINN DKTVNSTDVT YLKRFLLKQI NSLPNQKAAD VNLDGNINST DLVILKRYVL
RGISKLPYAP