Gene Cthe_0413 details

Gene Information

Locus tag: Cthe_0413
Symbol:
ID: 4808416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 519467
End bp: 523141
Gene Length: 3675 bp
Protein Length: 1224 aa
Translation table: 11
GC content: 44%
IMG OID: 640105827
Product: glycoside hydrolase family protein
Protein accession: YP_001036844
Protein GI: 125972934
COG category:
COG ID:
TIGRFAM ID:
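
The length fields are consistent with the coordinates. A minimal sketch of that check follows. It assumes 1-based inclusive coordinates and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(519467, 523141)
print(length)                     # 3675
print(protein_length_aa(length))  # 1224
```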


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTA GAAGGTCAAT TTGTACTGCT GTTTTGTTGG CGGTTTTATT GACACTTCTG 
GTACCGACAT CCGTGTTTGC CTTAGAAGAT AATTCTTCGA CTTTGCCGCC GTATAAAAAC
GACCTTTTGT ATGAGAGGAC TTTTGATGAG GGACTTTGTT ATCCATGGCA TACCTGTGAA
GACAGCGGAG GAAAATGCTC CTTTGATGTG GTCGATGTTC CGGGGCAGCC CGGTAATAAA
GCATTTGCCG TTACTGTTCT TGACAAAGGG CAAAACAGAT GGAGCGTTCA GATGAGACAC
CGTGGTCTTA CTCTTGAACA GGGACATACA TATAGAGTAC GGCTTAAGAT TTGGGCAGAT
GCGTCCTGTA AAGTTTATAT AAAAATAGGA CAAATGGGCG AGCCCTATGC TGAATATTGG
AACAACAAGT GGAGTCCATA CACACTGACA GCAGGTAAGG TATTGGAAAT TGACGAGACG
TTTGTTATGG ACAAGCCAAC TGACGACACA TGCGAATTTA CATTCCATTT AGGTGGCGAA
TTGGCAGCAA CTCCTCCATA TACAGTTTAT CTTGATGATG TATCCCTTTA TGACCCAGAA
TATACGAAGC CTGTTGAATA TATACTTCCG CAGCCTGATG TACGTGTGAA CCAGGTTGGC
TACCTGCCGG AGGGCAAGAA AGTTGCCACT GTGGTATGCA ATTCAACTCA GCCGGTAAAA
TGGCAGCTTA AGAATGCTGC AGGCGTTGTA GTTTTGGAAG GTTATACCGA ACCAAAGGGT
CTTGACAAAG ACTCGCAGGA TTATGTACAT TGGCTTGATT TTTCCGATTT TGCAACCGAA
GGAATTGGTT ACTATTTTGA ACTTCCGACT GTAAACAGTC CTACAAACTA CAGTCATCCA
TTTGACATTC GCAAAGACAT CTATACTCAG ATGAAATATG ATGCATTGGC ATTCTTCTAT
CACAAGAGAA GCGGTATTCC TATTGAAATG CCGTATGCAG GAGGAGAACA GTGGACCAGA
CCTGCAGGAC ATATCGGAAT TGAGCCGAAC AAGGGAGATA CAAATGTTCC TACATGGCCT
CAGGATGATG AGTATGCAGG AATACCTCAG AAGAATTATA CAAAGGATGT AACCGGTGGA
TGGTATGATG CCGGTGACCA CGGTAAATAT GTTGTAAACG GCGGTATAGC CGTCTGGACA
TTAATGAACA TGTATGAGAG GGCAAAAATT AGAGGTCTTG ACAACTGGGG ACCATACAGG
GACGGCGGAA TGAACATACC GGAGCAGAAT AACGGTTATC CGGACATTCT TGATGAAGCA
AGATGGGAAA TTGAGTTCTT TAAGAAAATG CAGGTAACTG AAAAAGAGGA TCCTTCCATA
GCCGGAATGG TACACCACAA AATTCACGAC TTCAGATGGA CTGCTTTGGG TATGTTGCCT
CACGAAGATC CCCAGCCACG TTACTTAAGG CCGGTAAGTA CGGCTGCGAC TTTGAACTTT
GCGGCAACTT TGGCACAAAG TGCACGTCTT TGGAAAGATT ATGATCCGAC TTTTGCTGCT
GACTGTTTGG AAAAGGCTGA AATAGCATGG CAGGCGGCAT TAAAGCATCC TGATATTTAT
GCTGAGTATA CTCCCGGTAG CGGTGGTCCC GGAGGCGGAC CATACAATGA CGACTATGTC
GGAGACGAAT TCTACTGGGC AGCCTGCGAA CTTTATGTAA CAACAGGAAA AGACGAATAT
AAGAATTACC TGATGAATTC ACCTCACTAT CTTGAAATGC CTGCAAAGAT GGGTGAAAAC
GGTGGAGCAA ACGGAGAAGA CAACGGATTG TGGGGATGCT TCACCTGGGG AACTACTCAA
GGATTGGGAA CCATTACTCT TGCATTGGTT GAAAACGGAT TGCCTGCTAC AGACATTCAA
AAGGCAAGAA ACAATATAGC TAAAGCTGCT GACAGATGGC TTGAGAATAT TGAAGAGCAA
GGTTACAGAC TGCCGATCAA ACAGGCGGAG GATGAGAGAG GCGGTTATCC ATGGGGTTCA
AACTCCTTCA TTTTGAACCA GATGATAGTT ATGGGATATG CCTATGACTT TACAGGTGAC
TCCAAATATC TCGATGGAAT GTTTGACGGC ATAAGCTACC TGTTGGGAAG AAACGCAATG
GATCAGTCCT ATGTAACAGG GTATGGTGAG CGTCCGCTTC AGAATCCTCA TGACAGGTTC
TGGACGCCGC AGACAAGTAA GAGATTCCCT GCTCCACCTC CGGGTATAAT TTCCGGCGGT
CCGAACTCCC GTTTCGAGGA CCCGACAATA AATGCGGCCG TTAAGAAGGA TACACCGCCA
CAGAAATGTT TTATCGACCA TACAGACTCA TGGTCAACCA ACGAGATAAC TGTTAACTGG
AATGCTCCGT TTGCATGGGT TACAGCTTAT CTTGACGAGC AGTACACAGA CAGTGAAACC
GATAAGGTAA CTATTGATTC GCCTGTTGCA GGAGAAAGAT TTGAAGCCGG TAAAGACATT
AATATAAGCG CAACTGTTAA ATCAAAAACT CCTGTAAGCA AAGTAGAGTT TTACAATGGA
GATACGCTTA TTTCCAGTGA CACAACTGCA CCTTACACAG CAAAGATAAC AGGAGCCGCT
GTCGGAGCAT ATAACCTTAA AGCGGTTGCA GTGCTGTCTG ACGGAAGAAG AATTGAGTCA
CCGGTAACTC CTGTACTTGT TAAGGTAATT GTGAAACCTA CTGTAAAACT TACTGCACCC
AAGTCAAATG TTGTGGCTTA TGGAAATGAG TTCCTGAAGA TTACAGCAAC AGCCAGTGAC
TCTGACGGCA AAATCTCCAG GGTTGATTTC CTTGTTGACG GTGAAGTAAT CGGTTCAGAC
AGGGAAGCAC CTTATGAATA TGAGTGGAAA GCTGTGGAAG GCAATCACGA AATAAGTGTA
ATTGCTTATG ATGATGACGA TGCGGCTTCA ACACCTGATT CCGTAAAAAT ATTTGTAAAA
CAGGCACGGG ATGTAAAAGT ACAGTATTTG TGCGAAAATA CGCAAACATC CACTCAGGAA
ATCAAGGGTA AATTCAATAT AGTTAACACA GGAAACAGAG ATTATTCGCT GAAAGATATA
GTATTAAGAT ACTACTTTAC CAAGGAGCAC AATTCACAGC TTCAGTTTAT CTGCTATTAT
ACACCCATAG GCTCCGGAAA TCTCATTCCG TCCTTTGGCG GCTCGGGTGA CGAGCATTAT
CTGCAGCTGG AATTCAAAGA TGTCAAGCTG CCTGCCGGCG GTCAGACTGG GGAAATACAG
TTTGTTATAA GATATGCAGA TAACTCCTTC CATGATCAGT CGAACGACTA TTCGTTCGAT
CCAACTATAA AAGCGTTCCA GGATTATGGC AAGGTTACCC TGTATAAGAA TGGAGAACTT
GTTTGGGGAA CGCCGCCGGG CGGTACAGAA CCTGAAGAAC CGGAAGAGCC TGCGATAGTT
TACGGCGACT GTAATGATGA CGGCAAAGTA AATTCAACAG ACGTCGCAGT AATGAAGAGA
TATTTAAAGA AAGAAAATGT TAATATTAAT CTTGACAATG CAGATGTGAA TGCGGACGGC
AAAGTTAACT CAACAGACTT CTCAATACTT AAGAGATATG TTATGAAGAA CATAGAAGAA
TTGCCATATC GATAA
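
The gene sequence is displayed in space-separated blocks of ten bases. A minimal sketch for stripping that layout and computing a GC fraction is given here. Run on the full sequence, the result could be compared with the GC content field of this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, used only as sample input.
first_line = "ATGAAATTTA GAAGGTCAAT TTGTACTGCT GTTTTGTTGG CGGTTTTATT GACACTTCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```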
 
Protein sequence
MKFRRSICTA VLLAVLLTLL VPTSVFALED NSSTLPPYKN DLLYERTFDE GLCYPWHTCE 
DSGGKCSFDV VDVPGQPGNK AFAVTVLDKG QNRWSVQMRH RGLTLEQGHT YRVRLKIWAD
ASCKVYIKIG QMGEPYAEYW NNKWSPYTLT AGKVLEIDET FVMDKPTDDT CEFTFHLGGE
LAATPPYTVY LDDVSLYDPE YTKPVEYILP QPDVRVNQVG YLPEGKKVAT VVCNSTQPVK
WQLKNAAGVV VLEGYTEPKG LDKDSQDYVH WLDFSDFATE GIGYYFELPT VNSPTNYSHP
FDIRKDIYTQ MKYDALAFFY HKRSGIPIEM PYAGGEQWTR PAGHIGIEPN KGDTNVPTWP
QDDEYAGIPQ KNYTKDVTGG WYDAGDHGKY VVNGGIAVWT LMNMYERAKI RGLDNWGPYR
DGGMNIPEQN NGYPDILDEA RWEIEFFKKM QVTEKEDPSI AGMVHHKIHD FRWTALGMLP
HEDPQPRYLR PVSTAATLNF AATLAQSARL WKDYDPTFAA DCLEKAEIAW QAALKHPDIY
AEYTPGSGGP GGGPYNDDYV GDEFYWAACE LYVTTGKDEY KNYLMNSPHY LEMPAKMGEN
GGANGEDNGL WGCFTWGTTQ GLGTITLALV ENGLPATDIQ KARNNIAKAA DRWLENIEEQ
GYRLPIKQAE DERGGYPWGS NSFILNQMIV MGYAYDFTGD SKYLDGMFDG ISYLLGRNAM
DQSYVTGYGE RPLQNPHDRF WTPQTSKRFP APPPGIISGG PNSRFEDPTI NAAVKKDTPP
QKCFIDHTDS WSTNEITVNW NAPFAWVTAY LDEQYTDSET DKVTIDSPVA GERFEAGKDI
NISATVKSKT PVSKVEFYNG DTLISSDTTA PYTAKITGAA VGAYNLKAVA VLSDGRRIES
PVTPVLVKVI VKPTVKLTAP KSNVVAYGNE FLKITATASD SDGKISRVDF LVDGEVIGSD
REAPYEYEWK AVEGNHEISV IAYDDDDAAS TPDSVKIFVK QARDVKVQYL CENTQTSTQE
IKGKFNIVNT GNRDYSLKDI VLRYYFTKEH NSQLQFICYY TPIGSGNLIP SFGGSGDEHY
LQLEFKDVKL PAGGQTGEIQ FVIRYADNSF HDQSNDYSFD PTIKAFQDYG KVTLYKNGEL
VWGTPPGGTE PEEPEEPAIV YGDCNDDGKV NSTDVAVMKR YLKKENVNIN LDNADVNADG
KVNSTDFSIL KRYVMKNIEE LPYR
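
The protein sequence can be cross-checked against the gene sequence by translation. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Compact genetic code: codons enumerated over T, C, A, G with the
# first base varying slowest; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; a trailing partial codon is ignored.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAAATTTA GAAGGTCAAT TTGTACTGCT"))  # MKFRRSICTA
print(translate("TTGCCATATC GATAA"))                  # LPYR*
```

The two outputs match the first ten and last four residues of the protein sequence, with `*` standing for the terminal stop codon, which is not shown in the protein listing.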