Gene Cthe_0624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0624 
Symbol 
ID4808226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp765032 
End bp769837 
Gene Length4806 bp 
Protein Length1601 aa 
Translation table11 
GC content43% 
IMG OID640106038 
Productglycoside hydrolase family protein 
Protein accessionYP_001037052 
Protein GI125973142 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGA GAAGATTATC GCTACTTTTG GTACTTGCCA TAATGTTTAC GATGGTCGTT 
CCACAGATAT CTGCAAGTGC CGAAACAGTT GCTCCTGAAG GCTACAGGAA GCTTTTGGAT
GTACAAATTT TCAAGGATTC GCCTGTAGTC GGATGGTCAG GAAGCGGTAT GGGCGAGCTT
GAAACTATCG GCGATACCCT TCCGGTTGAT ACCACAGTTA CATATAACGG TTTGCCGACT
TTAAGACTGA ATGTCCAGAC AACCGTTCAG TCAGGATGGT GGATTTCTCT TCTTACATTA
AGAGGATGGA ACACCCATGA CCTTTCCCAG TATGTCGAAA ACGGTTATCT TGAGTTTGAC
ATCAAGGGTA AGGAAGGCGG AGAAGACTTT GTTATTGGTT TCAGGGACAA GGTTTATGAA
CGCGTTTACG GACTTGAAAT TGATGTTACC ACAGTAATAT CAAATTATGT AACGGTAACT
ACGGACTGGC AGCATGTTAA GATTCCTTTG AGAGACCTGA TGAAGATTAA TAACGGATTT
GATCCTTCAT CAGTTACATG CCTGGTGTTC TCAAAAAGAT ATGCAGATCC GTTTACAGTA
TGGTTCAGTG ATATAAAGAT TACATCAGAA GACAATGAAA AGTCCGCTCC TGCAATCAAG
GTAAACCAGC TTGGCTTTAT TCCTGAAGCT GAAAAATACG CTTTGGTTAC AGGTTTTGCA
GAAGAGCTCG CAGTATCGGA AGGTGACGAA TTTGCCGTTA TAAATGCTGC GGACAATTCT
GTTGCTTATA CCGGAAAATT AACTCTTGTA ACAGAATATG AACCTCTTGA TTCCGGAGAA
AAAATACTTA AGGCAGATTT CAGCGACTTG ACTGTACCTG GCAAATACTA CATTAGTATT
GAAGGTCTTG ACAATTCACC CAAGTTTGAA ATCGGTGAAG GTATTTACGG TCCACTGGTT
GTTGACGCTG CAAGATATTT CTATTATCAG CGTCAGGGTA TAGAACTTGA AGAGCCTTAT
GCGCAGGGAT ATCCCCGCAA GGACGTTACT CCTCAGGACG CATATGCTGT ATTTGCATCC
GGAAAGAAGG ATCCGATTGA CATAACAAAG GGTTGGTATG ACGCAGGAGA CTTCGGTAAG
TATGTAAATG CCGGAGCAAC CGGTGTTTCC GATTTGTTCT GGGCATATGA AATGTTCCCT
TCCCAGTTTG TTGACGGTCA GTTCAATATT CCTGAAAGCG GAAACGGTGT ACCGGACATC
CTTGACGAAG CTCGCTGGGA GCTTGAATGG ATGCTGAAAA TGCAGGACAA AGAAAGCGGA
GGATTCTATC CCAGAGTTCA ATCTGACAAT GACGAAAACA TAAAATCAAG AATAATCAGG
GATCAGAACG GCTGTACCAC TGATGATACT GCATGTGCCG CCGGAATACT TGCTCATGCA
TACTTGATTT ACAAGGATAT TGACCCTGAT TTTGCACAAG AGTGCCTGGA TGCGGCAATA
AATGCATGGA AATTCCTTGA AAAGAATCCT GAAAACATTG TTTCACCTCC GGGTCCATAC
AACGTATATG ACGACAGCGG AGACAGACTC TGGGCTGCAG CTTCGCTGTA CAGAGCTACC
GGTGAAGAGG TTTATCATAC ATACTTTAAA CAAAACTACA AATCTTTTGC ACAAAAGTTC
GAAAGCCCGA CTGCATATGC TCATACATGG GGTGATATGT GGCTTACGGC ATTCCTTTCG
TATTTGAAAG CTGAAAACAA GGATCAGGAA GTTGTAGACT GGATTGATAC AGAGTTTGGA
ATCTGGCTTG AAAACATACT CACAAGATAT GAGAACAATC CATGGAAGAA TGCAATTGTT
CCCGGAAACT ACTTCTGGGG AATCAACATG CAGGTTATGA ATGTTCCGAT GGATGCTATC
ATAGGTTCAC AGCTTCTTGG AAAATACAGT GACAGAATAG AAAAATTAGG TTTTGGTTCA
CTTAACTGGC TGCTTGGTAC AAATCCGCTT CGCTTCAGCT TTGTATCAGG ATATGGAGAG
GATTCTGTAA AAGGAGTATT CAGCAATATT TACAATACGG ACGGCAAGCA GGGAATTCCG
AAAGGATACA TGCCTGGTGG ACCAAATGCT TATGAAGGTG CAGGCCTGTC AAGGTTTGCA
GCAAAATGCT ACACCAGAAG TACCGGTGAC TGGGTAGCCA ACGAACATAC AGTATATTGG
AACTCAGCTT TGGTATTTAT GGCTGCTTTT GCAAACCAGG GTTCAGAGGT TAATCCGGGA
CCTGCGCCGG AACCGGGAGT AACTCCGAAT CCTACAGAAC CTGCAAAAGT GGTTGACATC
AGGATAGATA CTTCTGCTGA AAGAAAGCCA ATCAGCCCGT ATATATACGG AAGCAATCAG
GAACTTGATG CAACAGTTAC TGCAAAGAGG TTCGGCGGAA ACAGAACTAC AGGATACAAC
TGGGAAAACA ACTTCTCAAA TGCAGGAAGT GACTGGCTGC ATTACAGTGA TACATACCTT
TTGGAGGACG GCGGAGTACC TAAGGGAGAG TGGAGTACAC CTGCTTCTGT AGTTACCACG
TTCCATGACA AGGCACTTAG CAAAAATGTT CCTTACACAC TTATCACTCT TCAGGCAGCA
GGTTATGTTT CCGCAGACGG AAACGGACCG GTTTCCCAGG AAGAAACTGC ACCGTCTTCA
AGATGGAAGG AAGTTAAGTT TGAAAAGGGA GCACCTTTCT CACTTACACC GGACACAGAA
GATGATTATG TTTACATGGA TGAGTTTGTA AACTATCTTG TAAACAAATA CGGAAATGCA
TCCACACCTA CAGGAATAAA GGGTTATTCA ATAGATAACG AGCCGGCATT GTGGAGTCAT
ACTCATCCGA GAATTCATCC GGACAATGTA ACTGCCAAAG AGCTTATTGA AAAATCTGTA
GCTCTTTCCA AGGCGGTTAA AAAGGTAGAT CCATATGCAG AAATATTCGG ACCTGCTTTG
TACGGATTTG CCGCATATGA GACACTTCAG TCAGCTCCTG ACTGGGGAAC TGAAGGAGAA
GGATACAGGT GGTTTATAGA TTATTACCTC GATAAGATGA AAAAGGCTTC TGATGAAGAA
GGAAAGAGAC TTTTGGACGT ACTTGACGTA CACTGGTATC CGGAAGCCAG GGGCGGCGGT
GAAAGAATAT GCTTTGGAGC CGATCCAAGA AATATTGAGA CAAACAAAGC AAGATTGCAG
GCGCCCAGAA CATTGTGGGA TCCTACATAT ATTGAAGACA GCTGGATAGG ACAATGGAAG
AAGGATTTCC TCCCGATATT ACCTAATCTT TTGGATTCCA TTGAAAAATA TTATCCGGGA
ACGAAGCTTG CTATAACTGA ATATGACTAT GGCGGAGGAA ATCATATTAC AGGCGGTATT
GCTCAAGCCG ATGTTCTTGG TATATTCGGT AAATACGGTG TTTACCTTGC AACATTCTGG
GGAGATGCAA GCAATAACTA TACTGAGGCC GGTATAAACC TTTATACCAA CTACGACGGC
AAAGGCGGCA AATTTGGAGA TACATCCGTA AAATGTGAAA CGTCCGACAT AGAAGTAAGC
TCTGCTTATG CATCCATTGT CGGTGAAGAT GACAGCAAAC TCCATATCAT TCTTTTGAAC
AAGAACTATG ACCAGCCGAC GACATTCAAT TTCTCAATTG ACAGCAGCAA GAACTACACA
ATAGGAAATG TATGGGCATT TGACAGAGGA AGCTCCAATA TTACTCAAAG AACTCCTATA
GTGAACATAA AGGACAATAC CTTCACATAT ACAGTACCGG CTTTGACAGC GTGCCATATT
GTGCTTGAAG CTGCGGAGCC CGTAGTGTAC GGAGACTTGA ACAATGACTC TAAAGTAAAC
GCAGTAGACA TTATGATGCT CAAACGATAT ATTCTCGGAA TAATAGATAA TATAAATCTG
ACAGCAGCTG ACATTTATTT TGACGGTGTT GTAAATTCAA GTGACTATAA TATAATGAAG
AGATATTTGT TAAAGGCAAT AGAAGATATT CCTTATGTTC CGGAAAACCA GGCACCTAAA
GCAATATTTA CTTTCTCGCC CGAAGATCCG GTTACTGACG AGAATGTAGT GTTCAATGCA
TCAAATTCAA TAGATGAAGA CGGAACAATT GCCTATTATG TATGGGATTT CGGTGACGGA
TATGAAGGAA CTTCAACAAC ACCGACTATT ACCTATAAGT ATAAAAACCC CGGAACATAC
AAAGTAAAAC TGATTGTTAC AGACAACCAG GGGGCTTCAA GTTCGTTTAC AGCTACCATA
AAAGTAACCT CAGCTACCGG GGACAATTCC AAATTCAACT TTGAAGACGG CACGCTGGGA
GGATTTACAA CATCCGGAAC AAATGCTACG GGTGTTGTTG TGAACACTAC TGAAAAAGCA
TTCAAAGGCG AAAGAGGTCT TAAATGGACT GTAACAAGCG AAGGAGAAGG AACTGCAGAA
TTGAAACTTG ACGGAGGTAC TATTGTAGTT CCCGGTACCA CTATGACGTT TAGAATCTGG
ATACCTTCCG GTGCGCCTAT TGCTGCCATC CAGCCGTATA TTATGCCTCA TACACCTGAT
TGGTCGGAAG TCCTCTGGAA TTCGACATGG AAAGGATACA CCATGGTGAA GACCGATGAC
TGGAATGAAA TTACCCTGAC ACTGCCGGAA GACGTGGATC CGACTTGGCC GCAGCAGATG
GGTATACAGG TACAGACCAT AGATGAAGGT GAATTCACTA TCTATGTAGA TGCTATTGAC
TGGTAA
 
Protein sequence
MAKRRLSLLL VLAIMFTMVV PQISASAETV APEGYRKLLD VQIFKDSPVV GWSGSGMGEL 
ETIGDTLPVD TTVTYNGLPT LRLNVQTTVQ SGWWISLLTL RGWNTHDLSQ YVENGYLEFD
IKGKEGGEDF VIGFRDKVYE RVYGLEIDVT TVISNYVTVT TDWQHVKIPL RDLMKINNGF
DPSSVTCLVF SKRYADPFTV WFSDIKITSE DNEKSAPAIK VNQLGFIPEA EKYALVTGFA
EELAVSEGDE FAVINAADNS VAYTGKLTLV TEYEPLDSGE KILKADFSDL TVPGKYYISI
EGLDNSPKFE IGEGIYGPLV VDAARYFYYQ RQGIELEEPY AQGYPRKDVT PQDAYAVFAS
GKKDPIDITK GWYDAGDFGK YVNAGATGVS DLFWAYEMFP SQFVDGQFNI PESGNGVPDI
LDEARWELEW MLKMQDKESG GFYPRVQSDN DENIKSRIIR DQNGCTTDDT ACAAGILAHA
YLIYKDIDPD FAQECLDAAI NAWKFLEKNP ENIVSPPGPY NVYDDSGDRL WAAASLYRAT
GEEVYHTYFK QNYKSFAQKF ESPTAYAHTW GDMWLTAFLS YLKAENKDQE VVDWIDTEFG
IWLENILTRY ENNPWKNAIV PGNYFWGINM QVMNVPMDAI IGSQLLGKYS DRIEKLGFGS
LNWLLGTNPL RFSFVSGYGE DSVKGVFSNI YNTDGKQGIP KGYMPGGPNA YEGAGLSRFA
AKCYTRSTGD WVANEHTVYW NSALVFMAAF ANQGSEVNPG PAPEPGVTPN PTEPAKVVDI
RIDTSAERKP ISPYIYGSNQ ELDATVTAKR FGGNRTTGYN WENNFSNAGS DWLHYSDTYL
LEDGGVPKGE WSTPASVVTT FHDKALSKNV PYTLITLQAA GYVSADGNGP VSQEETAPSS
RWKEVKFEKG APFSLTPDTE DDYVYMDEFV NYLVNKYGNA STPTGIKGYS IDNEPALWSH
THPRIHPDNV TAKELIEKSV ALSKAVKKVD PYAEIFGPAL YGFAAYETLQ SAPDWGTEGE
GYRWFIDYYL DKMKKASDEE GKRLLDVLDV HWYPEARGGG ERICFGADPR NIETNKARLQ
APRTLWDPTY IEDSWIGQWK KDFLPILPNL LDSIEKYYPG TKLAITEYDY GGGNHITGGI
AQADVLGIFG KYGVYLATFW GDASNNYTEA GINLYTNYDG KGGKFGDTSV KCETSDIEVS
SAYASIVGED DSKLHIILLN KNYDQPTTFN FSIDSSKNYT IGNVWAFDRG SSNITQRTPI
VNIKDNTFTY TVPALTACHI VLEAAEPVVY GDLNNDSKVN AVDIMMLKRY ILGIIDNINL
TAADIYFDGV VNSSDYNIMK RYLLKAIEDI PYVPENQAPK AIFTFSPEDP VTDENVVFNA
SNSIDEDGTI AYYVWDFGDG YEGTSTTPTI TYKYKNPGTY KVKLIVTDNQ GASSSFTATI
KVTSATGDNS KFNFEDGTLG GFTTSGTNAT GVVVNTTEKA FKGERGLKWT VTSEGEGTAE
LKLDGGTIVV PGTTMTFRIW IPSGAPIAAI QPYIMPHTPD WSEVLWNSTW KGYTMVKTDD
WNEITLTLPE DVDPTWPQQM GIQVQTIDEG EFTIYVDAID W