Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0624 |
Symbol | |
ID | 4808226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 765032 |
End bp | 769837 |
Gene Length | 4806 bp |
Protein Length | 1601 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106038 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037052 |
Protein GI | 125973142 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAGA GAAGATTATC GCTACTTTTG GTACTTGCCA TAATGTTTAC GATGGTCGTT CCACAGATAT CTGCAAGTGC CGAAACAGTT GCTCCTGAAG GCTACAGGAA GCTTTTGGAT GTACAAATTT TCAAGGATTC GCCTGTAGTC GGATGGTCAG GAAGCGGTAT GGGCGAGCTT GAAACTATCG GCGATACCCT TCCGGTTGAT ACCACAGTTA CATATAACGG TTTGCCGACT TTAAGACTGA ATGTCCAGAC AACCGTTCAG TCAGGATGGT GGATTTCTCT TCTTACATTA AGAGGATGGA ACACCCATGA CCTTTCCCAG TATGTCGAAA ACGGTTATCT TGAGTTTGAC ATCAAGGGTA AGGAAGGCGG AGAAGACTTT GTTATTGGTT TCAGGGACAA GGTTTATGAA CGCGTTTACG GACTTGAAAT TGATGTTACC ACAGTAATAT CAAATTATGT AACGGTAACT ACGGACTGGC AGCATGTTAA GATTCCTTTG AGAGACCTGA TGAAGATTAA TAACGGATTT GATCCTTCAT CAGTTACATG CCTGGTGTTC TCAAAAAGAT ATGCAGATCC GTTTACAGTA TGGTTCAGTG ATATAAAGAT TACATCAGAA GACAATGAAA AGTCCGCTCC TGCAATCAAG GTAAACCAGC TTGGCTTTAT TCCTGAAGCT GAAAAATACG CTTTGGTTAC AGGTTTTGCA GAAGAGCTCG CAGTATCGGA AGGTGACGAA TTTGCCGTTA TAAATGCTGC GGACAATTCT GTTGCTTATA CCGGAAAATT AACTCTTGTA ACAGAATATG AACCTCTTGA TTCCGGAGAA AAAATACTTA AGGCAGATTT CAGCGACTTG ACTGTACCTG GCAAATACTA CATTAGTATT GAAGGTCTTG ACAATTCACC CAAGTTTGAA ATCGGTGAAG GTATTTACGG TCCACTGGTT GTTGACGCTG CAAGATATTT CTATTATCAG CGTCAGGGTA TAGAACTTGA AGAGCCTTAT GCGCAGGGAT ATCCCCGCAA GGACGTTACT CCTCAGGACG CATATGCTGT ATTTGCATCC GGAAAGAAGG ATCCGATTGA CATAACAAAG GGTTGGTATG ACGCAGGAGA CTTCGGTAAG TATGTAAATG CCGGAGCAAC CGGTGTTTCC GATTTGTTCT GGGCATATGA AATGTTCCCT TCCCAGTTTG TTGACGGTCA GTTCAATATT CCTGAAAGCG GAAACGGTGT ACCGGACATC CTTGACGAAG CTCGCTGGGA GCTTGAATGG ATGCTGAAAA TGCAGGACAA AGAAAGCGGA GGATTCTATC CCAGAGTTCA ATCTGACAAT GACGAAAACA TAAAATCAAG AATAATCAGG GATCAGAACG GCTGTACCAC TGATGATACT GCATGTGCCG CCGGAATACT TGCTCATGCA TACTTGATTT ACAAGGATAT TGACCCTGAT TTTGCACAAG AGTGCCTGGA TGCGGCAATA AATGCATGGA AATTCCTTGA AAAGAATCCT GAAAACATTG TTTCACCTCC GGGTCCATAC AACGTATATG ACGACAGCGG AGACAGACTC TGGGCTGCAG CTTCGCTGTA CAGAGCTACC GGTGAAGAGG TTTATCATAC ATACTTTAAA CAAAACTACA AATCTTTTGC ACAAAAGTTC GAAAGCCCGA CTGCATATGC TCATACATGG GGTGATATGT GGCTTACGGC ATTCCTTTCG TATTTGAAAG CTGAAAACAA GGATCAGGAA GTTGTAGACT GGATTGATAC AGAGTTTGGA ATCTGGCTTG AAAACATACT CACAAGATAT GAGAACAATC CATGGAAGAA TGCAATTGTT CCCGGAAACT ACTTCTGGGG AATCAACATG CAGGTTATGA ATGTTCCGAT GGATGCTATC ATAGGTTCAC AGCTTCTTGG AAAATACAGT GACAGAATAG AAAAATTAGG TTTTGGTTCA CTTAACTGGC TGCTTGGTAC AAATCCGCTT CGCTTCAGCT TTGTATCAGG ATATGGAGAG GATTCTGTAA AAGGAGTATT CAGCAATATT TACAATACGG ACGGCAAGCA GGGAATTCCG AAAGGATACA TGCCTGGTGG ACCAAATGCT TATGAAGGTG CAGGCCTGTC AAGGTTTGCA GCAAAATGCT ACACCAGAAG TACCGGTGAC TGGGTAGCCA ACGAACATAC AGTATATTGG AACTCAGCTT TGGTATTTAT GGCTGCTTTT GCAAACCAGG GTTCAGAGGT TAATCCGGGA CCTGCGCCGG AACCGGGAGT AACTCCGAAT CCTACAGAAC CTGCAAAAGT GGTTGACATC AGGATAGATA CTTCTGCTGA AAGAAAGCCA ATCAGCCCGT ATATATACGG AAGCAATCAG GAACTTGATG CAACAGTTAC TGCAAAGAGG TTCGGCGGAA ACAGAACTAC AGGATACAAC TGGGAAAACA ACTTCTCAAA TGCAGGAAGT GACTGGCTGC ATTACAGTGA TACATACCTT TTGGAGGACG GCGGAGTACC TAAGGGAGAG TGGAGTACAC CTGCTTCTGT AGTTACCACG TTCCATGACA AGGCACTTAG CAAAAATGTT CCTTACACAC TTATCACTCT TCAGGCAGCA GGTTATGTTT CCGCAGACGG AAACGGACCG GTTTCCCAGG AAGAAACTGC ACCGTCTTCA AGATGGAAGG AAGTTAAGTT TGAAAAGGGA GCACCTTTCT CACTTACACC GGACACAGAA GATGATTATG TTTACATGGA TGAGTTTGTA AACTATCTTG TAAACAAATA CGGAAATGCA TCCACACCTA CAGGAATAAA GGGTTATTCA ATAGATAACG AGCCGGCATT GTGGAGTCAT ACTCATCCGA GAATTCATCC GGACAATGTA ACTGCCAAAG AGCTTATTGA AAAATCTGTA GCTCTTTCCA AGGCGGTTAA AAAGGTAGAT CCATATGCAG AAATATTCGG ACCTGCTTTG TACGGATTTG CCGCATATGA GACACTTCAG TCAGCTCCTG ACTGGGGAAC TGAAGGAGAA GGATACAGGT GGTTTATAGA TTATTACCTC GATAAGATGA AAAAGGCTTC TGATGAAGAA GGAAAGAGAC TTTTGGACGT ACTTGACGTA CACTGGTATC CGGAAGCCAG GGGCGGCGGT GAAAGAATAT GCTTTGGAGC CGATCCAAGA AATATTGAGA CAAACAAAGC AAGATTGCAG GCGCCCAGAA CATTGTGGGA TCCTACATAT ATTGAAGACA GCTGGATAGG ACAATGGAAG AAGGATTTCC TCCCGATATT ACCTAATCTT TTGGATTCCA TTGAAAAATA TTATCCGGGA ACGAAGCTTG CTATAACTGA ATATGACTAT GGCGGAGGAA ATCATATTAC AGGCGGTATT GCTCAAGCCG ATGTTCTTGG TATATTCGGT AAATACGGTG TTTACCTTGC AACATTCTGG GGAGATGCAA GCAATAACTA TACTGAGGCC GGTATAAACC TTTATACCAA CTACGACGGC AAAGGCGGCA AATTTGGAGA TACATCCGTA AAATGTGAAA CGTCCGACAT AGAAGTAAGC TCTGCTTATG CATCCATTGT CGGTGAAGAT GACAGCAAAC TCCATATCAT TCTTTTGAAC AAGAACTATG ACCAGCCGAC GACATTCAAT TTCTCAATTG ACAGCAGCAA GAACTACACA ATAGGAAATG TATGGGCATT TGACAGAGGA AGCTCCAATA TTACTCAAAG AACTCCTATA GTGAACATAA AGGACAATAC CTTCACATAT ACAGTACCGG CTTTGACAGC GTGCCATATT GTGCTTGAAG CTGCGGAGCC CGTAGTGTAC GGAGACTTGA ACAATGACTC TAAAGTAAAC GCAGTAGACA TTATGATGCT CAAACGATAT ATTCTCGGAA TAATAGATAA TATAAATCTG ACAGCAGCTG ACATTTATTT TGACGGTGTT GTAAATTCAA GTGACTATAA TATAATGAAG AGATATTTGT TAAAGGCAAT AGAAGATATT CCTTATGTTC CGGAAAACCA GGCACCTAAA GCAATATTTA CTTTCTCGCC CGAAGATCCG GTTACTGACG AGAATGTAGT GTTCAATGCA TCAAATTCAA TAGATGAAGA CGGAACAATT GCCTATTATG TATGGGATTT CGGTGACGGA TATGAAGGAA CTTCAACAAC ACCGACTATT ACCTATAAGT ATAAAAACCC CGGAACATAC AAAGTAAAAC TGATTGTTAC AGACAACCAG GGGGCTTCAA GTTCGTTTAC AGCTACCATA AAAGTAACCT CAGCTACCGG GGACAATTCC AAATTCAACT TTGAAGACGG CACGCTGGGA GGATTTACAA CATCCGGAAC AAATGCTACG GGTGTTGTTG TGAACACTAC TGAAAAAGCA TTCAAAGGCG AAAGAGGTCT TAAATGGACT GTAACAAGCG AAGGAGAAGG AACTGCAGAA TTGAAACTTG ACGGAGGTAC TATTGTAGTT CCCGGTACCA CTATGACGTT TAGAATCTGG ATACCTTCCG GTGCGCCTAT TGCTGCCATC CAGCCGTATA TTATGCCTCA TACACCTGAT TGGTCGGAAG TCCTCTGGAA TTCGACATGG AAAGGATACA CCATGGTGAA GACCGATGAC TGGAATGAAA TTACCCTGAC ACTGCCGGAA GACGTGGATC CGACTTGGCC GCAGCAGATG GGTATACAGG TACAGACCAT AGATGAAGGT GAATTCACTA TCTATGTAGA TGCTATTGAC TGGTAA
|
Protein sequence | MAKRRLSLLL VLAIMFTMVV PQISASAETV APEGYRKLLD VQIFKDSPVV GWSGSGMGEL ETIGDTLPVD TTVTYNGLPT LRLNVQTTVQ SGWWISLLTL RGWNTHDLSQ YVENGYLEFD IKGKEGGEDF VIGFRDKVYE RVYGLEIDVT TVISNYVTVT TDWQHVKIPL RDLMKINNGF DPSSVTCLVF SKRYADPFTV WFSDIKITSE DNEKSAPAIK VNQLGFIPEA EKYALVTGFA EELAVSEGDE FAVINAADNS VAYTGKLTLV TEYEPLDSGE KILKADFSDL TVPGKYYISI EGLDNSPKFE IGEGIYGPLV VDAARYFYYQ RQGIELEEPY AQGYPRKDVT PQDAYAVFAS GKKDPIDITK GWYDAGDFGK YVNAGATGVS DLFWAYEMFP SQFVDGQFNI PESGNGVPDI LDEARWELEW MLKMQDKESG GFYPRVQSDN DENIKSRIIR DQNGCTTDDT ACAAGILAHA YLIYKDIDPD FAQECLDAAI NAWKFLEKNP ENIVSPPGPY NVYDDSGDRL WAAASLYRAT GEEVYHTYFK QNYKSFAQKF ESPTAYAHTW GDMWLTAFLS YLKAENKDQE VVDWIDTEFG IWLENILTRY ENNPWKNAIV PGNYFWGINM QVMNVPMDAI IGSQLLGKYS DRIEKLGFGS LNWLLGTNPL RFSFVSGYGE DSVKGVFSNI YNTDGKQGIP KGYMPGGPNA YEGAGLSRFA AKCYTRSTGD WVANEHTVYW NSALVFMAAF ANQGSEVNPG PAPEPGVTPN PTEPAKVVDI RIDTSAERKP ISPYIYGSNQ ELDATVTAKR FGGNRTTGYN WENNFSNAGS DWLHYSDTYL LEDGGVPKGE WSTPASVVTT FHDKALSKNV PYTLITLQAA GYVSADGNGP VSQEETAPSS RWKEVKFEKG APFSLTPDTE DDYVYMDEFV NYLVNKYGNA STPTGIKGYS IDNEPALWSH THPRIHPDNV TAKELIEKSV ALSKAVKKVD PYAEIFGPAL YGFAAYETLQ SAPDWGTEGE GYRWFIDYYL DKMKKASDEE GKRLLDVLDV HWYPEARGGG ERICFGADPR NIETNKARLQ APRTLWDPTY IEDSWIGQWK KDFLPILPNL LDSIEKYYPG TKLAITEYDY GGGNHITGGI AQADVLGIFG KYGVYLATFW GDASNNYTEA GINLYTNYDG KGGKFGDTSV KCETSDIEVS SAYASIVGED DSKLHIILLN KNYDQPTTFN FSIDSSKNYT IGNVWAFDRG SSNITQRTPI VNIKDNTFTY TVPALTACHI VLEAAEPVVY GDLNNDSKVN AVDIMMLKRY ILGIIDNINL TAADIYFDGV VNSSDYNIMK RYLLKAIEDI PYVPENQAPK AIFTFSPEDP VTDENVVFNA SNSIDEDGTI AYYVWDFGDG YEGTSTTPTI TYKYKNPGTY KVKLIVTDNQ GASSSFTATI KVTSATGDNS KFNFEDGTLG GFTTSGTNAT GVVVNTTEKA FKGERGLKWT VTSEGEGTAE LKLDGGTIVV PGTTMTFRIW IPSGAPIAAI QPYIMPHTPD WSEVLWNSTW KGYTMVKTDD WNEITLTLPE DVDPTWPQQM GIQVQTIDEG EFTIYVDAID W
|
| |