Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1472 |
Symbol | |
ID | 4810622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1789679 |
End bp | 1792381 |
Gene Length | 2703 bp |
Protein Length | 900 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106893 |
Product | carbohydrate-binding family 11 protein |
Protein accession | YP_001037894 |
Protein GI | 125973984 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.980604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GGCTTTTAGT TTCTTTTTTG GTGTTAAGCA TAATTGTAGG ATTACTTTCT TTTCAGTCGC TTGGTAATTA CAACAGTGGT TTAAAAATCG GTGCTTGGGT GGGAACCCAG CCGTCAGAAT CAGCAATTAA GAGTTTTCAG GAACTTCAGG GTAGAAAGCT TGATATTGTC CACCAGTTTA TTAACTGGTC AACTGATTTT TCCTGGGTAA GACCTTATGC CGACGCTGTT TATAATAACG GCTCAATATT AATGATTACC TGGGAACCTT GGGAATACAA CACTGTAGAT ATCAAAAACG GTAAAGCGGA TGCTTACATA ACCAGAATGG CGCAAGATAT GAAAGCCTAT GGCAAGGAAA TTTGGTTAAG ACCTCTTCAT GAAGCCAACG GAGACTGGTA TCCATGGGCC ATAGGATATT CTTCAAGAGT AAACACAAAC GAAACTTACA TAGCCGCTTT CAGACATATT GTCGATATTT TCCGTGCCAA CGGAGCCACC AACGTCAAAT GGGTGTTTAA TGTAAACTGC GACAATGTAG GTAACGGCAC AAGTTATCTG GGTCATTATC CCGGAGATAA TTATGTAGAC TACACCTCAA TTGACGGATA CAACTGGGGT ACCACTCAAA GCTGGGGAAG CCAATGGCAA AGCTTTGATC AGGTTTTCTC CAGAGCCTAC CAAGCTTTGG CATCAATAAA CAAACCCATC ATTATAGCAG AGTTTGCATC AGCTGAAATA GGCGGAAACA AGGCAAGATG GATTACAGAA GCATATAACT CTATAAGAAC ATCCTACAAC AAGGTAATTG CTGCAGTATG GTTTCACGAG AACAAAGAAA CCGACTGGAG AATCAACTCA AGTCCTGAAG CCCTTGCAGC ATACAGGGAG GCAATAGGAG CCGGTTCATC AAATCCTACC CCTACTCCAA CTTGGACCTC TACTCCACCA TCAAGCTCAC CAAAGGCTGT CGACCCCTTT GAAATGGTTA GAAAAATGGG TATGGGAACA AACCTCGGAA ACACTCTCGA AGCTCCCTAT GAAGGCTCCT GGTCCAAGTC TGCCATGGAA TATTATTTTG ATGATTTTAA AGCTGCAGGA TATAAAAACG TAAGAATCCC TGTAAGATGG GACAACCATA CAATGAGGAC ATACCCGTAT ACCATTGACA AAGCCTTTTT GGACAGGGTT GAGCAAGTGG TTGACTGGTC ACTTTCAAGA GGTTTTGTTA CAATTATAAA TTCTCACCAT GATGACTGGA TCAAGGAAGA CTATAACGGA AACATAGAAC GGTTTGAAAA GATATGGGAA CAGATTGCGG AAAGGTTTAA AAACAAATCC GAAAATCTTC TGTTTGAAAT CATGAATGAG CCTTTCGGTA ACATTACAGA CGAACAAATA GACGACATGA ACAGCAGAAT ATTAAAAATA ATCAGAAAGA CCAATCCAAC CCGTATTGTT ATAATAGGCG GAGGTTATTG GAACAGTTAT AATACGCTTG TAAACATTAA AATTCCTGAT GACCCATACT TAATCGGAAC TTTCCATTAC TATGACCCAT ATGAATTTAC TCACAAGTGG AGAGGTACAT GGGGTACTCA GGAAGACATG GATACTGTAG TAAGAGTATT TGATTTTGTT AAGAGTTGGT CTGACAGAAA CAATATCCCG GTATATTTTG GAGAATTTGC CGTAATGGCT TATGCCGACA GAACTTCCCG TGTAAAATGG TATGATTTTA TAAGTGATGC GGCCCTGGAG CGCGGTTTTG CATGTTCCGT ATGGGATAAC GGCGTTTTTG GTTCATTGGA TAATGACATG GCTATTTACA ACAGAGATAC CCGTACCTTT GACACTGAAA TCCTCAATGC ACTATTTAAT CCCGGAACAT ATCCGTCTTA TTCTCCGAAA CCTTCACCAA CTCCAAGACC GACCAAACCG CCCGTAACAC CGGCTGTCGG TGAAAAAATG CTGGATGATT TTGAGGGTGT GTTAAATTGG GGTTCATACT CCGGTGAAGG TGCAAAAGTT TCAACAAAAA TTGTGTCCGG AAAAACAGGA AACGGCATGG AAGTCAGCTA CACCGGGACA ACGGACGGCT ACTGGGGAAC AGTATACAGT TTACCGGACG GCGATTGGTC AAAATGGCTT AAAATCTCTT TTGACATTAA GTCCGTTGAC GGTTCTGCCA ATGAAATCAG ATTTATGATT GCTGAAAAAA GCATAAACGG TGTGGGAGAC GGAGAACACT GGGTTTACTC AATAACTCCC GACAGTTCGT GGAAAACTAT AGAAATACCG TTCTCCAGCT TTAGAAGAAG ACTTGATTAT CAGCCGCCTG GACAGGATAT GAGCGGTACT TTGGATCTTG ACAATATAGA TTCAATTCAC TTCATGTATG CCAACAACAA GTCGGGAAAA TTTGTCGTAG ACAATATCAA GCTGATTGGT GCTACTTCCG ATCCGACTCC TTCAATAAAA CACGGAGATT TGAACTTCGA TAATGCAGTG AATTCTACAG ACTTGTTAAT GCTTAAAAGG TATATCCTCA AATCTTTGGA ACTCGGTACA TCTGAGCAGG AGGAAAAATT CAAAAAAGCG GCAGATTTAA ACAGGGACAA CAAGGTCGAC TCCACTGACT TGACAATTTT GAAAAGATAC TTGCTGAAAG CCATCAGTGA AATACCCATA TAA
|
Protein sequence | MKKRLLVSFL VLSIIVGLLS FQSLGNYNSG LKIGAWVGTQ PSESAIKSFQ ELQGRKLDIV HQFINWSTDF SWVRPYADAV YNNGSILMIT WEPWEYNTVD IKNGKADAYI TRMAQDMKAY GKEIWLRPLH EANGDWYPWA IGYSSRVNTN ETYIAAFRHI VDIFRANGAT NVKWVFNVNC DNVGNGTSYL GHYPGDNYVD YTSIDGYNWG TTQSWGSQWQ SFDQVFSRAY QALASINKPI IIAEFASAEI GGNKARWITE AYNSIRTSYN KVIAAVWFHE NKETDWRINS SPEALAAYRE AIGAGSSNPT PTPTWTSTPP SSSPKAVDPF EMVRKMGMGT NLGNTLEAPY EGSWSKSAME YYFDDFKAAG YKNVRIPVRW DNHTMRTYPY TIDKAFLDRV EQVVDWSLSR GFVTIINSHH DDWIKEDYNG NIERFEKIWE QIAERFKNKS ENLLFEIMNE PFGNITDEQI DDMNSRILKI IRKTNPTRIV IIGGGYWNSY NTLVNIKIPD DPYLIGTFHY YDPYEFTHKW RGTWGTQEDM DTVVRVFDFV KSWSDRNNIP VYFGEFAVMA YADRTSRVKW YDFISDAALE RGFACSVWDN GVFGSLDNDM AIYNRDTRTF DTEILNALFN PGTYPSYSPK PSPTPRPTKP PVTPAVGEKM LDDFEGVLNW GSYSGEGAKV STKIVSGKTG NGMEVSYTGT TDGYWGTVYS LPDGDWSKWL KISFDIKSVD GSANEIRFMI AEKSINGVGD GEHWVYSITP DSSWKTIEIP FSSFRRRLDY QPPGQDMSGT LDLDNIDSIH FMYANNKSGK FVVDNIKLIG ATSDPTPSIK HGDLNFDNAV NSTDLLMLKR YILKSLELGT SEQEEKFKKA ADLNRDNKVD STDLTILKRY LLKAISEIPI
|
| |