Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0267 |
Symbol | |
ID | 4808550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 328214 |
End bp | 330229 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105679 |
Product | type 3a, cellulose-binding |
Protein accession | YP_001036699 |
Protein GI | 125972789 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACATT ACACGGGAAT CATTTTGAAA CTGGAAAGTG ACAGGGCAAT CGTATTGACT GACGGGCTTG ATTTTATGGA ACTGAAGCTC AAACCCGGCA TGCAACGCGG CCAGCATGTT ATTTTCGATG AATCCGACTT GTATTCAGCC GGTCTCATTA CGAGGTATAA AAGTATCATT ATGCCTTTTT CTGCGTTTGC CGCTGCCGCT GCAGTTTTTC TTGTAATACT TTTCAGTCTG AGATTCGTTT CAATTTCCCA GGAATATGCT TATATAGATG TTGATATAAA TCCAAGCATT GGGCTTGTAA TTGATAAAAA GGAAAAGGTT ATAGACGCAA AACCTTTAAA TAATGACGCA AAACCGATTC TCGATGAAGC GGCCCCAAAA GACATGCCGT TATACGACGC TTTATCGAAA ATACTGGATA TTTCAAAGAA AAACGGATAT ATAAATTCCG CAGACAATAT TGTTCTGTTC TCCGCATCAA TAAACTCCGG CCGGAACAAT GTTTCCGAAA GTGACAAAGG CATTCAGGAA ATAATATCCA CCCTCAAAGA CGTTGCAAAG GATGCAGGAG TCAAATTTGA AATTATACCG TCTACCGAAG AGGACAGACA AAAAGCTTTA GACCAAAATC TTTCCATGGG AAGGTATGCT ATATATGTAA AAGCTGTGGA AGAAGGTGTC AATCTTAACC TGGAAGATGC CCGAAACTTA AGCGTCTCCG AGATTCTTGG CAAAGTAAAC ATTGGCAAAT TCGCCATCTC CGATACTCCG GAAGACTCCG GAATTATGCC TGCGATTTCG GTTCCGGCTG AACCTGTGCC AAGTGTTACA CCGGCTTATA CCGCTGTACC GGAAAAAACA GAAGCGCAAC CTGTTGATAT ACCAAAATCT TCACCAACAC CGGCAAGTTT TACGGCACAT GTTCCCACAC CGCCAAAAAC TCCGTCAATC CCGCATACAT CCGGGCCTGC CATCGTGCAC ACTCCGGCTG CCGATAAAAC CACCCCGACT TTCACGGGGT CATCCACACC TGTACCAACC AATGTAGTAG CAATTGCATC CACACCTGTA CCTGTATCTA CGCCCAAGCC TGTATCAACA CCTGCATATT CATCTACTCC CACACCCGAA TCTACACCTG TACCTGTGTC CACACCCAAG CCAGCATCAA CACCTACACC TGCATCTACA CCCAAGCCTG TATCGACACC CACACATGTA TCTACACCCA AGCCTATATC AACACCTACA TCCACACCTA GACCTGCGTC CACACCTAAG CCTACATCAA CGCCTACACC GGAATCTACG CCCAAGCCTA CGTCAACACC TGCACCCGTA TCCACACCTA CATCGACACC AATACCTACA TATACATCCA CTCCTGCATC CACACCAATA CCTGCATATA CATCCACACC TACGTCCATA CCAACGCTTA CACCTGCAAC ATCACCTGCG CCCACATCAT CGCCGACACC GATTCCATCG CCAGCGCCTA CGGAAACTGA CTTGTTAACA AAGATTGAGC TTCAGGCATA CAATCACATC AGAACCTCCG AAACCAAAGA GCTTCAACCC AGAATCAAGC TGATAAATAC CGGAAATACG CCAATAACGC TCTCTGAGGT AAAAATCAGG TATTATTATA CAAAGGATCA AGTAATCAAT GAGATTTACA CATGCGACTG GTCAAACATT ACTTCATCCA AAATAACAGG AACTGTGGTT CAAATGTCAA ACCCAAAACC TAATGCCGAC AGCTACGTTG AAATAGGTTT CACTAACAGT GCCGGTGTTC TAAATCCCGG TGAGTACGTT GAAATAATCA GCAGAATCGG AAACAGTTAT GCTTTAAGTC TGGCAACTCC GCCTTATTCA GAATGGAATT ATATGTACGA CCAAAATTCC GACTACTCCT TCAACAACAG TTCTTCAGAT TTTGTAGTCT GGGACAAGAT TACAGTGTAT ATATCAGGAA CTCTATATTG GGGAATTGAA CCTTAA
|
Protein sequence | MSHYTGIILK LESDRAIVLT DGLDFMELKL KPGMQRGQHV IFDESDLYSA GLITRYKSII MPFSAFAAAA AVFLVILFSL RFVSISQEYA YIDVDINPSI GLVIDKKEKV IDAKPLNNDA KPILDEAAPK DMPLYDALSK ILDISKKNGY INSADNIVLF SASINSGRNN VSESDKGIQE IISTLKDVAK DAGVKFEIIP STEEDRQKAL DQNLSMGRYA IYVKAVEEGV NLNLEDARNL SVSEILGKVN IGKFAISDTP EDSGIMPAIS VPAEPVPSVT PAYTAVPEKT EAQPVDIPKS SPTPASFTAH VPTPPKTPSI PHTSGPAIVH TPAADKTTPT FTGSSTPVPT NVVAIASTPV PVSTPKPVST PAYSSTPTPE STPVPVSTPK PASTPTPAST PKPVSTPTHV STPKPISTPT STPRPASTPK PTSTPTPEST PKPTSTPAPV STPTSTPIPT YTSTPASTPI PAYTSTPTSI PTLTPATSPA PTSSPTPIPS PAPTETDLLT KIELQAYNHI RTSETKELQP RIKLINTGNT PITLSEVKIR YYYTKDQVIN EIYTCDWSNI TSSKITGTVV QMSNPKPNAD SYVEIGFTNS AGVLNPGEYV EIISRIGNSY ALSLATPPYS EWNYMYDQNS DYSFNNSSSD FVVWDKITVY ISGTLYWGIE P
|
| |