Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0246 |
Symbol | |
ID | 4808594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 297573 |
End bp | 300035 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640105658 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001036678 |
Protein GI | 125972768 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CTCTTGTATT TTTAACGGCC TTGAGTCTGA TATTCACGCT GTTTATCAGT TATTCCCTGT CAGCAGGACC GGCTTCAACC AAGTATGGGG ATCTCAATGC CGATGGCAAG ATCAATTCGA CAGATTACAA CTTGGGCAAG AGATTGATTC TGAGAACAAT TTCGGAGCTT CCCATTTCCA ATGGATCTGT AGCCTTTGAC CTTAACGGTG ATTCAAAGGT TGATTCAACG GACCTTACTG CGCTGAAAAG ATACCTGCTG GGTGTTATTG ACAAGTTTCC GGTGGGCACG GATATACCAT CCCAAACACA AAAGACGAGA TATCAGGCTG AGGATGCGAT GTTGTACAAG GCATTCGAGG AAACAATCCA TGCAGGTTAT GACGGGAGAA GTTATGTAAA TTACGACAAC GAACCCGGAG GATATATTGA GTGGAATGTA AATGTATCCA GTTCAGGTAC ATATAAGCTT ATTTTCAGAT ATGCAAACGG ATCAAACAAT AACAGACCTA TGGAAATAAG AGTAAATTCC AATCTGGTTG CAGGTAGTCT GGACTTTTAT CCGACTTCAG CCTGGACTGT ATGGAATGAC CAAAGCATAG TTGTAACTTT AAATGCGGGC AACAACGTTA TCAGGGCAAC GGGAATTGCC TCGGACGGCG GACCGAATGT GGATTATCTT GAAGTAATTC CGACAAATGA ACCACCAGCA CCCACCCCTT CACCGACGCC TACAGTTGGA CCTACACCTG CTGGTGCGCG TCAGATGGAG AGACTGGACA GAGGGCTTGT GGCGGTAAAA GTAAACAACG GAGTATTTTT AAGCTGGAGA ATGTTTGGTA CGGATCCTTC CAACATTGCA TTCAACTTGT ACCGCAACGG AACAAAGATA AATTCCACAC CGATTACCGG TGCGACAAAC TATGTGGATA CCGGCGGAAC GACAAGTTCA ACATACACGG TACGTGCGGT TATTAACGGA CAGGAACAGG AGGCATCAAA ACCTGTAAGT GTCTGGGCTC AGAATTATCT TCAGATTCCC ATTCAGCCAC CGTCAAGCGC GTACGAGGCT AATGACTGCA GTGCCGCAGA CCTTGACGGA GACGGAGAAT ATGAAATTGT GTTAAAGTGG GAGCCAAATA ACGCAAAAGA CAATTCCCAA TCCGGATATA CCGATAATGT GTATTTGGAT GCTTACAAGC TGAACGGCAC ACGTTTGTGG AGAATAGATC TTGGAAGAAA TATCCGTGCC GGTGCCCACT ATACCCAGTT TATGGTTTAT GACCTTGACG GCGACGGCAA GGCAGAGGTT GCATGCAAGA CAGCTGACGG AACAAGAGAC GGAAAAGGAA ATGTGATAGG CAATCCAAAT GCGGATTATC GTAATTCAAG CGGATACATA CTTTCAGGAC CTGAATACCT GACAGTATTC GATGGACAGA CAGGTGCCGC CATTACAACG GTGGATTATG ATCCTCCGAG AGGAAATGTC TCTTCATGGG GTGACAATTA CGGAAACAGA GTGGACCGTT TCCTGGCGTG CATAGCATAC CTTGACGGTC AAAGACCAAG CCTTGTCATG TGCCGCGGAT ATTATACAAG AAGCGTGCTT GTGGCCTGGG ATTTCAGAAA CGGAAGGCTT ACAAAGAGAT GGGTATTTGA CGGCAACAAT TACAGCGGAT ATAACGGACA GGGTAATCAC AACCTGAGTG TGGCCGATGT TGACGGCGAC GGAAGAGATG AGATTATTTA CGGTGCATGT ACCATTGATG ACAACGGAAA AGGATTGTAT ACTTCAGGAC TTGGCCATGG GGACGCTCTG CATGTGGGAG ATCTTAATCC CAACAGACCG GGCCTTGAAA TTTGGAGCTG CTTTGAAAGC TCCGGCGGCG CTGCTTTGCG TGATGCAAGG ACAGGAGAAG TGTTGTTCAG ATGGCATAGA TCCAGTGATA CAGGAAGGGC TTGTGCGGCT GATATAACGG CATCATCTCC GGGAGCTGAG CTTTGGGCTG CAGGTTCTCC GCTGTTCAGC TGTACCGGTC AGAATATAGG AACTGCTCCA AGCCAGATTA ACTTTGCTAT ATGGTGGGAC GGAGACGAAC TCAGGGAGCT CCTTGACGGC ATTACAATAA GCAAATACGG TGTAGGAACA TTGTTTACCG CGACCGGATG TGCTTCCAAC AACGGTACAA AATCAACTCC GTGCCTCCAG GCAGACCTCC TTGGAGACTG GAGAGAAGAA GTAATCTTTA GAACTTCGGA CAACAGGTAT TTGAGAATAT ACACCACAAC GGCAACAACA AACAGACGTA TTTACACATT AATGCATGAT CCGGTTTACA GATTGGGTAT AGCCTGGCAG AATGTAGCAT ACAATCAGCC GCCGCACACA AGCTTCTTTA TCGGAGCCGG CATGGCTGAG CCTCCGAAGC CAAATATTTA CCTTGTGCCG TAA
|
Protein sequence | MKKTLVFLTA LSLIFTLFIS YSLSAGPAST KYGDLNADGK INSTDYNLGK RLILRTISEL PISNGSVAFD LNGDSKVDST DLTALKRYLL GVIDKFPVGT DIPSQTQKTR YQAEDAMLYK AFEETIHAGY DGRSYVNYDN EPGGYIEWNV NVSSSGTYKL IFRYANGSNN NRPMEIRVNS NLVAGSLDFY PTSAWTVWND QSIVVTLNAG NNVIRATGIA SDGGPNVDYL EVIPTNEPPA PTPSPTPTVG PTPAGARQME RLDRGLVAVK VNNGVFLSWR MFGTDPSNIA FNLYRNGTKI NSTPITGATN YVDTGGTTSS TYTVRAVING QEQEASKPVS VWAQNYLQIP IQPPSSAYEA NDCSAADLDG DGEYEIVLKW EPNNAKDNSQ SGYTDNVYLD AYKLNGTRLW RIDLGRNIRA GAHYTQFMVY DLDGDGKAEV ACKTADGTRD GKGNVIGNPN ADYRNSSGYI LSGPEYLTVF DGQTGAAITT VDYDPPRGNV SSWGDNYGNR VDRFLACIAY LDGQRPSLVM CRGYYTRSVL VAWDFRNGRL TKRWVFDGNN YSGYNGQGNH NLSVADVDGD GRDEIIYGAC TIDDNGKGLY TSGLGHGDAL HVGDLNPNRP GLEIWSCFES SGGAALRDAR TGEVLFRWHR SSDTGRACAA DITASSPGAE LWAAGSPLFS CTGQNIGTAP SQINFAIWWD GDELRELLDG ITISKYGVGT LFTATGCASN NGTKSTPCLQ ADLLGDWREE VIFRTSDNRY LRIYTTTATT NRRIYTLMHD PVYRLGIAWQ NVAYNQPPHT SFFIGAGMAE PPKPNIYLVP
|
| |