Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2197 |
Symbol | |
ID | 4811062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2622821 |
End bp | 2625607 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107603 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001038592 |
Protein GI | 125974682 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.71663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCTATT TTTTTGCCGG TATATGGTAC ATTTTTAGAG TAGCATTAAC AAAAACCTTT ACCGTACCTT CAGATTATTC CGGCAAGAAA GTTTTTATAC AATTCGACGG AGCTTATATG AACAGCCAGG TATGGATAAA CGGGACATAC TTGGGAATTC GTCCATATGG ATACAGCTCT TTTGAATATG ACTTGACTCC ATACCTTAAC ATAGGCGGGA AAAACGTAAT TGCAGTCAAA ATCAACAACA ACCAGCCCAA CAGTCGCTGG TATTCGGGAA GCGGCATTTA CCGCAACGTG TGGCTGACAG TTTTGGACCC GGTTCATGTG GATTATTGCG GAATGTTTAT AACCACTCCG AATGTAAGCA GAGATTCGGC TACAGCCAAT GTCAGCACGA AAGTGGTAAA CCAGGGCAAT TCGGAAAAAA CAGTTTCTTT AAAAACCATA ATTATGGATG CAAATGGCAA CCAGGTTGCT TCTGATACAT CTTCAGCAGT TAACATATCA GCCGGTAGTG ACTATACATT TAACCAAAAC CTTACAGTAT CAAATCCCAA TTTATGGTCT CCTGATTCCC CGTATCTTTA CATGGTTCAA ACTCAAGTAA TTGTTGATGG AAAAGTGGCT GATACCTATA AGTCAACCAT GGGATTTCGT TATCTTAATT TTAGCAGCAC TACCGGTTTT TCTTTAAACG GCGTTAAAAC GAAAATAAAG GGAGTATGTA TGCATCATGA CTTGGGCGCT TTAGGAGCGG CAGTTAATTA CCGTGCTATT GAAAGGCAGC TTCAGATTAT GAAAGAGATG GGCTGCAATG CTATCCGCAC CGCGCATAAT CCTCCTGATC CGCAAGTGTT GGAAATATGC GACAGATTGG GTCTGATGGT TATGGATGAA GCCTTTGACT GCTGGGAAAC CGGAAAGACT GCCAATGACT ATCATCTGTA TTTCAAAGAC TGGGCCAAAA GGGACCTTCA GGATATGGTT AAAAGAGACC GCAATCATCC GTCGGTTATT ATGTGGAGCA TAGGCAATGA GATTCCCAAT GCTACCGTTG AAACTGCCAC AAAGCTGAAA AACTGGGTGA AGGAAATAGA TCCCACCCGA CCGGTAACAT GGGGTTGTTT TGCTATAAAT ATGTCGGACG ATACATACAA ACGGATTGCA AGTGTCCTTG ATTTGGTCGG ATACAACTAT TTCCCCTTTA TGTATGACCA GGGACACAAG GAACATCCCG AATGGATAAT GTTCGGCAGT GAAACAAGCT CGGCGGTAAG AAGCCGGGGT GTATATAAAA CTCCCACCAA CCAGAATATA CTGACCGGCA ATGACAACCA GTGCTCATCT TATGACAACA GCGTGGTTGC CTGGGGTAAC AGCGCAGAAT CGTCATATTA TGAAATCAAC AGACGGGATT ACATGCTTGG GGAATTTGTT TGGACGGGAT TTGACTATAT TGGTGAACCG ACACCGTACA AATGGCCGTC GAAAAGCTCA TATTTCGGAA TAGTTGACAC ATGCGGATTC CCCAAAGATA TATATTATTT CTATCAAAGC AAATGGAGCG ACAAGCCGAT GGTGCATATC CTGCCCCATT GGAACTGGTC GAACGGTACT ACCGTAGAGG TGTGGGCTTA TAGCAACTGC GATACGGTGG AGCTTTTCTT AAACGGCACT TCCCTTGGAG TAAAGAGTAT GGGAAATAAC GGGCATGTTT CGTGGAATGT TCCCTGGGTT CCGGGTACAC TCAGAGCAAA AGCTGTCAAA GGAAATATAG TGGTTTATGA CGAGGTAACC ACTGCCGGTA ATCCTGCAAA AATTCAGTTA AAACCGGACA GGACAACTAT TACGGCTGAC GGCAAGGACT TGGTATTTAT AGAAACTGAT ATTGTAGACA GTAACGGTGT TCTTGTCCCG ACGGCAAGCA ATACTGTGAA CTTTTCCATA TCCGGACCGG GAGTAATTGT CGGAGTTGAC AATGGAAATG CTGCAAGCCT GGAACCTTAC AAGGCAAACA GCAGGCAGGC TTTTAACGGC AAGTGCCTCG TGATAGTCCA GGCAACCAAA ACCAACGGGA CTATTATAGT AACGGCCAGT TCGAACGGAT TGGAATCTGA CAGAGTGATT ATTAAGACAA CCGGAGGGGA ACCTGAACCG ACTCCTGTGC CAAGGTCTGC TTTTACACGA ATCGAAGCGG AAAGCTATGA TGCTCAGTCA GGAATCCAGA CTGAAGATTG CAGCGAAGGC GGTAAGGATG TGGGATATAT TGAAAACGGA GATTTTGTCG TCTACAAGGC TATTGATTTT GGCAGAGGAG CAGCAAGTTT TAAAGCGAGA GTAGCCAGCG CTACAAGCGG AGGCAATATT GAACTTAGGA TTGACAGTAT TGACGGACCT GTAGTTGGCA TTTGTCCGGT TGCCGGCACC GGCGGTTGGC AGGAATGGGC TGATGCGACG TGTGAGGTAA GTGACCTGAA GGGAGTCCAT GATCTTTATC TGAAATTTAC CGGAGGCAGC GGTTATCTGC TTAATGTGAA TTGGTTCACC TTTGTTGAAG GAAACAGTGA TGAGGATCTG GGTGATTTAA ACGGTGACGG AAAAGTAAAC TCGACAGACC TTCAGCTAAT GAAAATGCAC GTACTCAGGC AAAGACAGCT TACAGGAACA AGCCTCTTAA ATGCAGATGT AAACAGGGAC GGCAAAGTGG ATTCTACCGA TGTCGCATTA TTAAAAAGAT ATATATTGAG ACAAATATCT TCTTTTGATG ATTATGCTCG GTCTTAA
|
Protein sequence | MAYFFAGIWY IFRVALTKTF TVPSDYSGKK VFIQFDGAYM NSQVWINGTY LGIRPYGYSS FEYDLTPYLN IGGKNVIAVK INNNQPNSRW YSGSGIYRNV WLTVLDPVHV DYCGMFITTP NVSRDSATAN VSTKVVNQGN SEKTVSLKTI IMDANGNQVA SDTSSAVNIS AGSDYTFNQN LTVSNPNLWS PDSPYLYMVQ TQVIVDGKVA DTYKSTMGFR YLNFSSTTGF SLNGVKTKIK GVCMHHDLGA LGAAVNYRAI ERQLQIMKEM GCNAIRTAHN PPDPQVLEIC DRLGLMVMDE AFDCWETGKT ANDYHLYFKD WAKRDLQDMV KRDRNHPSVI MWSIGNEIPN ATVETATKLK NWVKEIDPTR PVTWGCFAIN MSDDTYKRIA SVLDLVGYNY FPFMYDQGHK EHPEWIMFGS ETSSAVRSRG VYKTPTNQNI LTGNDNQCSS YDNSVVAWGN SAESSYYEIN RRDYMLGEFV WTGFDYIGEP TPYKWPSKSS YFGIVDTCGF PKDIYYFYQS KWSDKPMVHI LPHWNWSNGT TVEVWAYSNC DTVELFLNGT SLGVKSMGNN GHVSWNVPWV PGTLRAKAVK GNIVVYDEVT TAGNPAKIQL KPDRTTITAD GKDLVFIETD IVDSNGVLVP TASNTVNFSI SGPGVIVGVD NGNAASLEPY KANSRQAFNG KCLVIVQATK TNGTIIVTAS SNGLESDRVI IKTTGGEPEP TPVPRSAFTR IEAESYDAQS GIQTEDCSEG GKDVGYIENG DFVVYKAIDF GRGAASFKAR VASATSGGNI ELRIDSIDGP VVGICPVAGT GGWQEWADAT CEVSDLKGVH DLYLKFTGGS GYLLNVNWFT FVEGNSDEDL GDLNGDGKVN STDLQLMKMH VLRQRQLTGT SLLNADVNRD GKVDSTDVAL LKRYILRQIS SFDDYARS
|
| |