Gene Information |
Locus tag | Cthe_0059 |
Symbol | |
ID | 4808754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 90979 |
End bp | 92439 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105468 |
Product | type 3a, cellulose-binding |
Protein accession | YP_001036493 |
Protein GI | 125972583 |
COG category | |
COG ID | |
TIGRFAM ID | |
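The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 90979
end_bp = 92439
reported_gene_length = 1461     # bp
reported_protein_length = 486   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that does not encode a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length)  # 1461 486
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length,
      remainder == 0)
```

Under these assumptions the reported gene length (1461 bp) and protein length (486 aa) agree with the listed coordinates.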
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0360636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGAT TGGGAATAAT ATATGAAATT CAGGGCATGA AAGCTGTAGT TCTGACAAGC GAAGGCGAAT TTTTGATTAT TCGCAGGCGC AAAGATATGA AGGTTGGACA GCAGGTGAGT TTTGAAAATG AGGATATATA TAATGTCAGG GGAAAGAGAT TTTTATATGT TGCTGCCGCC GTTTCAAGTG TTGCGGCAGT GCTGGTTGTA ATGTTTCTGT ATTTTCAGTC TGCATTTTTG AGTAATACCG ATAATATTTA CGGATATATA TGCGTTGATA TAAATCCCAG CGTTGAGCTG GTAATTGATG AAACTTGCAG GGTTTTGGAA GTTAGACCTC AAAATAAAGA CGGAGAGCAA TTGATTTCGG GATTGGAACT TTTGGACAAA AATGTTGAAG ATGTTGTTTA CGAGCTTATT AACAGGTCCA TAAGTTTTGG TTTTGTCAAA GCTGATGATA ACAGAAAAAT TGTTCTTATC AGCGGTGCTC TTAATGATAA ACGGAACGAA CTCAAAACGA AAAAAGAAAA CGATGAGGCT GAATTGACGG AATTGCTTGA CAATATCAAA GCCAGGGTAG ATAGAATAGA TAATATTAAA GTGCGCACCA TAACGGCAAC TTCCAGGGAA AGAAAAGATG CATTGAAATA CGGGCTTTCC ATGGGAAAAT ACTGCCTATA CCTTGAAGCG CAGGAGTTGA ACGGCAGCAT TACCATTGAC GAAGTGCATG ATATGAGTAT TTCAGATATG ATAGAGAAAT TGGAACAGAT GAAGCTGGCA TTAAAAGATG AGGCAAGTCC AAAACTGCAA ACCACGCCGA CGCTTGGAGG GGAAACTGCA CAAATATCGC CGGAATCCAT GCAACATTCC ACAGTGCCCG GGTTGCCGGA AACTCCATCA TCTTCAGAGA AGACAATCGC ACCGACACTC CATGGAACTC CAGGTGTGCC TGATGAGAAA ACATTACAGC CTTCAACGCC GACAGAAAGC TCAGAATATG TGCAAGACGG TACAAAAGGG CTTAAAATAC AATATTACAG CAGAAAGCCC CATGATTCCG CAGGGATCGA CTTCAGCTTC AGAATGTTTA ACACGGGAAA TGAAGCAATT GACCTTAAAG ATGTTAAAGT AAGGTATTAT TTCAAAGAAG ATGTTTCGAT TGATGAAATG AACTGGGCGG TATACTTTTA CAGTTTGGGT AGTGAAAAGG ATGTTCAGTG CAGGTTTTAT GAGCTTCCCG GAAAGAAAGA GGCAAACAAA TATCTTGAAA TTACATTCAA ATCGGGGACG CTTTCTCCGA ACGATGTAAT GTATATCACA GGTGAGTTTT ATAAGAATGA TTGGACAAAA TTCGAGCAAA GGGACGATTA TTCCTACAAT CCTGCGGATT CCTATTCGGA TTGGAAAAGG ATGACTGCAT ACATTTCGAA CAAACTGGTA TGGGGAATTG AGCCCAATTG A
|
Protein sequence | MNRLGIIYEI QGMKAVVLTS EGEFLIIRRR KDMKVGQQVS FENEDIYNVR GKRFLYVAAA VSSVAAVLVV MFLYFQSAFL SNTDNIYGYI CVDINPSVEL VIDETCRVLE VRPQNKDGEQ LISGLELLDK NVEDVVYELI NRSISFGFVK ADDNRKIVLI SGALNDKRNE LKTKKENDEA ELTELLDNIK ARVDRIDNIK VRTITATSRE RKDALKYGLS MGKYCLYLEA QELNGSITID EVHDMSISDM IEKLEQMKLA LKDEASPKLQ TTPTLGGETA QISPESMQHS TVPGLPETPS SSEKTIAPTL HGTPGVPDEK TLQPSTPTES SEYVQDGTKG LKIQYYSRKP HDSAGIDFSF RMFNTGNEAI DLKDVKVRYY FKEDVSIDEM NWAVYFYSLG SEKDVQCRFY ELPGKKEANK YLEITFKSGT LSPNDVMYIT GEFYKNDWTK FEQRDDYSYN PADSYSDWKR MTAYISNKLV WGIEPN
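The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons with translation table 11, which is the table listed in the record. The sketch below is illustrative only: the `translate` helper and the constant names are hypothetical, the codon table is built from the standard amino-acid assignment string (table 11 shares these assignments with the standard code and differs only in its permitted start codons), and only the first 30 and last 9 nucleotides of the gene are used rather than the full sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Spaces (as used in the record's 10-base blocks) are ignored.
    Alternative start codons are not handled; this gene starts with ATG.
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides and last 9 nucleotides of the gene sequence above.
gene_head = "ATGAACAGAT TGGGAATAAT ATATGAAATT"
gene_tail = "CCCAATTG A"

print(translate(gene_head))  # MNRLGIIYEI
print(translate(gene_tail))  # PN
```

The first ten residues match the start of the listed protein sequence, and the final codons give the terminal residues followed by a TGA stop codon.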