Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2950 |
Symbol | |
ID | 4810838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3468753 |
End bp | 3470417 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108373 |
Product | Pectate lyase/Amb allergen |
Protein accession | YP_001039341 |
Protein GI | 125975431 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3866] Pectate lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.821566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCAGAT GTTGGGTAAA ATGTATTTCT TCAATGTTGG CAATTTCTTT AATATTATTT GCTTCTTTAA CGTTAACGGC TTCATTAATA GTAGCCTCGT TCTTACCTGC AACCACCACT TATGCCCAGA CCGTGTCACC TTTGGATAGA CCTATAGGAT GGGCGTCGGA AGCCGGAGGA ACCACAGGTG GCGGAAATGC TGCACCGGTG ATTGTAACAA GTGCAAGTGA ACTTCAAAAT CTTGTCAAAG ATAACACTCC AAGAGTTATT TACGTACAGG GAAACATCGG CGGCAACTAT ACCGTAGGAT CCAACAAAAC CATCATCGGC TTACCCGGCG CAACGACAGG CAGTTGGACG TTTAAAGGCT CATCCAACGT TATATTGAGA AACCTTAAAA TAAGAGGAAA CGGTGCAGAC GGCGATGCCG TGACCGTTAC GGACTATTCC CATCATATTT GGTTTGACCA CCTTGATTTG GCCGACTCAA CCGACGAAAA TCTCAGCATA AAACGCGGAA GCGATTACAT AACCATTTCC TGGTGCAAAT ACTGGTTTTC AAGGGACGGA GGCCACACAT TCGGAGGCCT GATCGGACAT AGTGATAATA ACGCAGCGCA GGACGAGGGA AGGCTTAGGG TAACCTATCA TCACAACTGG TATTCAAAGG GAGTAACAGA GCGTATGCCC CGTGTCCGTT TTGGAAAAGT TCATATTTTC AACAACCTGT TCGACGCTCC CGGCAACAAT TATGTCATTC GCTGTGGTTA CAAAGCAAAC ATCCGCTCTG AAGGCAATGT CTTCGTTAAC ATGAAAAATT GCTTTGACTT TAGCACTTCT TCTCCGGACT CCGTACTCCA GAGCATTAAT GATTTGTTTA TAGGAAATTG TAGCGGAACA ACCGGAAGAG GTATTGCTTT CGTTCCTCCC TATCAATACA CCGTCGAACC GACCGCGGGA CTTAAAGAAA AGATTGAAGC AGGGGCGGGA GCAACGTTAA ATGTCCCGGG AACATTCTCC CCCACACCGT CACCTTCAAA TACCCCTACT GCAACACCAA CACCTGCAAG TATAGTGTAC GGAGATTTGA ATAATGACGG CAGGACAAAT TCAACTGACT ATTCATTAAT GAAAAGATAC CTTCTTGGTT CCATAAGCTT TACAAATGAA CAGCTTAAAG CAGCGGATGT AAATCTCGAC GGTAAAGTAA ACTCCTCTGA TTATACTGTA TTAAGAAGAT TCTTACTGGG TTCGATCGAC TTGTTGCCAT ATAACGGAAC CGCGACTTAC CAGGCTGAAG ATGCAGTTTT CAGCGGCGCT ATATTTGAAA CAAAAAATGC AGGCTACACA GGAACAGGCT ATGTAAATTA TGACAATGTA CCCGGCGGAT ATATCGAATG GACACTGAAC ATAGCTAATG CAGGAACATA TACCCTGACA CTTACATATG CAAACGGAAC TTCGTCAAAC AGGACGGTTG ACATAAGCGT AAACGGTAAT ATCGTTGCCT CCGGTGTTGT ATTTGGAGGA ACAGGGTCAT GGACACAGTG GCAGACCAAG AGTATAACTG CCTCATTAAA TTCCGGAGTT AACAAAATCA GAGTTACCGG CACATCATCA GACGGAGGTC CCAACATCGA TAAACTCGAA ATAAGGAGAA ATTAA
|
Protein sequence | MSRCWVKCIS SMLAISLILF ASLTLTASLI VASFLPATTT YAQTVSPLDR PIGWASEAGG TTGGGNAAPV IVTSASELQN LVKDNTPRVI YVQGNIGGNY TVGSNKTIIG LPGATTGSWT FKGSSNVILR NLKIRGNGAD GDAVTVTDYS HHIWFDHLDL ADSTDENLSI KRGSDYITIS WCKYWFSRDG GHTFGGLIGH SDNNAAQDEG RLRVTYHHNW YSKGVTERMP RVRFGKVHIF NNLFDAPGNN YVIRCGYKAN IRSEGNVFVN MKNCFDFSTS SPDSVLQSIN DLFIGNCSGT TGRGIAFVPP YQYTVEPTAG LKEKIEAGAG ATLNVPGTFS PTPSPSNTPT ATPTPASIVY GDLNNDGRTN STDYSLMKRY LLGSISFTNE QLKAADVNLD GKVNSSDYTV LRRFLLGSID LLPYNGTATY QAEDAVFSGA IFETKNAGYT GTGYVNYDNV PGGYIEWTLN IANAGTYTLT LTYANGTSSN RTVDISVNGN IVASGVVFGG TGSWTQWQTK SITASLNSGV NKIRVTGTSS DGGPNIDKLE IRRN
|
| |