Gene Cthe_2950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2950 
Symbol 
ID4810838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3468753 
End bp3470417 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content44% 
IMG OID640108373 
ProductPectate lyase/Amb allergen 
Protein accessionYP_001039341 
Protein GI125975431 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3866] Pectate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.821566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCAGAT GTTGGGTAAA ATGTATTTCT TCAATGTTGG CAATTTCTTT AATATTATTT 
GCTTCTTTAA CGTTAACGGC TTCATTAATA GTAGCCTCGT TCTTACCTGC AACCACCACT
TATGCCCAGA CCGTGTCACC TTTGGATAGA CCTATAGGAT GGGCGTCGGA AGCCGGAGGA
ACCACAGGTG GCGGAAATGC TGCACCGGTG ATTGTAACAA GTGCAAGTGA ACTTCAAAAT
CTTGTCAAAG ATAACACTCC AAGAGTTATT TACGTACAGG GAAACATCGG CGGCAACTAT
ACCGTAGGAT CCAACAAAAC CATCATCGGC TTACCCGGCG CAACGACAGG CAGTTGGACG
TTTAAAGGCT CATCCAACGT TATATTGAGA AACCTTAAAA TAAGAGGAAA CGGTGCAGAC
GGCGATGCCG TGACCGTTAC GGACTATTCC CATCATATTT GGTTTGACCA CCTTGATTTG
GCCGACTCAA CCGACGAAAA TCTCAGCATA AAACGCGGAA GCGATTACAT AACCATTTCC
TGGTGCAAAT ACTGGTTTTC AAGGGACGGA GGCCACACAT TCGGAGGCCT GATCGGACAT
AGTGATAATA ACGCAGCGCA GGACGAGGGA AGGCTTAGGG TAACCTATCA TCACAACTGG
TATTCAAAGG GAGTAACAGA GCGTATGCCC CGTGTCCGTT TTGGAAAAGT TCATATTTTC
AACAACCTGT TCGACGCTCC CGGCAACAAT TATGTCATTC GCTGTGGTTA CAAAGCAAAC
ATCCGCTCTG AAGGCAATGT CTTCGTTAAC ATGAAAAATT GCTTTGACTT TAGCACTTCT
TCTCCGGACT CCGTACTCCA GAGCATTAAT GATTTGTTTA TAGGAAATTG TAGCGGAACA
ACCGGAAGAG GTATTGCTTT CGTTCCTCCC TATCAATACA CCGTCGAACC GACCGCGGGA
CTTAAAGAAA AGATTGAAGC AGGGGCGGGA GCAACGTTAA ATGTCCCGGG AACATTCTCC
CCCACACCGT CACCTTCAAA TACCCCTACT GCAACACCAA CACCTGCAAG TATAGTGTAC
GGAGATTTGA ATAATGACGG CAGGACAAAT TCAACTGACT ATTCATTAAT GAAAAGATAC
CTTCTTGGTT CCATAAGCTT TACAAATGAA CAGCTTAAAG CAGCGGATGT AAATCTCGAC
GGTAAAGTAA ACTCCTCTGA TTATACTGTA TTAAGAAGAT TCTTACTGGG TTCGATCGAC
TTGTTGCCAT ATAACGGAAC CGCGACTTAC CAGGCTGAAG ATGCAGTTTT CAGCGGCGCT
ATATTTGAAA CAAAAAATGC AGGCTACACA GGAACAGGCT ATGTAAATTA TGACAATGTA
CCCGGCGGAT ATATCGAATG GACACTGAAC ATAGCTAATG CAGGAACATA TACCCTGACA
CTTACATATG CAAACGGAAC TTCGTCAAAC AGGACGGTTG ACATAAGCGT AAACGGTAAT
ATCGTTGCCT CCGGTGTTGT ATTTGGAGGA ACAGGGTCAT GGACACAGTG GCAGACCAAG
AGTATAACTG CCTCATTAAA TTCCGGAGTT AACAAAATCA GAGTTACCGG CACATCATCA
GACGGAGGTC CCAACATCGA TAAACTCGAA ATAAGGAGAA ATTAA
 
Protein sequence
MSRCWVKCIS SMLAISLILF ASLTLTASLI VASFLPATTT YAQTVSPLDR PIGWASEAGG 
TTGGGNAAPV IVTSASELQN LVKDNTPRVI YVQGNIGGNY TVGSNKTIIG LPGATTGSWT
FKGSSNVILR NLKIRGNGAD GDAVTVTDYS HHIWFDHLDL ADSTDENLSI KRGSDYITIS
WCKYWFSRDG GHTFGGLIGH SDNNAAQDEG RLRVTYHHNW YSKGVTERMP RVRFGKVHIF
NNLFDAPGNN YVIRCGYKAN IRSEGNVFVN MKNCFDFSTS SPDSVLQSIN DLFIGNCSGT
TGRGIAFVPP YQYTVEPTAG LKEKIEAGAG ATLNVPGTFS PTPSPSNTPT ATPTPASIVY
GDLNNDGRTN STDYSLMKRY LLGSISFTNE QLKAADVNLD GKVNSSDYTV LRRFLLGSID
LLPYNGTATY QAEDAVFSGA IFETKNAGYT GTGYVNYDNV PGGYIEWTLN IANAGTYTLT
LTYANGTSSN RTVDISVNGN IVASGVVFGG TGSWTQWQTK SITASLNSGV NKIRVTGTSS
DGGPNIDKLE IRRN