Gene Cthe_2179 details

Gene Information

Locus tag: Cthe_2179
Symbol:
ID: 4810893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2593348
End bp: 2596116
Gene Length: 2769 bp
Protein Length: 922 aa
Translation table: 11
GC content: 43%
IMG OID: 640107583
Product: Pectate lyase/Amb allergen
Protein accession: YP_001038574
Protein GI: 125974664
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3866] Pectate lyase
TIGRFAM ID:
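
The coordinate and length fields above agree with each other, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Cthe_2179.
length_bp = gene_length(2593348, 2596116)
print(length_bp)                  # 2769
print(protein_length(length_bp))  # 922
```

Both results match the Gene Length and Protein Length fields reported in the record.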


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAAA GAGTTTCGGC AAAAATCTCG TTCTTGTTGG TTTTATCAAT GTTTATTGGT 
ATGTTTTTTG TAATATCCAG AAATTCTTCT GTTCAAGCGG CGCCAAGCTT TGAACTGGTT
GGATTTGCAA CGTTGAACGG CGGTACCACG GGCGGTACGG GCGGTAAAGA AGTGACTGCC
ACCAGTATAT CGCAGATTAA CGAGCTGTTA AGCCAAAGGA AAAAGAATAA AGACACATCA
CCTCTTGTAA TCAAGTTTGA CAGAAAATTG ACCGGTTCAG AAGTTATTGC CTGTAAAAAA
GTAAGCAACA TTACATTTCT CGGTGTTGAC GGCAAAGGTG AATTGGAAGG TGCCGGAATC
AATATAGTCA AGTCAAAAAA TATCATTGTT CGAAATTTGA AAATTCACCA TACAAGAGCC
CCTATGGATG CAATCGGTAT TGAGAATTCC CAGAATATTT GGATTGACCA CTGTGAATTG
TATAACGAGA TTGGCGACTG TAACGGTGAC GGAATAGTTG ACCCGAATGA CGGAGACACC
GAAGGCGGAG ATGTTGACTG GTATGACGGA CTTTTGGATA TTAAAAAGTC CAGCGAATAT
ATAACTGTAT CGTGGAACTA TTTCCATGAC TCCTACAAAA CAAGTCTTAT AGGCTCATCG
GACGGTGATG ACTATGACAG AAAAATAACT TTCCATCATA ATATATGTGC AAACGTTAAA
TCCCGTACGC CAAGCTACAG AGGCGGTACA GGGCATATGT TCAACAACTA TTATGTGGAT
GTTTTAGGCA GCGGTATAAA CTCAAGAGTG GGCGCAAAAT TAAGAATCGA AGGAAACATT
TTTGAAAGGG TCGGATGTGG AGCGGTAGAC AGCAAAACAG GCTTTGCGGA AGGACCGATT
GGTTCGTATT ACAGCAGTAA AATCGGATAT TGGGATGTAA GAGACAATAT ATTTATTGAT
TGTAAAGGAA ATCAGCCTAC TACTTCAACA TGCAGCTTTG AGCCGCCATA CAAGTATGAC
CATGTATTGC AGCCTGCAAG CCAGGTTAAA GAAACTGTGC TGCGTTATGC CGGTGTCCAG
GGTAATATTG TACTGCCGAC TACATCTCCG ACAAATCCAC CTGCGGCTAC ACCGACGCAA
CCGAAAAATA CTCAGCAGCC GACTGTGAAA TATGGTGATT TGAACGGAGA CGGAAATGTC
AACTCAACAG ATTCGATTTT AATGAAAAGA TATCTTATGA AAAGTGTTGA TTTAAATGAA
GAACAGCTTA AGGCTGCGGA TGTGAACCTT GACGGCAGAG TGAATTCGAC CGATAGGAGC
ATACTGAACA GATACCTGTT AAAAATAATT GAGAAGCTGC CTTATGGTAA CGAGATACCT
CCGGCAACGC CGACGCCCAC CGATACGCAT CCGAGTTCAA CTCCGACCGA AGGTGTGGTA
CACGAGGCGG AAAGCAGCAG CAACCATTTG AAATACGCCA AAGTGGAATC CAACTATGTT
GTGTTTGACC AGACGAAAGA TGCGTATATT GAAATGAAAA AAGTTAATTC ACCTGTAACC
GGTGAGGTGA CAATTACCAT TGTATATTCA AACGGTTCGG GGAAAAGCCT GCCGATGGAA
ATCAAAGTAA ACAGCACTAC CATTGAAAGC AATAAAGAAT TTCCAAGCAC AGGAGCATGG
AATATATGGA GCACTCTCAG TGTAAAAGCA AATATGAACA GCGGTTCGGA TAATGTTATA
AGAATCAAAA CACGTTCAAA TGACGGTGGA CCGCGAATTG ACAAAGTAAT TGTAAGTGCC
GGGGGGTCAG GATACACACC GGCAACAACG CCGCCCTCAA CAACCAAGCC TGTAACCACA
CCAACGCCAA CAGTGCCAAC ATCAACATCT GCAATCCCTG TGGAAGGAGA CATTATTCTT
TCACCCAATG GTTCAATGAC GCTTCAGCAA GCCATTGACT TGATTCAACC GGGAAAGACA
ATATATTTGA AAGCAGGAAC ATACAAATAT TCAAAAACTA TACTCATAAA AGAAGGAAAT
GACGGTTTGC CAAACGCAAG AAAGACGTTG GCGGCCTATG GGGACGGAGA AGTAATTATT
GACTTCTCAG CGATGACGGA AAATTCCTCC AACAGAGGTA TTGTTCTTGA TGCCAAATAC
TGGCATGTCA AAGGAATAAC CATAAAAGGA GCGGGAGACA ACGGAATGCT GCTCTCAGGA
CACTACAATA TTATTGAAAA ATGTACGTTC AGAGAGAACC GCGACTCAGG ACTCCAGCTG
TCAAGATACA ATACCTCCTA TAATACAAAA GACAAATGGC CCAGCAATAA TCTTATAATA
GACTGCCTGT CAACCATGAA CATGGATTCC CGCCGTGAGG ATGCCGACGG TTTTGCGGCA
AAACTTACCT GCGGAGAAGG TAACGTATTT AGAAATTGTA TAGCCATTAA CAACTGTGAT
GACGGGTGGG ACTTGTATAC CAAGAAAGAA ACAGGAGCTA TAGGCGTTGT AACTTTGGAA
AACTGCCAGG CTATAGGAAA CGGTTATGGT GTGAACGGAA AAGATACAGG CGGAGACGGT
AACGGGTACA AACTTGGAGA TGACACCGCT TCAGTACCCC ATATACTTAT AAATTGTGTG
GCTAAAAACA ATAAAAAGCA TGGATTTACC GGCAACGGTA ATCCGGCGAA AATTGTATTG
CAAAACTGCA CCGGTTCGGG TAACGGAGGA AAACTGTTTG ACAGACTTGA CAATGCAATA
TTTAAATAA
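
The gene sequence is displayed in space-separated blocks of 10 nt, 60 nt per line, so it has to be joined into one string before any downstream use. The sketch below shows one way to do that, followed by a basic reading-frame sanity check. The helper names are hypothetical. The start-codon set is a commonly used subset for translation table 11, not the full list, and only the first and last lines of the record are used here as an excerpt.

```python
def join_blocks(lines):
    """Remove all whitespace from blocked sequence lines and upper-case the result."""
    return "".join("".join(line.split()) for line in lines).upper()


def looks_like_cds(seq,
                   starts=("ATG", "GTG", "TTG"),   # assumed subset of table 11 starts
                   stops=("TAA", "TAG", "TGA")):
    """True if the length is a multiple of 3 with a start codon first and a stop codon last."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# First and last lines of the gene sequence as displayed in the record.
first_line = "ATGCAAAAAA GAGTTTCGGC AAAAATCTCG TTCTTGTTGG TTTTATCAAT GTTTATTGGT"
last_line = "TTTAAATAA"

excerpt = join_blocks([first_line, last_line])
print(len(join_blocks([first_line])))  # 60
print(looks_like_cds(excerpt))         # True
```

Passing every line of the gene sequence to `join_blocks` instead of the two-line excerpt should give a string whose length equals the Gene Length field.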
 
Protein sequence
MQKRVSAKIS FLLVLSMFIG MFFVISRNSS VQAAPSFELV GFATLNGGTT GGTGGKEVTA 
TSISQINELL SQRKKNKDTS PLVIKFDRKL TGSEVIACKK VSNITFLGVD GKGELEGAGI
NIVKSKNIIV RNLKIHHTRA PMDAIGIENS QNIWIDHCEL YNEIGDCNGD GIVDPNDGDT
EGGDVDWYDG LLDIKKSSEY ITVSWNYFHD SYKTSLIGSS DGDDYDRKIT FHHNICANVK
SRTPSYRGGT GHMFNNYYVD VLGSGINSRV GAKLRIEGNI FERVGCGAVD SKTGFAEGPI
GSYYSSKIGY WDVRDNIFID CKGNQPTTST CSFEPPYKYD HVLQPASQVK ETVLRYAGVQ
GNIVLPTTSP TNPPAATPTQ PKNTQQPTVK YGDLNGDGNV NSTDSILMKR YLMKSVDLNE
EQLKAADVNL DGRVNSTDRS ILNRYLLKII EKLPYGNEIP PATPTPTDTH PSSTPTEGVV
HEAESSSNHL KYAKVESNYV VFDQTKDAYI EMKKVNSPVT GEVTITIVYS NGSGKSLPME
IKVNSTTIES NKEFPSTGAW NIWSTLSVKA NMNSGSDNVI RIKTRSNDGG PRIDKVIVSA
GGSGYTPATT PPSTTKPVTT PTPTVPTSTS AIPVEGDIIL SPNGSMTLQQ AIDLIQPGKT
IYLKAGTYKY SKTILIKEGN DGLPNARKTL AAYGDGEVII DFSAMTENSS NRGIVLDAKY
WHVKGITIKG AGDNGMLLSG HYNIIEKCTF RENRDSGLQL SRYNTSYNTK DKWPSNNLII
DCLSTMNMDS RREDADGFAA KLTCGEGNVF RNCIAINNCD DGWDLYTKKE TGAIGVVTLE
NCQAIGNGYG VNGKDTGGDG NGYKLGDDTA SVPHILINCV AKNNKKHGFT GNGNPAKIVL
QNCTGSGNGG KLFDRLDNAI FK