Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2179 |
Symbol | |
ID | 4810893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2593348 |
End bp | 2596116 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107583 |
Product | Pectate lyase/Amb allergen |
Protein accession | YP_001038574 |
Protein GI | 125974664 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3866] Pectate lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAA GAGTTTCGGC AAAAATCTCG TTCTTGTTGG TTTTATCAAT GTTTATTGGT ATGTTTTTTG TAATATCCAG AAATTCTTCT GTTCAAGCGG CGCCAAGCTT TGAACTGGTT GGATTTGCAA CGTTGAACGG CGGTACCACG GGCGGTACGG GCGGTAAAGA AGTGACTGCC ACCAGTATAT CGCAGATTAA CGAGCTGTTA AGCCAAAGGA AAAAGAATAA AGACACATCA CCTCTTGTAA TCAAGTTTGA CAGAAAATTG ACCGGTTCAG AAGTTATTGC CTGTAAAAAA GTAAGCAACA TTACATTTCT CGGTGTTGAC GGCAAAGGTG AATTGGAAGG TGCCGGAATC AATATAGTCA AGTCAAAAAA TATCATTGTT CGAAATTTGA AAATTCACCA TACAAGAGCC CCTATGGATG CAATCGGTAT TGAGAATTCC CAGAATATTT GGATTGACCA CTGTGAATTG TATAACGAGA TTGGCGACTG TAACGGTGAC GGAATAGTTG ACCCGAATGA CGGAGACACC GAAGGCGGAG ATGTTGACTG GTATGACGGA CTTTTGGATA TTAAAAAGTC CAGCGAATAT ATAACTGTAT CGTGGAACTA TTTCCATGAC TCCTACAAAA CAAGTCTTAT AGGCTCATCG GACGGTGATG ACTATGACAG AAAAATAACT TTCCATCATA ATATATGTGC AAACGTTAAA TCCCGTACGC CAAGCTACAG AGGCGGTACA GGGCATATGT TCAACAACTA TTATGTGGAT GTTTTAGGCA GCGGTATAAA CTCAAGAGTG GGCGCAAAAT TAAGAATCGA AGGAAACATT TTTGAAAGGG TCGGATGTGG AGCGGTAGAC AGCAAAACAG GCTTTGCGGA AGGACCGATT GGTTCGTATT ACAGCAGTAA AATCGGATAT TGGGATGTAA GAGACAATAT ATTTATTGAT TGTAAAGGAA ATCAGCCTAC TACTTCAACA TGCAGCTTTG AGCCGCCATA CAAGTATGAC CATGTATTGC AGCCTGCAAG CCAGGTTAAA GAAACTGTGC TGCGTTATGC CGGTGTCCAG GGTAATATTG TACTGCCGAC TACATCTCCG ACAAATCCAC CTGCGGCTAC ACCGACGCAA CCGAAAAATA CTCAGCAGCC GACTGTGAAA TATGGTGATT TGAACGGAGA CGGAAATGTC AACTCAACAG ATTCGATTTT AATGAAAAGA TATCTTATGA AAAGTGTTGA TTTAAATGAA GAACAGCTTA AGGCTGCGGA TGTGAACCTT GACGGCAGAG TGAATTCGAC CGATAGGAGC ATACTGAACA GATACCTGTT AAAAATAATT GAGAAGCTGC CTTATGGTAA CGAGATACCT CCGGCAACGC CGACGCCCAC CGATACGCAT CCGAGTTCAA CTCCGACCGA AGGTGTGGTA CACGAGGCGG AAAGCAGCAG CAACCATTTG AAATACGCCA AAGTGGAATC CAACTATGTT GTGTTTGACC AGACGAAAGA TGCGTATATT GAAATGAAAA AAGTTAATTC ACCTGTAACC GGTGAGGTGA CAATTACCAT TGTATATTCA AACGGTTCGG GGAAAAGCCT GCCGATGGAA ATCAAAGTAA ACAGCACTAC CATTGAAAGC AATAAAGAAT TTCCAAGCAC AGGAGCATGG AATATATGGA GCACTCTCAG TGTAAAAGCA AATATGAACA GCGGTTCGGA TAATGTTATA AGAATCAAAA CACGTTCAAA TGACGGTGGA CCGCGAATTG ACAAAGTAAT TGTAAGTGCC GGGGGGTCAG GATACACACC GGCAACAACG CCGCCCTCAA CAACCAAGCC TGTAACCACA CCAACGCCAA CAGTGCCAAC ATCAACATCT GCAATCCCTG TGGAAGGAGA CATTATTCTT TCACCCAATG GTTCAATGAC GCTTCAGCAA GCCATTGACT TGATTCAACC GGGAAAGACA ATATATTTGA AAGCAGGAAC ATACAAATAT TCAAAAACTA TACTCATAAA AGAAGGAAAT GACGGTTTGC CAAACGCAAG AAAGACGTTG GCGGCCTATG GGGACGGAGA AGTAATTATT GACTTCTCAG CGATGACGGA AAATTCCTCC AACAGAGGTA TTGTTCTTGA TGCCAAATAC TGGCATGTCA AAGGAATAAC CATAAAAGGA GCGGGAGACA ACGGAATGCT GCTCTCAGGA CACTACAATA TTATTGAAAA ATGTACGTTC AGAGAGAACC GCGACTCAGG ACTCCAGCTG TCAAGATACA ATACCTCCTA TAATACAAAA GACAAATGGC CCAGCAATAA TCTTATAATA GACTGCCTGT CAACCATGAA CATGGATTCC CGCCGTGAGG ATGCCGACGG TTTTGCGGCA AAACTTACCT GCGGAGAAGG TAACGTATTT AGAAATTGTA TAGCCATTAA CAACTGTGAT GACGGGTGGG ACTTGTATAC CAAGAAAGAA ACAGGAGCTA TAGGCGTTGT AACTTTGGAA AACTGCCAGG CTATAGGAAA CGGTTATGGT GTGAACGGAA AAGATACAGG CGGAGACGGT AACGGGTACA AACTTGGAGA TGACACCGCT TCAGTACCCC ATATACTTAT AAATTGTGTG GCTAAAAACA ATAAAAAGCA TGGATTTACC GGCAACGGTA ATCCGGCGAA AATTGTATTG CAAAACTGCA CCGGTTCGGG TAACGGAGGA AAACTGTTTG ACAGACTTGA CAATGCAATA TTTAAATAA
|
Protein sequence | MQKRVSAKIS FLLVLSMFIG MFFVISRNSS VQAAPSFELV GFATLNGGTT GGTGGKEVTA TSISQINELL SQRKKNKDTS PLVIKFDRKL TGSEVIACKK VSNITFLGVD GKGELEGAGI NIVKSKNIIV RNLKIHHTRA PMDAIGIENS QNIWIDHCEL YNEIGDCNGD GIVDPNDGDT EGGDVDWYDG LLDIKKSSEY ITVSWNYFHD SYKTSLIGSS DGDDYDRKIT FHHNICANVK SRTPSYRGGT GHMFNNYYVD VLGSGINSRV GAKLRIEGNI FERVGCGAVD SKTGFAEGPI GSYYSSKIGY WDVRDNIFID CKGNQPTTST CSFEPPYKYD HVLQPASQVK ETVLRYAGVQ GNIVLPTTSP TNPPAATPTQ PKNTQQPTVK YGDLNGDGNV NSTDSILMKR YLMKSVDLNE EQLKAADVNL DGRVNSTDRS ILNRYLLKII EKLPYGNEIP PATPTPTDTH PSSTPTEGVV HEAESSSNHL KYAKVESNYV VFDQTKDAYI EMKKVNSPVT GEVTITIVYS NGSGKSLPME IKVNSTTIES NKEFPSTGAW NIWSTLSVKA NMNSGSDNVI RIKTRSNDGG PRIDKVIVSA GGSGYTPATT PPSTTKPVTT PTPTVPTSTS AIPVEGDIIL SPNGSMTLQQ AIDLIQPGKT IYLKAGTYKY SKTILIKEGN DGLPNARKTL AAYGDGEVII DFSAMTENSS NRGIVLDAKY WHVKGITIKG AGDNGMLLSG HYNIIEKCTF RENRDSGLQL SRYNTSYNTK DKWPSNNLII DCLSTMNMDS RREDADGFAA KLTCGEGNVF RNCIAINNCD DGWDLYTKKE TGAIGVVTLE NCQAIGNGYG VNGKDTGGDG NGYKLGDDTA SVPHILINCV AKNNKKHGFT GNGNPAKIVL QNCTGSGNGG KLFDRLDNAI FK
|
| |