Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2543 |
Symbol | |
ID | 4809299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3010934 |
End bp | 3012331 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107959 |
Product | spore germination protein-like protein |
Protein accession | YP_001038938 |
Protein GI | 125975028 |
COG category | [R] General function prediction only |
COG ID | [COG5401] Spore germination protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000191345 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATGCA ATTTTAATTT TGAAGACATA ATTAAATATT CCGAAAATCA TCTTTCGGAA GAAGAGAAAA AGAGAATTAA AGAGCATCTG GATGTATGTG AAAAATGCCG AAAACGCTAT GGCGTACTGA AATTTACAGA GGCTTACGCA AAAGACAGTT CCATGACAAG TGAAAGCATC ACTAAAAATG TGATGGAAAC AATTGACGTA AACAGATACA GCAAAAGTAA AAAGTTTTTG TTCGGCAGGA ATTTTTACAG AGCCCTGCCT GTTATAAAAC CAGTTTTGGC ATCCGCGGCA GTATTTGTTG TGGTAATGGT GGGCATAACA AGTTTCGGAA GTCTCAGAGG ATTAATCAAC AATTCCGGCG ATGTGAAACC TAATCCAACC AATCCGATTG CCAATTCTCA AAACACAAAC CCGGCCGCTT TGCCTGAAAG TACAGCCCCG GCAGTTCAAA ATCCTGTGGA AAAGAAGGTA ATCACGCTTT ACTATTCAAA TTCCAATGCG GATAAAGTGG TTGCAGAAAA AAGAGAAGTT GAAATAAGCA AAGATACACA AATAGAAAGA TTGGTGTTTG AAGAATTGCA AAAGGGTCCG AAAAACGAGG GATTAGTTGC CACAATACCA AAAGGGACCA GACTTTTATC AGTATCCACC GAAAACGGCA TTTGCACACT TAATCTTTCC AAAGAGTTCG TTGACAACCA TCCCGGCGGA ACAGCAGGTG AAACAATGAC TTTATTTTCA ATAGTCAATA CAATGACTGA GCTTCCCGGT ATCGAGAAAG TACAGTTTCT TATTGAAGGC CAAAAACAGG ATGCATATAT ACATGTTGCA TTTAGTGAAC CCTTTAAAAG AAACAACAGT ATTATCCAAA AGAGCCCAAG TGAGATAAAA GCCGAAGTTG AAGCTAAGTC TCAGGAAGCG ATTAAGGCTA TCAAGGAAAA AGATATGGAA AAGTTGGCCC AAATGGTACA TCCTGAAAAA GGTGTGTTGT TCTCCCCCTA TTCCCATATT GAATTGGAAA AACATAAAGT ATTTACAAAG GATCAATTAA AAAATCTTAT GGAGTCGGAA GAAGTATATA TCTGGGGAGA ATATGACGGC TCCGGTGACC CGATTAAGTT AACCTTTGCC CAGTATTTCG ACAAATTTGT ATATGATCAT GATTTTGCGA ACGCCGAAAA AGTGGCATAC AATGAAATAC AGCAATCTGG AAACACAATT GTCAATATTT CTGATGTATA TCCGGAAGGA AAGTTTATGG ATTATTACTT CCCCGGATTC ACTCCCGAAT ATGACGGAAT GGACTGGGCA AGTTTAAGAT TAGTTTTTGA GGAGTATGAC GGCCAGTGGT ATCTTGTATG TATTGCCCAT GGCCAATGGA CTATTTAA
|
Protein sequence | MKCNFNFEDI IKYSENHLSE EEKKRIKEHL DVCEKCRKRY GVLKFTEAYA KDSSMTSESI TKNVMETIDV NRYSKSKKFL FGRNFYRALP VIKPVLASAA VFVVVMVGIT SFGSLRGLIN NSGDVKPNPT NPIANSQNTN PAALPESTAP AVQNPVEKKV ITLYYSNSNA DKVVAEKREV EISKDTQIER LVFEELQKGP KNEGLVATIP KGTRLLSVST ENGICTLNLS KEFVDNHPGG TAGETMTLFS IVNTMTELPG IEKVQFLIEG QKQDAYIHVA FSEPFKRNNS IIQKSPSEIK AEVEAKSQEA IKAIKEKDME KLAQMVHPEK GVLFSPYSHI ELEKHKVFTK DQLKNLMESE EVYIWGEYDG SGDPIKLTFA QYFDKFVYDH DFANAEKVAY NEIQQSGNTI VNISDVYPEG KFMDYYFPGF TPEYDGMDWA SLRLVFEEYD GQWYLVCIAH GQWTI
|
| |