Gene Information |
Locus tag | Cthe_2091 |
Symbol | |
ID | 4810951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2486389 |
End bp | 2487687 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107498 |
Product | hypothetical protein |
Protein accession | YP_001038491 |
Protein GI | 125974581 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
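The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(2486389, 2487687)
protein = expected_protein_length_aa(length)
print(length, protein)  # prints "1299 432"
```

Under these assumptions the computed values agree with the Gene Length (1299 bp) and Protein Length (432 aa) fields.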
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000589193 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAATA GATTTAAATA TCACAGGTCA TTTTTAATAT TTACCCACGA AGATTCAAGA AACGGAGAAG GGAGAGAACC GTCAGGATAT GTAAAAATAG AAATCAGAGA CGGCAGGGGA AAGCTTTGCT GTCAGGTGTC AAACCTTAGG GAAAGCAACG ACGCAGTTTA TAAACTGTAT TTAATAAATG TGGATGAAGC GGGACTTAAG GCGGCTTGCC CGGGAGTCAT TGATTTGGCG AAAGGAAAAG GTGAATTGAT ATGGAATTTT GACCCCGCAA ATATTGACGG AACGGGAATG AGTGTCTGTG ACATCAATGT TGCGGCAGTT ATCCTTGAGA ACAATAATGA ACGGTACATA GACAATATTC TCTGCCCCCT TGCGGCGTAT AAAAACGGGA AAATTGAGTG GAGACAGAAA ATGAAAAAGT TTTTGAACAA GCAAAGGGCG CAAGAAGCGG AAGAAACGGA AGAAAGACAA GATTTTGAAG AAGCACAGGA AACCGGGGAA AGTCAAAGGA CCCAGGATAT CCCAAAAAAA GAAGAGATGC AAAAAACGGA ACGTGCGCAA AGAACGAAAG ATGATGCAAT GCCGAAAACG GGTTCTGAAA AGAGTGAAGA TGGCGGCAAA ATAGGGGAAG ATATTGCAAG TGGAAATGAA AATATTGAGG ATAAAAATAA AAACATGATT GAAAGCGTTC AGAGAAAAGA GGACAATAAG AGCGGTGCTG ATACCATTGT GGAAGAAAAC GGTCGTGGCG GGGTTTCTGA CGATGATGAG AGAAAAACAA AAATTGAGGC TGAGAATGAA GTTGAAGATA AGAATGAAAA AGAAAATGAA AATGAAACTG AAAGTAAAGA TGAGTTTGGG AATAAAAGAA AAGCCGGCAG TGAAATAAAT TTTGAAGAAT TGGTGGCAAA ATTTGATAGA TGCTTTGAAA AGTGTAATCC GTTTATGTCC GGCAGAAAAG ATTACAGGTG GTGGAAGATA GCAAGTCCGG TTCATTTGAA CAACATACTG TACCAAATGA AAGTAGATGT ACCCATCCTT TTTAATCCCC TGGTGCTTAT GGCCCACTTT AAATACAGGC ATTTGATTGT GGGTACTTAT GAAGACAAGG CCAGAAATCT TCGTTATATT GTCTGCGGTG TTCCCGGAGT GTATTGGGTT GACGAGAAGC CTTTCGGCAA AATATGCAGG TGGGCCCAGG TTGACGGAAA CGTGCCGAAA TACGGTGCAT TTGGATATTG GCTTGTTTAT ATAAATCCCA ATACAGGCGA GATATTGAAC GTTGGCTAG
|
Protein sequence | MDNRFKYHRS FLIFTHEDSR NGEGREPSGY VKIEIRDGRG KLCCQVSNLR ESNDAVYKLY LINVDEAGLK AACPGVIDLA KGKGELIWNF DPANIDGTGM SVCDINVAAV ILENNNERYI DNILCPLAAY KNGKIEWRQK MKKFLNKQRA QEAEETEERQ DFEEAQETGE SQRTQDIPKK EEMQKTERAQ RTKDDAMPKT GSEKSEDGGK IGEDIASGNE NIEDKNKNMI ESVQRKEDNK SGADTIVEEN GRGGVSDDDE RKTKIEAENE VEDKNEKENE NETESKDEFG NKRKAGSEIN FEELVAKFDR CFEKCNPFMS GRKDYRWWKI ASPVHLNNIL YQMKVDVPIL FNPLVLMAHF KYRHLIVGTY EDKARNLRYI VCGVPGVYWV DEKPFGKICR WAQVDGNVPK YGAFGYWLVY INPNTGEILN VG
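The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for rejoining such blocks and computing a GC percentage follows. The helper names are hypothetical, and the stop-codon set assumes the standard stops (TAA, TAG, TGA) used by translation table 11. Only short fragments copied from the Gene sequence field are used here, so the snippet does not reproduce the record's whole-gene GC content.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stops assumed for translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks and the final block of the Gene sequence field.
head = clean_sequence("ATGGATAATA GATTTAAATA")
tail = clean_sequence("GTTGGCTAG")
print(head.startswith("ATG"), ends_with_stop(tail))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field.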