Gene Information |
Locus tag | Cthe_1207 |
Symbol | |
ID | 4809899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1440596 |
End bp | 1442119 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106630 |
Product | membrane protein-like protein |
Protein accession | YP_001037632 |
Protein GI | 125973722 |
COG category | [S] Function unknown |
COG ID | [COG5305] Predicted membrane protein |
TIGRFAM ID | |
|
|
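The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Codon count minus one, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1440596, 1442119)
print(length)                              # 1524
print(expected_protein_length_aa(length))  # 507
```

Both results agree with the Gene Length (1524 bp) and Protein Length (507 aa) fields listed in the record.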
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.347408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTCA AGATTCCCAA ATTGGACTAC AAATCTATTA TCATCTGGTC CGTTATTCTA ATCGCAGGTT TTTGGTATCT AACGAGAGGA ATCAGTCATG AAACTCTCTG GTACGATGAA TCTTATTCGG CCGCCATTAT CAACCATTCA ATACCTGATA TAATCAGAAT TACTGCAAAC GACAGTCACC CGCCTTTATA TTTTATAATG CTCAAAATTT TCAGTTCTGT GTTTGGACGC ACTGAATCGG CACTAAGACT TTTTTCGGTA TTGGGATTGC TGGCTTTGGC AACTCTTGGC GCAGGTCCGG TAAGACGGGT GTTTGGCAAA TTCATGGGTA TGATGTATTC ATTCTGTGTA ATTGCCGTCC CCATAAGCCT GTCAATAGGC CAGGAAACAA GGATGTACAC ATGGGCGGCT TTTTTTGTCA CAGCCGGCGC TTTGTACGGA TATCTTGCAT TGCAGGAAAA CAAAAGGTCC GATTGGATTA AGTTTGGACT GGCCACCCTG GCATCCGCTT TTACTCATTA TTACGCACTG CTTGCCGTTA CTGTTTTAAA TGTACTGCTC TTTGTCTGGC TGCTTTTAAG ATTCATAACC ACCAAAGATA AAAAGAAGTT TACAAGTTAC TTAATTACTG CAGGAGCTGT AGTACTGTGC TATTTTCCGT GGATTTTTAT ATTATTCGGT CAAGCAAAAA AGGTTTCCAA ATCTTTCTGG ATACCGCCCG TCACGAAAGA TGTCATCTGG TATACTTTAC AGTATCCCTT TGCAGCCAAG TTCTGGACAT TTAGATTCTC AAGAGTATGT TTTATCTGTG CAGTCGTACT CATACTATGG GGATTGATAT TCTCGGTAAT AAAACGCAAA AAGCAAGGGT TAATGTCTTT GCTTGCAGTG TTGATATATA CTTTGACCCT GGTAAGTGCA ATTGTTCTTT CCAATGTAAT AAGGCCTCTT TTGGTGGAAA GATACATATT CCCGGTTGTC GGACTGTTTG TTCTGGCCTT TGCATACGGA ATATCCATGC TCAACAGCAA AGGTGCTTCA ATATTCGTTT GCGTGGCATT ATTGGCCGTT TCAATTTCGC AAAACCAGTT GATTATCGAG AAAAGATTTA ACGGTCCGAT GAAAGAAGTA TGCAGTTATA TCAATAGCCA GAATATAACT CCGGAAGACG TTTTTATTCA CACAGATGAA CATACATTTG GAACATTTTG TTATTATTAT CCCAACAACA AGCATTATCT TTACCTTCCT CCCGACTTCG ACGGATACAG CGGATATGAT GCTTTTTCAC CTGCCGGCTC CTACGGTTCG GACATCAAAG AATTTATAGA CGGCCGGGAA AAAATATGGT TTGTGGAACG TGAAGGATCC GACATGAGTA AACAAGGGTC CAAATTGCTC GATAAACATA TTTTGCAAAG CAGAGGCATG ATTCTTAAGT TCAACCTTTA CCCGCATTCT TTCTACGCTG TTACTTTAAG GAGGGTTGTT CCCGGAAATG CATTAAAAGA TTAA
|
Protein sequence | MKFKIPKLDY KSIIIWSVIL IAGFWYLTRG ISHETLWYDE SYSAAIINHS IPDIIRITAN DSHPPLYFIM LKIFSSVFGR TESALRLFSV LGLLALATLG AGPVRRVFGK FMGMMYSFCV IAVPISLSIG QETRMYTWAA FFVTAGALYG YLALQENKRS DWIKFGLATL ASAFTHYYAL LAVTVLNVLL FVWLLLRFIT TKDKKKFTSY LITAGAVVLC YFPWIFILFG QAKKVSKSFW IPPVTKDVIW YTLQYPFAAK FWTFRFSRVC FICAVVLILW GLIFSVIKRK KQGLMSLLAV LIYTLTLVSA IVLSNVIRPL LVERYIFPVV GLFVLAFAYG ISMLNSKGAS IFVCVALLAV SISQNQLIIE KRFNGPMKEV CSYINSQNIT PEDVFIHTDE HTFGTFCYYY PNNKHYLYLP PDFDGYSGYD AFSPAGSYGS DIKEFIDGRE KIWFVEREGS DMSKQGSKLL DKHILQSRGM ILKFNLYPHS FYAVTLRRVV PGNALKD
|
| |
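The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of how one might strip the grouping, estimate GC content, and translate the gene sequence. It assumes unambiguous bases (A, C, G, T only) and uses the standard codon assignments, which translation table 11 shares for elongation. Table 11 additionally permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (T, C, A, G); table 11 uses the
# same assignments for elongation and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace used for ten-character grouping and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in this record.
prefix = "ATGAAGTTCA AGATTCCCAA ATTGGACTAC"
print(translate(prefix))  # MKFKIPKLDY
```

The translated prefix matches the first block of the protein sequence above. Applied to the full gene sequence, the same functions could be used to compare the cleaned length, the GC fraction, and the translation against the Gene Length, GC content, and Protein sequence fields.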