Gene Cthe_1207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1207 
Symbol 
ID4809899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1440596 
End bp1442119 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content40% 
IMG OID640106630 
Productmembrane protein-like protein 
Protein accessionYP_001037632 
Protein GI125973722 
COG category[S] Function unknown 
COG ID[COG5305] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.347408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTCA AGATTCCCAA ATTGGACTAC AAATCTATTA TCATCTGGTC CGTTATTCTA 
ATCGCAGGTT TTTGGTATCT AACGAGAGGA ATCAGTCATG AAACTCTCTG GTACGATGAA
TCTTATTCGG CCGCCATTAT CAACCATTCA ATACCTGATA TAATCAGAAT TACTGCAAAC
GACAGTCACC CGCCTTTATA TTTTATAATG CTCAAAATTT TCAGTTCTGT GTTTGGACGC
ACTGAATCGG CACTAAGACT TTTTTCGGTA TTGGGATTGC TGGCTTTGGC AACTCTTGGC
GCAGGTCCGG TAAGACGGGT GTTTGGCAAA TTCATGGGTA TGATGTATTC ATTCTGTGTA
ATTGCCGTCC CCATAAGCCT GTCAATAGGC CAGGAAACAA GGATGTACAC ATGGGCGGCT
TTTTTTGTCA CAGCCGGCGC TTTGTACGGA TATCTTGCAT TGCAGGAAAA CAAAAGGTCC
GATTGGATTA AGTTTGGACT GGCCACCCTG GCATCCGCTT TTACTCATTA TTACGCACTG
CTTGCCGTTA CTGTTTTAAA TGTACTGCTC TTTGTCTGGC TGCTTTTAAG ATTCATAACC
ACCAAAGATA AAAAGAAGTT TACAAGTTAC TTAATTACTG CAGGAGCTGT AGTACTGTGC
TATTTTCCGT GGATTTTTAT ATTATTCGGT CAAGCAAAAA AGGTTTCCAA ATCTTTCTGG
ATACCGCCCG TCACGAAAGA TGTCATCTGG TATACTTTAC AGTATCCCTT TGCAGCCAAG
TTCTGGACAT TTAGATTCTC AAGAGTATGT TTTATCTGTG CAGTCGTACT CATACTATGG
GGATTGATAT TCTCGGTAAT AAAACGCAAA AAGCAAGGGT TAATGTCTTT GCTTGCAGTG
TTGATATATA CTTTGACCCT GGTAAGTGCA ATTGTTCTTT CCAATGTAAT AAGGCCTCTT
TTGGTGGAAA GATACATATT CCCGGTTGTC GGACTGTTTG TTCTGGCCTT TGCATACGGA
ATATCCATGC TCAACAGCAA AGGTGCTTCA ATATTCGTTT GCGTGGCATT ATTGGCCGTT
TCAATTTCGC AAAACCAGTT GATTATCGAG AAAAGATTTA ACGGTCCGAT GAAAGAAGTA
TGCAGTTATA TCAATAGCCA GAATATAACT CCGGAAGACG TTTTTATTCA CACAGATGAA
CATACATTTG GAACATTTTG TTATTATTAT CCCAACAACA AGCATTATCT TTACCTTCCT
CCCGACTTCG ACGGATACAG CGGATATGAT GCTTTTTCAC CTGCCGGCTC CTACGGTTCG
GACATCAAAG AATTTATAGA CGGCCGGGAA AAAATATGGT TTGTGGAACG TGAAGGATCC
GACATGAGTA AACAAGGGTC CAAATTGCTC GATAAACATA TTTTGCAAAG CAGAGGCATG
ATTCTTAAGT TCAACCTTTA CCCGCATTCT TTCTACGCTG TTACTTTAAG GAGGGTTGTT
CCCGGAAATG CATTAAAAGA TTAA
 
Protein sequence
MKFKIPKLDY KSIIIWSVIL IAGFWYLTRG ISHETLWYDE SYSAAIINHS IPDIIRITAN 
DSHPPLYFIM LKIFSSVFGR TESALRLFSV LGLLALATLG AGPVRRVFGK FMGMMYSFCV
IAVPISLSIG QETRMYTWAA FFVTAGALYG YLALQENKRS DWIKFGLATL ASAFTHYYAL
LAVTVLNVLL FVWLLLRFIT TKDKKKFTSY LITAGAVVLC YFPWIFILFG QAKKVSKSFW
IPPVTKDVIW YTLQYPFAAK FWTFRFSRVC FICAVVLILW GLIFSVIKRK KQGLMSLLAV
LIYTLTLVSA IVLSNVIRPL LVERYIFPVV GLFVLAFAYG ISMLNSKGAS IFVCVALLAV
SISQNQLIIE KRFNGPMKEV CSYINSQNIT PEDVFIHTDE HTFGTFCYYY PNNKHYLYLP
PDFDGYSGYD AFSPAGSYGS DIKEFIDGRE KIWFVEREGS DMSKQGSKLL DKHILQSRGM
ILKFNLYPHS FYAVTLRRVV PGNALKD