Gene Cthe_0793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0793 
Symbol 
ID4810411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp956900 
End bp958168 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content37% 
IMG OID640106210 
Producthypothetical protein 
Protein accessionYP_001037221 
Protein GI125973311 
COG category 
COG ID 
TIGRFAM ID[TIGR02828] putative membrane fusion protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000606718 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGAAG AGAACAAAAG AAAAATCAAT GGAAAGGTAA AGCTGGGAGG TCTTTTGATT 
GCCCTGTTTT TGCTACTGTA TATTCCATCT TTTATATTTT GGATTTACGG CAAAAATATC
CACACGGATA TAATAAGAAT GGGGGAATTG GAAGACTATG TGACCACTGA TGCCTACATT
GTAAGAGACG AGACAGTAAT CAACTCTCCT TCCGACGGAA TCAGCATAAG GAATGTGGAA
GAAGGAGAAA AAGTGGGAGT GGGAGATACT ATTGCCACAG TATTAAACAA ATCTTCGGAG
AAACTTCTGG AAGATTTGAA GACTCTTGAC CTAAGAATAA TTGAGGCAAA GAGGGAGAAA
ACCAAAAACG ACAATTTTTT TTCCGAGGAT ATAAAAAAGC TTGACCAGGA AATACAGGAA
AAGCTGGTGC TTGTGATAAA GAAGAGCAAT AAAAACAGCA TTTCGGAGGT TAAGCAAATA
AAAAACGAAA TTGATGAACT TATTAAAAAG AAGGCTACCA TTTCAGGAGA CTTGAGCTAT
ACGGACGCCA ACATAAAAGC TCTTGAAAAT GAAAAAAGGA TACTTCAGGA CAGTATAAAC
GCAAACAAAC GAAATATTGT TTCAAATTTA TCAGGAATAG TATCTTATGT GATTGACGGA
TATGAAGAAA TTCTCAATCC TGAAAAAATA CCGGAAATTA CTCCGGAAAT GCTTGGTATG
ATAAAAGTCG TGGAAAACAG AAAAAAAACG GATGACTTGA GTACGCAGTA CAACAAACCT
TTTGTCAAGG TGATTGGCGG CATAGACTAT TATATAGTTT TTGTCCTGGA CAGGGAAAAA
GCCGATGATT TTAAAGTGGA TAATTATTTA AGAGTCCGTA TTAATGATAT TGGCAGAGTT
GTTGACGGGA CGATTGCGTA CAAATCCAAT GAAATGGACG GAAAATTTGT GATTGCGGTG
CGGACGGACA AGGCTTTGAG TGATACCGCA GGTTTGAGGG TAATAAATGT CGATCTTATC
AAGAGCCGTT ATGAAGGGCT GATTGTTCCA GTTAAAAGCC TTGTCAATAT TGATATGAAT
ACGATGAGGG CGGAAATTGC ATTGGTTAAG GCAAGAAGGG CAACTTTTGT TCCTGTCAAA
ATTGTCGGAA AAAATGACAA TTTTGCTGTG ATAGATAATG TTGAAGATTA CAAAGATGGA
GGAGTCAGCT TGTATTCAAG CTATATTATA AATCCAAAAA ACATAGAGGA AGGACAAGTC
ATAAATTAA
 
Protein sequence
MPEENKRKIN GKVKLGGLLI ALFLLLYIPS FIFWIYGKNI HTDIIRMGEL EDYVTTDAYI 
VRDETVINSP SDGISIRNVE EGEKVGVGDT IATVLNKSSE KLLEDLKTLD LRIIEAKREK
TKNDNFFSED IKKLDQEIQE KLVLVIKKSN KNSISEVKQI KNEIDELIKK KATISGDLSY
TDANIKALEN EKRILQDSIN ANKRNIVSNL SGIVSYVIDG YEEILNPEKI PEITPEMLGM
IKVVENRKKT DDLSTQYNKP FVKVIGGIDY YIVFVLDREK ADDFKVDNYL RVRINDIGRV
VDGTIAYKSN EMDGKFVIAV RTDKALSDTA GLRVINVDLI KSRYEGLIVP VKSLVNIDMN
TMRAEIALVK ARRATFVPVK IVGKNDNFAV IDNVEDYKDG GVSLYSSYII NPKNIEEGQV
IN