Gene Information |
Locus tag | Cthe_0793 |
Symbol | |
ID | 4810411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 956900 |
End bp | 958168 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640106210 |
Product | hypothetical protein |
Protein accession | YP_001037221 |
Protein GI | 125973311 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02828] putative membrane fusion protein |
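The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, although its own numbers agree with both. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values copied from the record for Cthe_0793.
length = gene_length_bp(956900, 958168)
residues = protein_length_aa(length)
print(length, residues)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1269 bp) and Protein Length (422 aa).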
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000606718 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
|
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
|
Sequence |
Gene sequence | ATGCCGGAAG AGAACAAAAG AAAAATCAAT GGAAAGGTAA AGCTGGGAGG TCTTTTGATT GCCCTGTTTT TGCTACTGTA TATTCCATCT TTTATATTTT GGATTTACGG CAAAAATATC CACACGGATA TAATAAGAAT GGGGGAATTG GAAGACTATG TGACCACTGA TGCCTACATT GTAAGAGACG AGACAGTAAT CAACTCTCCT TCCGACGGAA TCAGCATAAG GAATGTGGAA GAAGGAGAAA AAGTGGGAGT GGGAGATACT ATTGCCACAG TATTAAACAA ATCTTCGGAG AAACTTCTGG AAGATTTGAA GACTCTTGAC CTAAGAATAA TTGAGGCAAA GAGGGAGAAA ACCAAAAACG ACAATTTTTT TTCCGAGGAT ATAAAAAAGC TTGACCAGGA AATACAGGAA AAGCTGGTGC TTGTGATAAA GAAGAGCAAT AAAAACAGCA TTTCGGAGGT TAAGCAAATA AAAAACGAAA TTGATGAACT TATTAAAAAG AAGGCTACCA TTTCAGGAGA CTTGAGCTAT ACGGACGCCA ACATAAAAGC TCTTGAAAAT GAAAAAAGGA TACTTCAGGA CAGTATAAAC GCAAACAAAC GAAATATTGT TTCAAATTTA TCAGGAATAG TATCTTATGT GATTGACGGA TATGAAGAAA TTCTCAATCC TGAAAAAATA CCGGAAATTA CTCCGGAAAT GCTTGGTATG ATAAAAGTCG TGGAAAACAG AAAAAAAACG GATGACTTGA GTACGCAGTA CAACAAACCT TTTGTCAAGG TGATTGGCGG CATAGACTAT TATATAGTTT TTGTCCTGGA CAGGGAAAAA GCCGATGATT TTAAAGTGGA TAATTATTTA AGAGTCCGTA TTAATGATAT TGGCAGAGTT GTTGACGGGA CGATTGCGTA CAAATCCAAT GAAATGGACG GAAAATTTGT GATTGCGGTG CGGACGGACA AGGCTTTGAG TGATACCGCA GGTTTGAGGG TAATAAATGT CGATCTTATC AAGAGCCGTT ATGAAGGGCT GATTGTTCCA GTTAAAAGCC TTGTCAATAT TGATATGAAT ACGATGAGGG CGGAAATTGC ATTGGTTAAG GCAAGAAGGG CAACTTTTGT TCCTGTCAAA ATTGTCGGAA AAAATGACAA TTTTGCTGTG ATAGATAATG TTGAAGATTA CAAAGATGGA GGAGTCAGCT TGTATTCAAG CTATATTATA AATCCAAAAA ACATAGAGGA AGGACAAGTC ATAAATTAA
|
Protein sequence | MPEENKRKIN GKVKLGGLLI ALFLLLYIPS FIFWIYGKNI HTDIIRMGEL EDYVTTDAYI VRDETVINSP SDGISIRNVE EGEKVGVGDT IATVLNKSSE KLLEDLKTLD LRIIEAKREK TKNDNFFSED IKKLDQEIQE KLVLVIKKSN KNSISEVKQI KNEIDELIKK KATISGDLSY TDANIKALEN EKRILQDSIN ANKRNIVSNL SGIVSYVIDG YEEILNPEKI PEITPEMLGM IKVVENRKKT DDLSTQYNKP FVKVIGGIDY YIVFVLDREK ADDFKVDNYL RVRINDIGRV VDGTIAYKSN EMDGKFVIAV RTDKALSDTA GLRVINVDLI KSRYEGLIVP VKSLVNIDMN TMRAEIALVK ARRATFVPVK IVGKNDNFAV IDNVEDYKDG GVSLYSSYII NPKNIEEGQV IN
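The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that clean-up together with a GC-content calculation; the helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full entry.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence shown in the record.
fragment = clean_sequence("ATGCCGGAAG AGAACAAAAG")
fragment_gc = gc_fraction(fragment)
print(fragment, fragment_gc)
```

Applying the same two helpers to the complete gene sequence would give the value to compare against the GC content field; the fragment here is too short to be representative of the whole gene.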