Gene Information |
Locus tag | Cthe_0643 |
Symbol | |
ID | 4808172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 794108 |
End bp | 795181 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106057 |
Product | hypothetical protein |
Protein accession | YP_001037071 |
Protein GI | 125973161 |
COG category | [S] Function unknown |
COG ID | [COG3949] Uncharacterized membrane protein |
TIGRFAM ID | |
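The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state the convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 794108
end_bp = 795181

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Split the CDS into codons; a non-zero remainder would indicate a partial codon.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1074 0 357
```

Under these assumptions the computed values agree with the listed Gene Length (1074 bp) and Protein Length (357 aa).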
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGAAAATA GCAGAGTATC CACCTTTAAG GTTGCTGCGA CATATATAGG AACTGTGGTG GGGGCGGGAT TTGCATCGGG ACAGGAAATG CTTCAGTTTT TTGCGGTTTT TGGTTTGAAG GGTTTTTGGG GATTAATTGT TGTTACGGCT TTGTTTATTG TATTTGGCTA TATTGTTATG GAAGTCGGAA GAAATCTTCA GTCAGAATCC CACCTTCAGA TAATCAAACA TTCCGGCGGA AAGTTTTTAG GCCCGTTAAT GGACTGGTTA ATAACTTTTT TCCTGTTTGG TGCCCTTACT GCGATGATTG CGGGTTCCGG TGCTTTGGTA AAACAGCAGT TTGGAATAAA TCCTTTTTGG GGAAATTTGT TTATGGCCGC TTTTACGGCA TTGACCGTGC TTACGGGAAT CAATGGGGTA ATAAACTCTA TCAGTGTTGT GGTTCCTTTT TTGGTTACTT CAGCGGTGGG TATCAGCCTG GCTTCAATTT TGTTAATGCC GCTGCGGACA AATCCTTATG AACTTGTTGC GGCAGCCGAG TCTGTGTCAC GAAACGGATT AATTGGCAAC TGGCTGTGGG CGGCCGTTCT TTACGTTTCA TACAACTTAG TGCTGGCCAT TTCCGTATTG GGGCCTTTGG GAGTGCAGGC AAAAAACAAA AACGCGATTA GAAACGGGGC AATACTTGGA GGAATTGGGC TTGGGGTCTC GGCAACAGCA ATTTATTTTG CCATGGAAAG AAATTTTGAA ATTGTCCGCG ACATGGAAAT ACCCATGATT TATCTTGCGG GCAGCCTGTC TTATATATTG CAGATTGTAT ATGCATTGGT GCTTATCGGT GAAATATATA CTACGGCTGT TGGAAGCTTG TACGGATTTT CGGCCAGAAT CACCGACATA AATGAAGGTA AAGCAAAGTT TTATATAATC GGAGCGACCC TTTTGGCGCT GGCCGCAAGT CAACTGGGAT TTTCAAATAT GGTAAAATAC CTTTATCCCG TTGTGGGTTA TGGAGGAGTT GTACTTCTTG CATGCCTTGC CGTCACAAGA TTTAAGCTTG CAAAGAGAGC TTAG
Protein sequence | MENSRVSTFK VAATYIGTVV GAGFASGQEM LQFFAVFGLK GFWGLIVVTA LFIVFGYIVM EVGRNLQSES HLQIIKHSGG KFLGPLMDWL ITFFLFGALT AMIAGSGALV KQQFGINPFW GNLFMAAFTA LTVLTGINGV INSISVVVPF LVTSAVGISL ASILLMPLRT NPYELVAAAE SVSRNGLIGN WLWAAVLYVS YNLVLAISVL GPLGVQAKNK NAIRNGAILG GIGLGVSATA IYFAMERNFE IVRDMEIPMI YLAGSLSYIL QIVYALVLIG EIYTTAVGSL YGFSARITDI NEGKAKFYII GATLLALAAS QLGFSNMVKY LYPVVGYGGV VLLACLAVTR FKLAKRA
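The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline used to produce the record: the `translate` helper and the `head` and `tail` variables are hypothetical names, and the codon table is built by hand. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may serve as start codons. Only the first 30 and last 24 bases of the gene sequence are used here; the same function could be applied to the full sequence.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces that group the sequence into blocks of ten bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence, both in frame.
head = "ATGGAAAATA GCAGAGTATC CACCTTTAAG"
tail = "TTTAAGCTTG CAAAGAGAGC TTAG"

print(translate(head))  # MENSRVSTFK
print(translate(tail))  # FKLAKRA
```

The two outputs match the first ten and last seven residues of the listed protein sequence, and the final codon of the gene (TAG) is a stop codon.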