Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0116 |
Symbol | |
ID | 4808742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 149095 |
End bp | 150039 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105527 |
Product | hypothetical protein |
Protein accession | YP_001036550 |
Protein GI | 125972640 |
COG category | [S] Function unknown |
COG ID | [COG1481] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00647] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.061515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCGTTTT CATCGGCAGT TAAAAATGAG TTGTGCAGAG TTGAAACAGA TCGCAGCAGC AGTTTGTTTG AACTTGCGGC TGCTGCAAGG ATAAGCGGGC TTATAAAGGT TGTTAACGCC AATGAGATTA ATCTGAGGCT TGTAACTGAA AATGCAGCAT TTGCCCGCAG GATATTTTCA CAGATCAAGG ATTTGTACGG CATAAATGCT GAAATAAGCA TAAGAAGAAG CCCAAAGCTG AAAAAAAATA TTGTCTATAT CATTGTGCTG ACTTCATCGA AAGGTTTGAT GAGAATACTT GAGGATATTA ATATTAAAGT ATCCGAAAAA GTGGAATACA TACCGTATGC CCGGTGTATG AGGAAGAAGG AATACAGAAA GGCGTATCTT AGGGGTGCGT TTTTAGCTTC CGGCTCAATG AGCGATCCTG AAAAAACATA TCATCTGGAG ATAACATGCC ACAATATGGC TTTGGCGACT GAAGTCAGTG ATTTGATGGA CAGTTTTAAA TTACATTCGA AAGTTACAAA AAGAAAAGGA AACTATGTGG TTTATCTTAA AGAAGGGGAG AACATAGTTG ACTTTTTGAA TATAATCGGA GCACATTCTG CGCTTTTGGA GCTTGAAAAC ATCAGGATTT TAAAGGAAAT GCGCAACAAC GTCAACAGAA TAGTCAATTG TGAGACGGCA AATTTGCAAA AGACTGTGGA TGCTTCGGTA AGGCAGGTGG AAAATATCAA CTATATAAAG GAGCATCTTG GATTTGACAA ATTGCCGGAA AATCTCAGAG AAATAGCAAA AATGAGGCTT GCCTACAGTG ATGCAACTTT AAAGGAACTA GGAGAAATGT TAAATCCTCC TCTTGGAAAA TCCGGAGTTA ATCATAGATT GAGAAAGTTG GACAAGATTG CTGAGAGCTT AAGAAACATG AAAGGAGAGT TATGA
|
Protein sequence | MSFSSAVKNE LCRVETDRSS SLFELAAAAR ISGLIKVVNA NEINLRLVTE NAAFARRIFS QIKDLYGINA EISIRRSPKL KKNIVYIIVL TSSKGLMRIL EDINIKVSEK VEYIPYARCM RKKEYRKAYL RGAFLASGSM SDPEKTYHLE ITCHNMALAT EVSDLMDSFK LHSKVTKRKG NYVVYLKEGE NIVDFLNIIG AHSALLELEN IRILKEMRNN VNRIVNCETA NLQKTVDASV RQVENINYIK EHLGFDKLPE NLREIAKMRL AYSDATLKEL GEMLNPPLGK SGVNHRLRKL DKIAESLRNM KGEL
|
| |