Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0858 |
Symbol | |
ID | 4810476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1033379 |
End bp | 1034368 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106274 |
Product | hypothetical protein |
Protein accession | YP_001037285 |
Protein GI | 125973375 |
COG category | [S] Function unknown |
COG ID | [COG4864] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGGTA TAGCTTTCAT TCTTATTGTA GGAGCAATAC TAGTATTTAT CTCACTGTTT TTTGCAATTG TTCCTGTTGG TCTTTGGATT TCGGCATTTG CCGCAAACGT CAGGGTCAGC ATATTCACCC TAATAGGAAT GCGCCTTAGA AGAGTCGTTC CTTCAAGAGT TATAAATCCG CTTATTAAAG CTACAAAAGC AGGTATCAAT GTGTCCATCA ACAAACTTGA AGCCCATTAT CTTGCAGGCG GTAATGTGGA CAGGGTTGTA AATGCTCTTA TTGCCGCTCA AAGAGCCAAT ATCCCTCTTG AATTTGAAAG AGCCGCTGCT ATCGACCTTG CCGGAAGAAA TGTTCTTGAA GCCGTTCAAA TGAGTGTTAA CCCAAAAGTT ATTGAAACAC CAGTCGTGGC AGCAATAGCA AAAGACGGTA TTGAACTTCG GGCAAAAGCA AGAGTCACTG TCAGAGCAAA TATTGACCGT CTCGTCGGAG GTGCCGGTGA ACAAACCATC ATTGCCCGTG TCGGCGAAGG TGTTGTTACG ACAGTTGGTT CCGCTACGGA CCACAAACAG GTTCTCGAAA ATCCTGATGC CATATCCAAA ACAGTGTTAA GCAAAGGTCT TGATGCAGGT ACCGCTTTTG AAATTCTTTC CATTGATATT GCAGACATTG ACGTGGGAAG AAATGTCGGT GCCCAGCTGC AGACAGACCA GGCGGAAGCG GATAAACGTA TTGCCCAGGC AAAAGCTGAA GAAAGAAGAG CTATGGCTGT TGCAAGGGAA CAGGAAATGA AAGCCATGGT ACAGGAAATG AGAGCAAAAG TTGTGGAAGC CGAAGCGGAA GTGCCAAAAG CACTGGCTGC TGCTCTTCGC GAAGGAAAAA TCGGTGTCCT TGACTACTAT CATTTACAAA ATCTCATAGC AGACACCCAA ATGAGAGACA GTATTTCAAA AATGAGCAAA CATGATGATT CTTCATCTGA CAAAAAATAG
|
Protein sequence | MDGIAFILIV GAILVFISLF FAIVPVGLWI SAFAANVRVS IFTLIGMRLR RVVPSRVINP LIKATKAGIN VSINKLEAHY LAGGNVDRVV NALIAAQRAN IPLEFERAAA IDLAGRNVLE AVQMSVNPKV IETPVVAAIA KDGIELRAKA RVTVRANIDR LVGGAGEQTI IARVGEGVVT TVGSATDHKQ VLENPDAISK TVLSKGLDAG TAFEILSIDI ADIDVGRNVG AQLQTDQAEA DKRIAQAKAE ERRAMAVARE QEMKAMVQEM RAKVVEAEAE VPKALAAALR EGKIGVLDYY HLQNLIADTQ MRDSISKMSK HDDSSSDKK
|
| |