Gene Information |
Locus tag | Cthe_0353 |
Symbol | |
ID | 4808502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 443190 |
End bp | 444305 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105767 |
Product | hypothetical protein |
Protein accession | YP_001036784 |
Protein GI | 125972874 |
COG category | [S] Function unknown |
COG ID | [COG3581] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
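
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions reproduce the listed values but are not stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(443190, 444305)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Both results match the Gene Length (1116 bp) and Protein Length (371 aa) fields.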
| |
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA CGTTTCCGCA TATGGGTAAC ACATACATTG CAGTAAAGGC TCTTTTGGAC GATATCGGAG CAGAGTATGT TATTCCTCCT TTAAACAGCA AACGTTCTTT GGAACTGGGA ACAAAGTATG CTCCGGAAAT GGCATGCCTG CCCCTTAAAA TCAATATTGG AAATTATATT GAGGCTTACG AAAAAGGAGC GGATACAATT CTGACTGCCG GCGGACGGGG GCCTTGCAGA TTTGGGTACT ATTGTGAAAT GTGCAGGGAA ATACTTAATG ACAACGGATA TACCATGGAT GTGATAGTTT TGGAACCTCC GGATGTGGAA ATTATGAAAT TTATCGGCAA AATCAGAAAG CTTGCAGGCG GTCTTAATAT ATACCATATT TTAAAGGTAA TAAAAAACAC AACTATTGTG GCCCAAAGGG TGGATGAACT TGAACGTTTG GCATTTAAAA TCCGCCCCAG AGAGCTTAGG AAGGGAAGTA CTGACAGAAT TTATGAAGGC TTTCGCAGAA AAGTAATTGA CGTAAAAGGT TCGCAGGAAA TTCTAAAGCT TGTGGAGGAT ACCAAGAATC AGCTTTTGAA ACTCGAAATA GACAAAGATG CAAGGCCGCT TAAGATCGCT ATTGTCGGGG AAATTTACAC GACAATAGAC TCCTATACCA GTTTTAATAT TGATTCGATA TTGGGCGGCA TGGGTGTTGA GGTACACAGG GCAAACACCA TCAGCGGCTG GATAATCGAG CATATACTGA AAGCAGTTGT TCCGTTTACA AAAGACAAAA GATATGCTGA GGCTGCAAAG CCTTATCTGG GCACCATGAT AGGAGGGCAT GCCCAGGAAA CTATTGGGAA TACGGTGCTT TATGCCAAAG ACGGCTTTGA CGGGATAATA CAGATTTATC CTTTAACCTG TATGCCGGAA ATAGTGGCAG AGAGTATACT TCCTGCTGTG GAAAGAGATT ATAACATACC CATACTTACC TTGATTATTG ATGAGATGAC GGGAGAAGCC GGGTATATGA CAAGGATAGA AGCTTTTGTG GATTTACTGA GAAAAAGAAG GGAGAGAGGA GAAATTGCTG AAAACCTCGT ATTATCTGGG AATTGA
|
Protein sequence | MKITFPHMGN TYIAVKALLD DIGAEYVIPP LNSKRSLELG TKYAPEMACL PLKINIGNYI EAYEKGADTI LTAGGRGPCR FGYYCEMCRE ILNDNGYTMD VIVLEPPDVE IMKFIGKIRK LAGGLNIYHI LKVIKNTTIV AQRVDELERL AFKIRPRELR KGSTDRIYEG FRRKVIDVKG SQEILKLVED TKNQLLKLEI DKDARPLKIA IVGEIYTTID SYTSFNIDSI LGGMGVEVHR ANTISGWIIE HILKAVVPFT KDKRYAEAAK PYLGTMIGGH AQETIGNTVL YAKDGFDGII QIYPLTCMPE IVAESILPAV ERDYNIPILT LIIDEMTGEA GYMTRIEAFV DLLRKRRERG EIAENLVLSG N
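
The gene and protein sequences above are related through translation table 11, and the GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch of both steps, not the tool that produced this record. Table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard table is built here; the helper names `clean`, `gc_fraction`, and `translate` are illustrative assumptions.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(grouped: str) -> str:
    """Remove the 10-character grouping spaces used in the record."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as listed in the record.
prefix = clean("ATGAAAATAA CGTTTCCGCA TATGGGTAAC")
print(translate(prefix))  # MKITFPHMGN
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the complete protein sequence and the GC content field to be checked in the same way.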
|
| |