Gene Information |
Locus tag | Cthe_0284 |
Symbol | |
ID | 4808567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 350817 |
End bp | 352175 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105696 |
Product | hypothetical protein |
Protein accession | YP_001036716 |
Protein GI | 125972806 |
COG category | [R] General function prediction only |
COG ID | [COG2607] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
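The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes 1-based, inclusive coordinates and a gene that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(350817, 352175)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch reproduces the listed Gene Length (1359 bp) and Protein Length (452 aa).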
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGATTA ATGTTAATTT CAATGAATTA TTGCTTGAGT TGGATTCATT AAGTGTATAC AGGAATTTGC TTAATGACAG TGTAATAAAA AAATTATATG AACTTGGGGA CCTGATTTAC AAAAAAGAGC CCGGGACGGA CGAATTTATA AAAAAATATA ATGAATTCTT CTTTGAGTTG ACATCAGGTT CAATACAGTC ATTGAAGGAG CATATAATAG AGCTGATACT TTTTGATGAG AATCCTTACT CAAGACAGTC AGAAAAATTA CCTTTTGAGA ATATTGATAA AGTATTGATT GACGCAGCCT CAAAAGACTT GGATAGACTC CATGTAATAT CAGCTCTTAC ATCCCGGAAG TTTAAAGATT ATGCTTTAGA GAACATCTGC AAATCCGAGA GCGAAAAAAG GTTTATAGAC AGCCTTCCGG CATGGGAATT TGACGGCAGA AGCAATCAGG GAAAGAGAAA ATTGTCCCAA AACTGCCAAA AGATGATTGA TGTTCTCGTC TCAAGCAACA AATGGGGAGA ATGTGTCAAG AACCTTTCCG ATTTTTATTT TACCAACGGT TCGGGGATTT TTGCCCGCTA TCATGCTTTT GTTTGGGAAC CGTCAGAAAA ACCTTCTCTC AGAGGGGTTG AATTCCCCGA TCCGATTCGC TTGTCCGATT TTATCGGATA TGAACAGCAG CGCCTTGAAG TAATAGAAAA CACAGAAAAG TTTGTCAGGG GATTGCCGGC AAACAATGTT TTGCTTTATG GAGACAGGGG TACAGGAAAA TCTTCAACTG TAAAAGCCAT TGCAAATGAA TACAGGGAAC AGGGACTTAG AATCATTGAG ATACCGAGAA AATATCTTGT GGATTTTCCT GCGGTGTTAA AACTGATTAA GGGAAGAAGC TGTAAATTTA TTATTTTTAT TGACGATCTG GCTTTTGAGG ACAGTGAAGA AAGTTACACA GTTCTAAAGT CCGTACTTGA AGGAGGAGTG GAAAACAGAC CGGACAATGT GTTAATCTAT GCTACTTCCA ACAGGCGACA TCTTATAAAG GAGAAATTCA GTGACCGGGC AGGCTTTAAA TCGGACGATC CCGATGATGA AATCAGGGCA CAGGACACAA TGCAGGAAAA GCTGTCTCTT TCAGAAAGGT TTGGAATTAC CGTTGTTTTT TCATCACCGG ACAAAAAACA GTTTTTACAG ATTGTTGAAG GCCTTGTGGC TAAAAGAGGG ATTAACATTG ATAAAGACAC GCTCCAAAGA GAAGCTATGA AGTGGGAGCT TATGTACAAC GGACGCTCTG CAAGGACTGC AAGGCAATTT GTTGACTGGC TGGAAGGCAG TATGAAATTG AAAAAGTAA
|
Protein sequence | MLINVNFNEL LLELDSLSVY RNLLNDSVIK KLYELGDLIY KKEPGTDEFI KKYNEFFFEL TSGSIQSLKE HIIELILFDE NPYSRQSEKL PFENIDKVLI DAASKDLDRL HVISALTSRK FKDYALENIC KSESEKRFID SLPAWEFDGR SNQGKRKLSQ NCQKMIDVLV SSNKWGECVK NLSDFYFTNG SGIFARYHAF VWEPSEKPSL RGVEFPDPIR LSDFIGYEQQ RLEVIENTEK FVRGLPANNV LLYGDRGTGK SSTVKAIANE YREQGLRIIE IPRKYLVDFP AVLKLIKGRS CKFIIFIDDL AFEDSEESYT VLKSVLEGGV ENRPDNVLIY ATSNRRHLIK EKFSDRAGFK SDDPDDEIRA QDTMQEKLSL SERFGITVVF SSPDKKQFLQ IVEGLVAKRG INIDKDTLQR EAMKWELMYN GRSARTARQF VDWLEGSMKL KK
|
| |
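The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a plain GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the database may compute its GC content field differently (for example, with different rounding).

```python
def clean_sequence(grouped: str) -> str:
    """Join the whitespace-separated blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used only as a short example.
first_blocks = "ATGTTGATTA ATGTTAATTT"
print(len(clean_sequence(first_blocks)))
print(round(gc_fraction(first_blocks), 2))
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content field of the record.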