Gene Information |
Locus tag | Cthe_3178 |
Symbol | |
ID | 4809629 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3755283 |
End bp | 3756872 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640108612 |
Product | hypothetical protein |
Protein accession | YP_001039566 |
Protein GI | 125975656 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain |
TIGRFAM ID | |
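The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 3755283
END_BP = 3756872
GENE_LENGTH_BP = 1590
PROTEIN_LENGTH_AA = 529


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Both checks hold for this record: the span covers 1590 bp, which is 530 codons, or 529 residues plus a stop codon.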
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACATC AAACACTGTA CCAGGATGAC TTTGTAGAAA TTTTCGAACA AGACAACGAA GTTTTCATCA AAACATTCAA ACCCGGTCTT CCGCCAAAAC AATTAAATGA AATATTGTCG TCACTTCCCC AGATACAAAT AACGAGCTTC AACTGCCTTA ACGACGCCCT AAACATCGTT TCCCCGTCCC CCCAAAAATT CGGCCGGTTA AAGGAAAGAA TCGCCATAAC TGTTACCCCG GACGAATTAA AAGCTTTCGT CACTTTCAAC TTACCCAGGG AAGAACTGGA TATTAAAAAC CGCGAAAATC TAATAAAAGA AACCTATCAA GTTTTGAAGA AAAAAAACAT AAACTTTGGA ATAAAAAAAG AATTGTTCTT CGGTGATTTG GAAAGCGGAA AAAAATATCT CATAGCTGAA GGAGTACCTG CCGTTGACGG GGAAGACTGC AAAATCAGGA TGTACGAACT TGAAGAGCCC AGACCTGAAA TAAGAGAGGA CGGAAAAGCA AACTACTATG AATTGAAGCT GATTAACAAG GTCAAGGCCG GAGACTGGCT CGGTGAGCGC ATTGACGCAA CCGAAGGTTT CCCGGGGAAA ACCGTATATG GCACCACTAT TCATCCTGTA AGAGGAAAAA ATTATCCTCT TTCCTATGAT AAAAATACCG TTTACGAAGT GGTACAAAAC AAGAAAGTTG TCCTATATTC AAAAATAGAC GGTGCCGTAC ACTATGACGG CAATAAAATC ACCGTGTCGA ACCATCTTGA AATTGACGGT GACGTGGACT TTAAAACTGG CAACATAATT TTTGACGGAT ATGTAACCAT TAAAGGTACC GTCACCGACG GATTCTACGT TGAAGCCACA AAAGACATAG AAATAAACAG CCCGTTGGGA CTTGGAAACG TCAAGGGAAT AAAAAGCCGT GAAGGAAGCA TCTACATAAA AGGCGGAATA TCGTCAAAAA GTTTCTCGGA AATTTCTGCC AGGAAAAACA TTTACACAAA GTTTGTGGAC AATGTCAAAT TATCCTGCGG CGGCACCGCC CACATAGGTT TTTATTGCAT AAACAGCATG GTTGAAGCAA AAGAAGTGTT TATTGAATCA ATAAAAGGAA ACATAATGGG AGGTCAGGTA AAAGCGGAAG TAAAAATTAC CGTACCTGTT TTAGGTTCGC CGCTGGAACC AAGAACCGTA CTCATTCTTA CAGGCTTTGA CAGAAAAAAA CTTTCAAAAA TGCTGGAAGA CATAGTTGAC AGAATAGACA GAATAAAAGA GGAACAAATG GACATAAAGC TGTATTTGTC AAAACAAGAC CCCTTTAAAG AAATGACTTC CCGGGAGACA GCGGAATACA ATTCAAAAAT GCAGAGGATG TCCGAACTTA AAGCAGAGCT TAAATCCTTG GAAGAAGAAA AAAAGAATAT TGCAAGATAT TTAAAAACAA AAGGCGAAGG TGAAATAGCC ATAACAAAAA AGGTTTATCC CAACTGTTCA ATAATTATAA AAAATATACA AACCGAAATA AAGGAGCCTT GCCTTGCAAC AACCTTTTTT GCAGCCGACG GCGAAATTAA ACGTGTTTAG
|
Protein sequence | MQHQTLYQDD FVEIFEQDNE VFIKTFKPGL PPKQLNEILS SLPQIQITSF NCLNDALNIV SPSPQKFGRL KERIAITVTP DELKAFVTFN LPREELDIKN RENLIKETYQ VLKKKNINFG IKKELFFGDL ESGKKYLIAE GVPAVDGEDC KIRMYELEEP RPEIREDGKA NYYELKLINK VKAGDWLGER IDATEGFPGK TVYGTTIHPV RGKNYPLSYD KNTVYEVVQN KKVVLYSKID GAVHYDGNKI TVSNHLEIDG DVDFKTGNII FDGYVTIKGT VTDGFYVEAT KDIEINSPLG LGNVKGIKSR EGSIYIKGGI SSKSFSEISA RKNIYTKFVD NVKLSCGGTA HIGFYCINSM VEAKEVFIES IKGNIMGGQV KAEVKITVPV LGSPLEPRTV LILTGFDRKK LSKMLEDIVD RIDRIKEEQM DIKLYLSKQD PFKEMTSRET AEYNSKMQRM SELKAELKSL EEEKKNIARY LKTKGEGEIA ITKKVYPNCS IIIKNIQTEI KEPCLATTFF AADGEIKRV
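The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares (table 11 differs only in the start codons it permits), and it assumes the gene sequence is already given in coding orientation, as its leading ATG and trailing TAG suggest. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_codons = "ATGCAACATC AAACACTGTA CCAGGATGAC"
print(translate(first_codons))  # MQHQTLYQDD
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.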