Gene Information |
Locus tag | Cthe_1789 |
Symbol | |
ID | 4810034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2111364 |
End bp | 2113820 |
Gene Length | 2457 bp |
Protein Length | 818 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107203 |
Product | ATPase AAA-2 |
Protein accession | YP_001038203 |
Protein GI | 125974293 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | |
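The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the reported gene length includes a single terminal stop codon. The record does not state either assumption, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2111364
end_bp = 2113820
gene_length_bp = 2457
protein_length_aa = 818

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
span = end_bp - start_bp + 1

# Assumption: the gene length includes one 3 bp stop codon that is not
# counted in the protein length.
coding_codons = (span - 3) // 3

print(span == gene_length_bp)              # coordinates agree with Gene Length
print(span % 3 == 0)                       # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons agree with Protein Length
```

Under these assumptions all three checks print True for this record. Because the strand is "-", the listed gene sequence would normally be the reverse complement of the replicon's reference strand over the same coordinates.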
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.607583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGGAC GTTTTACCGA AAAAGCACAA AAAGCAATAA ATATTTCTCA AAACATGGCA ATAGAATTGG GACATAATTA TGTCGGGACG GAGCACCTTC TTCTGGGATT GGTAAAAGAA GGAAGTGGAG TCGCCGCGCG GGTTTTGCAA AGCCAGGGCG TTACTGAAGA GAAAGTTATA AGGGAAATTG AAGAACTCAT CGGACGCGGC GAAATGATGG GCCAGCCTTT GGATTTCACT CCGAGAACCA AAAGGGTTCT GGAGCTTAGC TATAGAGAGG CCCGCAGAAT GGGTCACAAC TATATCGGAA CGGAGCACCT TCTTTTAGGA ATAATGAAAG AAGGGGAAAG CGTTGCCGTA AGAATTTTGA AGGACTTGGG AGTCGAGCAT CAAAAGCTTG TTCAGGAAAT AATGAACATG CTCAGTGAAG AAGCACCCAA TTCCACCGGT GTGCCAAAAG GCCATTCATC ATATTCGAAC ACACCGACCC TGAACCAGTT CGGAAGAGAC TTGACGGAGA TGGCAAGGGA GGCAAAATTT GACCCGGTAA TAGGCCGTGA CAAGGAAATT GAAAGAGTTA TACAGATTTT GAGCAGAAGA ACCAAAAACA ACCCTTGTCT GATTGGTGAG CCCGGAGTCG GTAAGACTGC GATTGCAGAA GGGCTTGCCC AGAAGATAGT TGAGGGAAAT ATTCCGGAAA TATTGAAGGA CAAAAGAGTT GTGACTTTGG ACTTGTCCTC AATGGTTGCC GGTGCAAAAT ACAGAGGTGA ATTTGAAGAA AGGCTTAAAA AAGCTCTTGA TGAAATCAGA AGGGCGGGCA ATGTAATACT GTTCATAGAT GAAATGCATA CTATCATCGG AGCCGGAGCC GCTGAAGGTG CAATCGACGC TTCCAATATA TTAAAGCCTT CTTTGGCGCG GGGCGAAATT CAGGTTATCG GTGCCACCAC CATTGATGAG TACAGGAAGC ACATTGAAAA GGACGCGGCT TTGGAGAGAA GATTCCAGCC GATACTTGTC GGAGAGCCGA CAAAAGAAGA AGCAATAGAA ATATTGAGGG GTCTGAGAGA CAAGTATGAA GCACATCATA GTGTGAAAAT CACTGATGAG GCTTTGGTGG CAGCCGTAAA CATGTCGGAC AGATACATTA CGGACCGGTT CCTGCCCGAT AAAGCCATCG ACCTTATTGA TGAGGCGGCA TCAAGGGTAA GGCTGAAATC CTTTACCGCA CCTCCTGATC TGAAACATCT CGAGGAAAAA GTGGAAAGAC TCAGAAAAGA AAAAGAAGAT GCTATAGTAT GCCAGGAATT TGAAAAGGCC GCCCGTATAA GAGATGAGGA GCAGAGGCTG AAAAATGAGC TGGAAAAGGC AAAGGACAGT TGGCGGCAGA AAAATCAGAC CACAACAAAC GTAGTCAGTG AAGATGATAT TGCAGTAATA GTGTCCGACT GGACGGGTAT TCCGGTAAAG AGACTTGCCG AGGAAGAATC GGAAAGACTT ATGAAGATGG AAGACATACT TCACAAAAGG GTAATAGGGC AGGATGAAGC GGTAAAGGCA ATATCCAAAG CCATCAGAAG AGGAAGAGTG GGTCTTAAAG ACCCGAAGAG GCCGGTGGGT TCATTCATTT TCCTGGGTCC TACAGGCGTT GGAAAAACTG AACTTAGCAA GGCTTTGGCA GAAGCTCTCT TTGGTGAGGA AAACGCAATG ATTAGAATAG ACATGTCGGA GTACATGGAA AAGCACAGCG TTTCAAGACT TGTAGGTTCA CCGCCGGGAT ATGTAGGTTA TGAAGAAGGA 
GGACAGCTTA CCGAAAAAGT AAGAAGAAAG CCTTATTCGG TAGTGTTGTT TGACGAAATT GAAAAGGCAC ATCCGGACAT ATTCAATATT CTGCTTCAAA TTCTTGAAGA CGGAAGACTG ACGGATTCCC AGGGCAGGGT GGTGGACTTT AGAAACACGG TAATTATCAT GACATCAAAC ATTGGTGCAA GGCTTATAAC CGAGCCAAAG CAGCTGGGAT TTGCTCCGGT CGCCCAGGAT AAAAAGAAGA GCTATGAAGA CATGAAGAAC AACGTTATGA ATGAGCTCAA AAAGAACTTC AGACCCGAGT TCTTAAACAG AATAGATGAA ATAATCGTGT TCCATCCTCT TGAGGAAGAA CACTTAAAGC AGATAGTAGG ACTTATGATA GACAATCTTG CGGAAAGACT TAAACAGAAT TCAATTGAAA TTGAGGTGTC GGACGAAGCA AAAGCTCTTC TTGCAAAGAA AGGATTTGAT CCTGTGTACG GAGCAAGGCC GTTAAGAAGG GCTGTACAGA GCATGGTTGA GGACAGACTT GCGGAGGAAA TGCTGGAAGG CAGAGTAAAG TCGGGAGACA AGGTATTTGT CGACGTTAAA GATGATGAAC TGGTATTTGT CAAAGACAAA AGCGAGCTTG TTTCAAACAA GGGCTAA
Protein sequence | MYGRFTEKAQ KAINISQNMA IELGHNYVGT EHLLLGLVKE GSGVAARVLQ SQGVTEEKVI REIEELIGRG EMMGQPLDFT PRTKRVLELS YREARRMGHN YIGTEHLLLG IMKEGESVAV RILKDLGVEH QKLVQEIMNM LSEEAPNSTG VPKGHSSYSN TPTLNQFGRD LTEMAREAKF DPVIGRDKEI ERVIQILSRR TKNNPCLIGE PGVGKTAIAE GLAQKIVEGN IPEILKDKRV VTLDLSSMVA GAKYRGEFEE RLKKALDEIR RAGNVILFID EMHTIIGAGA AEGAIDASNI LKPSLARGEI QVIGATTIDE YRKHIEKDAA LERRFQPILV GEPTKEEAIE ILRGLRDKYE AHHSVKITDE ALVAAVNMSD RYITDRFLPD KAIDLIDEAA SRVRLKSFTA PPDLKHLEEK VERLRKEKED AIVCQEFEKA ARIRDEEQRL KNELEKAKDS WRQKNQTTTN VVSEDDIAVI VSDWTGIPVK RLAEEESERL MKMEDILHKR VIGQDEAVKA ISKAIRRGRV GLKDPKRPVG SFIFLGPTGV GKTELSKALA EALFGEENAM IRIDMSEYME KHSVSRLVGS PPGYVGYEEG GQLTEKVRRK PYSVVLFDEI EKAHPDIFNI LLQILEDGRL TDSQGRVVDF RNTVIIMTSN IGARLITEPK QLGFAPVAQD KKKSYEDMKN NVMNELKKNF RPEFLNRIDE IIVFHPLEEE HLKQIVGLMI DNLAERLKQN SIEIEVSDEA KALLAKKGFD PVYGARPLRR AVQSMVEDRL AEEMLEGRVK SGDKVFVDVK DDELVFVKDK SELVSNKG
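The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it builds the codon table from the compact NCBI-style amino acid string. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the two test fragments are the first three and last three 10-base blocks copied from the gene sequence above.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 27 bases of the gene sequence, both in frame.
first_block = "ATGTACGGAC GTTTTACCGA AAAAGCACAA"
last_block = "AGCGAGCTTG TTTCAAACAA GGGCTAA"

print(translate(first_block))  # MYGRFTEKAQ
print(translate(last_block))   # SELVSNKG
```

The two outputs match the first and last residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content.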