Gene Cthe_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1789 
Symbol 
ID4810034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2111364 
End bp2113820 
Gene Length2457 bp 
Protein Length818 aa 
Translation table11 
GC content45% 
IMG OID640107203 
ProductATPase AAA-2 
Protein accessionYP_001038203 
Protein GI125974293 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.607583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGGAC GTTTTACCGA AAAAGCACAA AAAGCAATAA ATATTTCTCA AAACATGGCA 
ATAGAATTGG GACATAATTA TGTCGGGACG GAGCACCTTC TTCTGGGATT GGTAAAAGAA
GGAAGTGGAG TCGCCGCGCG GGTTTTGCAA AGCCAGGGCG TTACTGAAGA GAAAGTTATA
AGGGAAATTG AAGAACTCAT CGGACGCGGC GAAATGATGG GCCAGCCTTT GGATTTCACT
CCGAGAACCA AAAGGGTTCT GGAGCTTAGC TATAGAGAGG CCCGCAGAAT GGGTCACAAC
TATATCGGAA CGGAGCACCT TCTTTTAGGA ATAATGAAAG AAGGGGAAAG CGTTGCCGTA
AGAATTTTGA AGGACTTGGG AGTCGAGCAT CAAAAGCTTG TTCAGGAAAT AATGAACATG
CTCAGTGAAG AAGCACCCAA TTCCACCGGT GTGCCAAAAG GCCATTCATC ATATTCGAAC
ACACCGACCC TGAACCAGTT CGGAAGAGAC TTGACGGAGA TGGCAAGGGA GGCAAAATTT
GACCCGGTAA TAGGCCGTGA CAAGGAAATT GAAAGAGTTA TACAGATTTT GAGCAGAAGA
ACCAAAAACA ACCCTTGTCT GATTGGTGAG CCCGGAGTCG GTAAGACTGC GATTGCAGAA
GGGCTTGCCC AGAAGATAGT TGAGGGAAAT ATTCCGGAAA TATTGAAGGA CAAAAGAGTT
GTGACTTTGG ACTTGTCCTC AATGGTTGCC GGTGCAAAAT ACAGAGGTGA ATTTGAAGAA
AGGCTTAAAA AAGCTCTTGA TGAAATCAGA AGGGCGGGCA ATGTAATACT GTTCATAGAT
GAAATGCATA CTATCATCGG AGCCGGAGCC GCTGAAGGTG CAATCGACGC TTCCAATATA
TTAAAGCCTT CTTTGGCGCG GGGCGAAATT CAGGTTATCG GTGCCACCAC CATTGATGAG
TACAGGAAGC ACATTGAAAA GGACGCGGCT TTGGAGAGAA GATTCCAGCC GATACTTGTC
GGAGAGCCGA CAAAAGAAGA AGCAATAGAA ATATTGAGGG GTCTGAGAGA CAAGTATGAA
GCACATCATA GTGTGAAAAT CACTGATGAG GCTTTGGTGG CAGCCGTAAA CATGTCGGAC
AGATACATTA CGGACCGGTT CCTGCCCGAT AAAGCCATCG ACCTTATTGA TGAGGCGGCA
TCAAGGGTAA GGCTGAAATC CTTTACCGCA CCTCCTGATC TGAAACATCT CGAGGAAAAA
GTGGAAAGAC TCAGAAAAGA AAAAGAAGAT GCTATAGTAT GCCAGGAATT TGAAAAGGCC
GCCCGTATAA GAGATGAGGA GCAGAGGCTG AAAAATGAGC TGGAAAAGGC AAAGGACAGT
TGGCGGCAGA AAAATCAGAC CACAACAAAC GTAGTCAGTG AAGATGATAT TGCAGTAATA
GTGTCCGACT GGACGGGTAT TCCGGTAAAG AGACTTGCCG AGGAAGAATC GGAAAGACTT
ATGAAGATGG AAGACATACT TCACAAAAGG GTAATAGGGC AGGATGAAGC GGTAAAGGCA
ATATCCAAAG CCATCAGAAG AGGAAGAGTG GGTCTTAAAG ACCCGAAGAG GCCGGTGGGT
TCATTCATTT TCCTGGGTCC TACAGGCGTT GGAAAAACTG AACTTAGCAA GGCTTTGGCA
GAAGCTCTCT TTGGTGAGGA AAACGCAATG ATTAGAATAG ACATGTCGGA GTACATGGAA
AAGCACAGCG TTTCAAGACT TGTAGGTTCA CCGCCGGGAT ATGTAGGTTA TGAAGAAGGA
GGACAGCTTA CCGAAAAAGT AAGAAGAAAG CCTTATTCGG TAGTGTTGTT TGACGAAATT
GAAAAGGCAC ATCCGGACAT ATTCAATATT CTGCTTCAAA TTCTTGAAGA CGGAAGACTG
ACGGATTCCC AGGGCAGGGT GGTGGACTTT AGAAACACGG TAATTATCAT GACATCAAAC
ATTGGTGCAA GGCTTATAAC CGAGCCAAAG CAGCTGGGAT TTGCTCCGGT CGCCCAGGAT
AAAAAGAAGA GCTATGAAGA CATGAAGAAC AACGTTATGA ATGAGCTCAA AAAGAACTTC
AGACCCGAGT TCTTAAACAG AATAGATGAA ATAATCGTGT TCCATCCTCT TGAGGAAGAA
CACTTAAAGC AGATAGTAGG ACTTATGATA GACAATCTTG CGGAAAGACT TAAACAGAAT
TCAATTGAAA TTGAGGTGTC GGACGAAGCA AAAGCTCTTC TTGCAAAGAA AGGATTTGAT
CCTGTGTACG GAGCAAGGCC GTTAAGAAGG GCTGTACAGA GCATGGTTGA GGACAGACTT
GCGGAGGAAA TGCTGGAAGG CAGAGTAAAG TCGGGAGACA AGGTATTTGT CGACGTTAAA
GATGATGAAC TGGTATTTGT CAAAGACAAA AGCGAGCTTG TTTCAAACAA GGGCTAA
 
Protein sequence
MYGRFTEKAQ KAINISQNMA IELGHNYVGT EHLLLGLVKE GSGVAARVLQ SQGVTEEKVI 
REIEELIGRG EMMGQPLDFT PRTKRVLELS YREARRMGHN YIGTEHLLLG IMKEGESVAV
RILKDLGVEH QKLVQEIMNM LSEEAPNSTG VPKGHSSYSN TPTLNQFGRD LTEMAREAKF
DPVIGRDKEI ERVIQILSRR TKNNPCLIGE PGVGKTAIAE GLAQKIVEGN IPEILKDKRV
VTLDLSSMVA GAKYRGEFEE RLKKALDEIR RAGNVILFID EMHTIIGAGA AEGAIDASNI
LKPSLARGEI QVIGATTIDE YRKHIEKDAA LERRFQPILV GEPTKEEAIE ILRGLRDKYE
AHHSVKITDE ALVAAVNMSD RYITDRFLPD KAIDLIDEAA SRVRLKSFTA PPDLKHLEEK
VERLRKEKED AIVCQEFEKA ARIRDEEQRL KNELEKAKDS WRQKNQTTTN VVSEDDIAVI
VSDWTGIPVK RLAEEESERL MKMEDILHKR VIGQDEAVKA ISKAIRRGRV GLKDPKRPVG
SFIFLGPTGV GKTELSKALA EALFGEENAM IRIDMSEYME KHSVSRLVGS PPGYVGYEEG
GQLTEKVRRK PYSVVLFDEI EKAHPDIFNI LLQILEDGRL TDSQGRVVDF RNTVIIMTSN
IGARLITEPK QLGFAPVAQD KKKSYEDMKN NVMNELKKNF RPEFLNRIDE IIVFHPLEEE
HLKQIVGLMI DNLAERLKQN SIEIEVSDEA KALLAKKGFD PVYGARPLRR AVQSMVEDRL
AEEMLEGRVK SGDKVFVDVK DDELVFVKDK SELVSNKG