Gene Information |
Locus tag | Cthe_3158 |
Symbol | |
ID | 4809608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3730815 |
End bp | 3732743 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640108591 |
Product | aconitate hydratase |
Protein accession | YP_001039546 |
Protein GI | 125975636 |
COG category | [C] Energy production and conversion |
COG ID | [COG1048] Aconitase A |
TIGRFAM ID | [TIGR01342] aconitate hydratase, putative, Aquifex type |
|
|
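The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, assuming that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3730815
end_bp = 3732743

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases. Assumption: the last codon
# is a stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1929 bp) and Protein Length (642 aa), and the remainder of zero shows the gene length is a whole number of codons.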
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000107146 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTTTGA ATTTGGCACA AAAAATAATT AAGGAGCATT TGGTAAGCGG AGAAATGAAG CCCGGCACTG AAATAGCAAT AAGGATAGAT CAGACTTTGA CCCAGGACTC TACCGGAACA ATGGCATATC TGCAGTTTGA GGCCATGGGA ATTCCAAGGG TAAAGACTAA AAAGTCCGTT GCCTATATTG ACCACAACAC GCTCCAGACA GGTTTTGAGA ACGCAGATGA CCATAAATAT ATTCAGACGG TTGCTGCAAA GCACGGAATA TATTTTTCAA AACCCGGTAA CGGAATATGC CATCAGGTTC ACCTGGAGAG ATTTGGCGTA CCGGGAATGA CTCTTCTGGG ATCCGACAGC CATACTCCCA CCGGTGGCGG AATCGGAATG CTTGCCATAG GAGCAGGCGG TCTTGACGTG GCGGTGGCAA TGGGCGGAGG CCCATACTAT ATGATGATGC CCAAAGTATG CAGGGTGGTT TTAAAGGGAG CTTTAAAGCC ATGGGTTACC GCCAAGGACA TAATTCTCGA AGTGCTGAGA AGACTTTCGG TAAAAGGCGG AGTTGGCAAG ATTATCGAGT ATGCCGGAGA CGGCATAAAA ACTCTTACCG TTCCTGAAAG GGCAACCATT ACCAACATGG GAGCGGAGCT TGGCGCCACC ACTTCAATTT TCCCGAGCGA TGAGGTTACA AGGGAGTTTT TGAGGGCCCA GGGAAGAGAG AATGACTGGG TGGAACTTAA GCCCGACGAG GATGCCGAGT ATGACGAAGA GATTGTTATT AATCTTGACG AGCTTGAGCC TCTTGCAGCA CAACCGCACA GCCCGGACAA TGTTGCAAAG GTTAAGGATA TAGGTAAGAT AAAGGTTGAC CAGGTGGCAA TCGGAAGCTG CACCAACTCT TCATACATGG ATATGATGAA GGTGGCTGCA ATACTTAAAG GAAAGAAAGT ACATCCCGAT GTCAGCCTTG TTATTGCACC GGGTTCAAAA CAGGTGCTGA CAATGCTTGC CCAAAACGGT GCGCTGGCTG ACATGGTTGC GGCAGGAGCA AGAATACTCG AAAGCGCCTG CGGACCGTGT ATAGGAATGG GACAGGCTCC GGCAACCGAT GCCGTTTCCT TGAGAACCTT CAACAGAAAC TTTGAGGGAA GAAGCGGTAC AAAGTCTGCC AAAGTTTATT TGGTAAGTCC TGAGACAGCT GCGGCAAGCG CAATAACCGG AGTGCTGATA GACCCGAGGG AATTGGGTGA GGCGCCGAAG GTAAGCATGC CTGAAAAGTT TGTTATTGAT GACAGCATGG TACTGCCGCC CGCACCGGAG GGAGCAGAAG TTGAGGTGGT AAGAGGACCC AACATTAAGC CTTTCCCGAT AAACCAGGCA TTGGCTGACA AAGTTTCCGG CAAAGCTTTG ATAAAAGTTG GGGACAATAT AACCACTGAC CATATTATGC CTTCAAATGC AAAGCTTCTG CCTTTCAGGT CAAATGTGCC GTACCTTGCG GAATTCTGCC TTACACCTTG CGACCCTGAT TTTCCGAAGA GGGCAAAGGA AAACGGCGGC GGATTTATCA TCGGCGGTTC AAACTACGGA CAGGGTTCAA GCCGTGAACA TGCTGCATTG GCTCCACTTC AGCTCGGAGT AAAGGGAGTT ATAGCAAAAT CTTTTGCAAG AATTCATATG GCAAACCTCA TTAACTCGGG TATTATCCCC ATGACCTTTG AAAATGAGGC TGATTACGAT GAAATAGACA TGGACGACGA ACTTGTGATT GAAAACGCAA GGGAGCAGAT TAAAAACGGC 
AGCAGCATTG TAGTGAAAAA TGTAACTAAA GGGAAAGATA TTAAAGTAAA TGTTGCTTTG TCGCAAAGAC AAGTGGAAAT AATTCTTGCT GGCGGGCTTT TAAACTATAC GAGGCAGCAG AATCAGTGA
|
Protein sequence | MGLNLAQKII KEHLVSGEMK PGTEIAIRID QTLTQDSTGT MAYLQFEAMG IPRVKTKKSV AYIDHNTLQT GFENADDHKY IQTVAAKHGI YFSKPGNGIC HQVHLERFGV PGMTLLGSDS HTPTGGGIGM LAIGAGGLDV AVAMGGGPYY MMMPKVCRVV LKGALKPWVT AKDIILEVLR RLSVKGGVGK IIEYAGDGIK TLTVPERATI TNMGAELGAT TSIFPSDEVT REFLRAQGRE NDWVELKPDE DAEYDEEIVI NLDELEPLAA QPHSPDNVAK VKDIGKIKVD QVAIGSCTNS SYMDMMKVAA ILKGKKVHPD VSLVIAPGSK QVLTMLAQNG ALADMVAAGA RILESACGPC IGMGQAPATD AVSLRTFNRN FEGRSGTKSA KVYLVSPETA AASAITGVLI DPRELGEAPK VSMPEKFVID DSMVLPPAPE GAEVEVVRGP NIKPFPINQA LADKVSGKAL IKVGDNITTD HIMPSNAKLL PFRSNVPYLA EFCLTPCDPD FPKRAKENGG GFIIGGSNYG QGSSREHAAL APLQLGVKGV IAKSFARIHM ANLINSGIIP MTFENEADYD EIDMDDELVI ENAREQIKNG SSIVVKNVTK GKDIKVNVAL SQRQVEIILA GGLLNYTRQQ NQ
|
| |
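The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, in which several alternative codons can act as initiators and are read as methionine at the first position. The sketch below is a minimal, standard-library illustration of that behaviour on the first 30 bases of the gene sequence. The function name `translate_cds`, the `START_CODONS` set, and the handling of the 10-base spacing are assumptions made for this example; the start codon set should be checked against the NCBI genetic code table before relying on it.

```python
# Build the codon-to-amino-acid map in NCBI order (bases T, C, A, G).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}

# Assumption: initiator codons for table 11; verify against NCBI.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS given as text, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
prefix = "GTGGGTTTGA ATTTGGCACA AAAAATAATT"
print(translate_cds(prefix))
```

The translated prefix can be compared with the first ten residues of the protein sequence in the record (MGLNLAQKII). Passing the full gene sequence through the same function is one way to check the whole record.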