Gene Information |
Locus tag | Cthe_1246 |
Symbol | purH |
ID | 4809751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1510165 |
End bp | 1511709 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106669 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001037671 |
Protein GI | 125973761 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
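The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is an illustration, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1510165
end_bp = 1511709
gene_length_bp = 1545
protein_length_aa = 514


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon (assumed to be present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the computed values with the ones reported in the record.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold: 1511709 - 1510165 + 1 = 1545 bp, and 1545 / 3 - 1 = 514 aa.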
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000980462 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAGC GTGCACTGAT AAGTGTATCC GACAAGACGG GAATTGTTGA AATGGCCCGT GAACTTCAAA GCATGGGAGT TGATATTATT TCCACCGGTG GTACTGCAAA GACATTGAGT GATGCCGGTA TAAAGGTAAT AAACATATCG GATGTTACCG GTTTTCCGGA ATGTCTTGAC GGAAGGGTAA AAACCCTTCA TCCCAAAGTT CATGCGGGAA TTCTTGCAAT AAGAAGCAAT GAGGAACATA TGAGACAGCT GAAAGAGCTT AACATAGAGA CAATAGACAT GGTAATCATC AATCTTTATC CGTTCAAGCA GACGATATTA AAAGAAAACG TTGACCTTTC GGAGGCAATT GAAAATATTG ATATAGGCGG ACCTACAATG ATTAGAGCTG CGGCAAAGAA TTATCAGGAC GTGGTTGTAA TTGTTGACCC TTCAGACTAT GCTGCCGTAT TGGAAGAGCT TAAGACTACG AAGGATGTAT CATTGAAAAC CAAGTTCAAG CTGGCATATA AAGTGTTTGA ACATACAAGT CATTATGATA CTTTAATTGC AAAGTATTTA AGAGAGCAAA TCGGAGAAGA CGAGTTCCCT CAAACCCTTT CTCTGACCTT TGAAAAGGTC CAGGATATGA GATATGGTGA AAATCCCCAC CAAAAAGCGG TGTTCTATAA AGAAGTGGGA GCGAATGTCG GCTGTATAAC GGCTGCAAAA CAGCTGCACG GAAAGGAACT TTCCTATAAC AATATAAATG ATGCAAACGG TGCCATAGAA ATCATAAAAG AGTTTGACGA ACCCACCGTG GTGGCGGTGA AACATGCAAA TCCGTGTGGT GTGGCAAGTG CTTCAAATAT ATATGATGCT TATATAAAGG CATATGAGGC GGATCCTGTG TCCATATTCG GCGGTATTAT TGCGGCCAAC AGGGAAATTG ACGAAAAAAC GGCCGAGGAA ATAAACAAGA TTTTTGTTGA GATAGTTATC GCACCGTCCT TTACTGAAGG GGCATTAAAA ATTCTTACCC AGAAGAAGAA CATAAGACTG CTTCAGCTTG AGGACATTTC GGCTAAAATT CCAAAGGGAA CTTATGACAT GAAGAAAGTG CCGGGAGGCT TGCTGGTGCA AAATTACAAC AGTGAACTTC TTAATATGGA CGATTTGAAA GTTGTTACGG AAAAGAAACC TACCCAGGAA GAATTGGAAG ATCTCATTTT TGCCATGAAA GTTGTAAAGC ATACCAAATC CAACGGTATT GCGCTGGCAA AGGGCAAGCA GACTATTGGA GTCGGACCGG GTCAGACCAA CAGAGTAACG GCCTGCAAGA TTGCCATTGA ATATGGCGGG GAAAGGACAA AAGGAGCCGT TCTTGCATCG GATGCCTTCT TCCCGTTTGC TGACTGTGTT GAGGCGGCAG CTGCTGCGGG CATTACTGCA ATTATCCAGC CCGGAGGCTC GATAAGGGAT CAGGAATCCA TTGATGCATG CAACAAGTAT GGCATTGCAA TGGTATTTAC GGGAATGAGA CATTTTAAGC ATTGA
|
Protein sequence | MIKRALISVS DKTGIVEMAR ELQSMGVDII STGGTAKTLS DAGIKVINIS DVTGFPECLD GRVKTLHPKV HAGILAIRSN EEHMRQLKEL NIETIDMVII NLYPFKQTIL KENVDLSEAI ENIDIGGPTM IRAAAKNYQD VVVIVDPSDY AAVLEELKTT KDVSLKTKFK LAYKVFEHTS HYDTLIAKYL REQIGEDEFP QTLSLTFEKV QDMRYGENPH QKAVFYKEVG ANVGCITAAK QLHGKELSYN NINDANGAIE IIKEFDEPTV VAVKHANPCG VASASNIYDA YIKAYEADPV SIFGGIIAAN REIDEKTAEE INKIFVEIVI APSFTEGALK ILTQKKNIRL LQLEDISAKI PKGTYDMKKV PGGLLVQNYN SELLNMDDLK VVTEKKPTQE ELEDLIFAMK VVKHTKSNGI ALAKGKQTIG VGPGQTNRVT ACKIAIEYGG ERTKGAVLAS DAFFPFADCV EAAAAAGITA IIQPGGSIRD QESIDACNKY GIAMVFTGMR HFKH