Gene Cthe_1246 details

Gene Information

Locus tag: Cthe_1246
Symbol: purH
ID: 4809751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1510165
End bp: 1511709
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 43%
IMG OID: 640106669
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001037671
Protein GI: 125973761
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
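
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. A minimal Python sketch of that check follows. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of this database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the untranslated stop codon.
    return gene_len_bp // 3 - 1


# Start bp and End bp copied from the record above.
length = gene_length_bp(1510165, 1511709)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1545 bp) and Protein Length (514 aa) fields of this record.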


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000980462
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAAGC GTGCACTGAT AAGTGTATCC GACAAGACGG GAATTGTTGA AATGGCCCGT 
GAACTTCAAA GCATGGGAGT TGATATTATT TCCACCGGTG GTACTGCAAA GACATTGAGT
GATGCCGGTA TAAAGGTAAT AAACATATCG GATGTTACCG GTTTTCCGGA ATGTCTTGAC
GGAAGGGTAA AAACCCTTCA TCCCAAAGTT CATGCGGGAA TTCTTGCAAT AAGAAGCAAT
GAGGAACATA TGAGACAGCT GAAAGAGCTT AACATAGAGA CAATAGACAT GGTAATCATC
AATCTTTATC CGTTCAAGCA GACGATATTA AAAGAAAACG TTGACCTTTC GGAGGCAATT
GAAAATATTG ATATAGGCGG ACCTACAATG ATTAGAGCTG CGGCAAAGAA TTATCAGGAC
GTGGTTGTAA TTGTTGACCC TTCAGACTAT GCTGCCGTAT TGGAAGAGCT TAAGACTACG
AAGGATGTAT CATTGAAAAC CAAGTTCAAG CTGGCATATA AAGTGTTTGA ACATACAAGT
CATTATGATA CTTTAATTGC AAAGTATTTA AGAGAGCAAA TCGGAGAAGA CGAGTTCCCT
CAAACCCTTT CTCTGACCTT TGAAAAGGTC CAGGATATGA GATATGGTGA AAATCCCCAC
CAAAAAGCGG TGTTCTATAA AGAAGTGGGA GCGAATGTCG GCTGTATAAC GGCTGCAAAA
CAGCTGCACG GAAAGGAACT TTCCTATAAC AATATAAATG ATGCAAACGG TGCCATAGAA
ATCATAAAAG AGTTTGACGA ACCCACCGTG GTGGCGGTGA AACATGCAAA TCCGTGTGGT
GTGGCAAGTG CTTCAAATAT ATATGATGCT TATATAAAGG CATATGAGGC GGATCCTGTG
TCCATATTCG GCGGTATTAT TGCGGCCAAC AGGGAAATTG ACGAAAAAAC GGCCGAGGAA
ATAAACAAGA TTTTTGTTGA GATAGTTATC GCACCGTCCT TTACTGAAGG GGCATTAAAA
ATTCTTACCC AGAAGAAGAA CATAAGACTG CTTCAGCTTG AGGACATTTC GGCTAAAATT
CCAAAGGGAA CTTATGACAT GAAGAAAGTG CCGGGAGGCT TGCTGGTGCA AAATTACAAC
AGTGAACTTC TTAATATGGA CGATTTGAAA GTTGTTACGG AAAAGAAACC TACCCAGGAA
GAATTGGAAG ATCTCATTTT TGCCATGAAA GTTGTAAAGC ATACCAAATC CAACGGTATT
GCGCTGGCAA AGGGCAAGCA GACTATTGGA GTCGGACCGG GTCAGACCAA CAGAGTAACG
GCCTGCAAGA TTGCCATTGA ATATGGCGGG GAAAGGACAA AAGGAGCCGT TCTTGCATCG
GATGCCTTCT TCCCGTTTGC TGACTGTGTT GAGGCGGCAG CTGCTGCGGG CATTACTGCA
ATTATCCAGC CCGGAGGCTC GATAAGGGAT CAGGAATCCA TTGATGCATG CAACAAGTAT
GGCATTGCAA TGGTATTTAC GGGAATGAGA CATTTTAAGC ATTGA
 
Protein sequence
MIKRALISVS DKTGIVEMAR ELQSMGVDII STGGTAKTLS DAGIKVINIS DVTGFPECLD 
GRVKTLHPKV HAGILAIRSN EEHMRQLKEL NIETIDMVII NLYPFKQTIL KENVDLSEAI
ENIDIGGPTM IRAAAKNYQD VVVIVDPSDY AAVLEELKTT KDVSLKTKFK LAYKVFEHTS
HYDTLIAKYL REQIGEDEFP QTLSLTFEKV QDMRYGENPH QKAVFYKEVG ANVGCITAAK
QLHGKELSYN NINDANGAIE IIKEFDEPTV VAVKHANPCG VASASNIYDA YIKAYEADPV
SIFGGIIAAN REIDEKTAEE INKIFVEIVI APSFTEGALK ILTQKKNIRL LQLEDISAKI
PKGTYDMKKV PGGLLVQNYN SELLNMDDLK VVTEKKPTQE ELEDLIFAMK VVKHTKSNGI
ALAKGKQTIG VGPGQTNRVT ACKIAIEYGG ERTKGAVLAS DAFFPFADCV EAAAAAGITA
IIQPGGSIRD QESIDACNKY GIAMVFTGMR HFKH
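
The sequences above are displayed in space-separated blocks of 10 characters. Before any downstream use, such as recomputing the GC content field, the blocks need to be joined into one string. The following is a rough Python sketch of that step; the helper names `join_blocks` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence, so its result is not the whole-gene GC content.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGATAAAGC GTGCACTGAT AAGTGTATCC GACAAGACGG GAATTGTTGA AATGGCCCGT"
seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence listing to `join_blocks` in the same way would give the string needed to compare against the Gene Length and GC content fields.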