Gene Teth514_0525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_0525 
SymbolpurH 
ID5876089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp540731 
End bp542257 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content35% 
IMG OID641540861 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001662169 
Protein GI167039184 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAAA AAGCGTTAAT AAGTGTTTCA AAAAAAGAAG GCATAGTTGA ATTTGCCAAA 
AAGCTTAATG AATTAGGATA TGAAATAATA TCAACAGGTG GGACTTATAA TCTTCTTAAA
GAAAATAGAG TAAATGTAGT AAAGGTATCA GACATAACAG GTTTCCCTGA AATTATGGAC
GGGAGAGTAA AGACGCTTCA CCCTAAAATT CATGGAGGGC TTCTTGCAAT AAGAGACAAT
GAAGAGCACA TTAAAGCTCT TAAAGAACAT GGCATTGAAC CAATAGATAT AGTTGTCATA
AATTTATATC CTTTTAAAGA AACAATCCTC AAAGAAAATG TGACTTTAGA AGAAGCGATA
GAAAATATAG ACATTGGCGG TCCTTCAATG ATAAGGGCTG CAGCTAAAAA CTACAAATAT
GTAACTATTC TTGTAGATCC AAAGGATTAT GATACAGTCA TTGAAGAAAT AAAACAATAT
GGCAATACAA AAGAAGAGAC GAGATTTTAT CTGGCAGCAA AAGCTTTTGG CCATACAGCC
CTTTACGATT CTTTGATATA CAATTATTTG ATACAAAAAA ATAACATAGA GTTTCCCGAA
GTTATGGCCT TTGCCTATGA AAAAGCTCAA GACATGAGAT ATGGTGAAAA TCCTCATCAA
AAGGCGGCTT TTTACAAAAA TCCCATAAAA GCCTATGGCA TTGCAGAATG TGAGCAGCTG
CACGGTAAAG AGCTCTCTTT TAACAATATA AACGATGCAA ATGCCGCAAT AGAACTTTTA
AGAGAGTTTA AAGAGCCTGC AGCAGTTGCC GTAAAGCACA CAAATCCTTG TGGAGTAGCT
ATTGCAGATA ATATATACAA TGCTTACTTA AAAGCTTATG AGAGTGACCC TGTTTCTATT
TTTGGGGGAA TTGTAGCATT AAATAGAACT GTAGATGTTA AAACAGCTGA AGAACTTATA
AAAATATTCT TAGAAATAGT AATTGCTCCA GACTTTGAAG AAGAGGCCTT TGAGATTTTA
AAGAAAAAGA AAAACTTAAG GATATTGAGG TTAAAAGAAG GATATGAAAA AGAATATGAT
TTAAAGAAAG TAGAAGGCGG ACTTTTAGTA CAAGAAAAAG ATGAAATAGA TTTAGATGAG
AATAATTTGA AAGTAGTTAC TAAAAAAGCA CCTACGCAAA AAGAGATGGA AGATTTAAGG
TTTGCCTGGA AGGTAGTAAA GCACGTAAAA TCTAATGCGA TTGTTTTAGC AAAAGATGGA
GCTACAGTAG GTATTGGCGT TGGACAAGTC AACAGAATAT GGCCAACAGA GCAAGCAATC
AAACAAGCAG GTAGCAAAGC AAAAGGAAGT GTCCTTGCAT CAGATGCCTT TTTCCCATTT
CCAGATGTTG TGGAAGCAGC TGTAAAAGGC GGTATAACTG CCATAATCCA ACCAGGCGGC
TCACAAAACG ATGCTTTATC AATTGAAGCT GCAGATAAAG GCGGCGTATC AATGATATTC
ACAGGCATAA GGCATTTTAA ACATTGA
 
Protein sequence
MAKKALISVS KKEGIVEFAK KLNELGYEII STGGTYNLLK ENRVNVVKVS DITGFPEIMD 
GRVKTLHPKI HGGLLAIRDN EEHIKALKEH GIEPIDIVVI NLYPFKETIL KENVTLEEAI
ENIDIGGPSM IRAAAKNYKY VTILVDPKDY DTVIEEIKQY GNTKEETRFY LAAKAFGHTA
LYDSLIYNYL IQKNNIEFPE VMAFAYEKAQ DMRYGENPHQ KAAFYKNPIK AYGIAECEQL
HGKELSFNNI NDANAAIELL REFKEPAAVA VKHTNPCGVA IADNIYNAYL KAYESDPVSI
FGGIVALNRT VDVKTAEELI KIFLEIVIAP DFEEEAFEIL KKKKNLRILR LKEGYEKEYD
LKKVEGGLLV QEKDEIDLDE NNLKVVTKKA PTQKEMEDLR FAWKVVKHVK SNAIVLAKDG
ATVGIGVGQV NRIWPTEQAI KQAGSKAKGS VLASDAFFPF PDVVEAAVKG GITAIIQPGG
SQNDALSIEA ADKGGVSMIF TGIRHFKH