Gene GWCH70_0263 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0263 
SymbolpurH 
ID7976129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp297341 
End bp298879 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content47% 
IMG OID644797258 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002948458 
Protein GI239825834 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTAA AACGGGCATT ATTAAGTGTA TCGAACAAAG AAGGAATCGT ATCATTGGCG 
AAACAGTTAG TGGAACTCGG TGTGGAAATT ATTTCAACAG GCGGAACGAA AAAAACGTTA
GAAGAAGCGG GAGTTCCTGT CATAGGCATT TCTGATGTTA CCGGATTTCC GGAAATTTTA
GATGGGCGCG TCAAAACATT GCATCCGATG ATCCATGGGG GACTTCTTGC GATTCGCGAC
AATGAGCGCC ATCAAAGCGA GCTTCGCGAG CATCACATTA CCCCAATTGA CCTTGTTGTC
GTCAATTTGT ACCCGTTTCA ACAAACGATC GCCAAAAGCG ATGTGACGTT TGCCGAAGCG
ATCGAAAACA TTGATATTGG CGGCCCGACG ATGCTGCGGG CGGCGGCGAA AAACCATCAA
TATGTGACGG TTGTCGTTGA CCCGGCCGAC TATGATACGG TAGTGCAAGA ACTAAAAAAA
CACGGAGATG TCTCGGTAGA AACGAAATTA AAGCTTGCGG CAAAAGTATT TCGCCATACA
GCTGCGTACG ATGCGATGAT TGCGGAGTAC TTAACGAATA AAACTGGAGA AGAGTATCCG
GAATCATTGA CGATCACTTT CGAGAAAAAA CAAGCGTTGC GTTATGGGGA AAACCCTCAC
CAAACTGCGG CGTTTTACAA AAAACCGTTA GGCGCATCTT TCTCGATTGC CCAAGCAATG
CAGCTGCATG GAAAAGAATT GTCATATAAC AACATTAACG ATGCGAATGC GGCATTGCAA
ATCGTCAAAG AATTTACCGA ACCAGCCGCG GTGGCGGTGA AGCATATGAA TCCGTGCGGT
GTCGGTGTTG GCGCAACGAT TTACGAAGCG TTCACCAAAG CGTACGAAGC GGATCCAACC
TCGATTTTTG GCGGCATTAT CGCCCTAAAC CGCGAGGTCG ATAAAGAAAC TGCCGAAAAA
ATGCATGAAA TCTTTTTAGA AATCGTGATT GCCCCATCGT TTAGCAAAGA AGCGCTCGAT
ATTTTAACAC AAAAGAAAAA CATTCGCCTT CTTACCGTTG ATTTCACGGC GCCAAACACG
AAAGAGAAGC TGCTTGTTTC CGTACAAGGC GGATTGCTTG TTCAAGAAAC GGATACGCAT
ACGCTCGATG ACGCGGAGAT TAAAGTTGTA ACGAAACGCG AGCCGACGGA ACAAGAATGG
GAAGCGTTGC GGTTTGCATG GAAAGTGGTG AAACATGTGA AATCGAACGC GATCGTGCTT
GCAAAAGACG GAATGACCAT CGGCATTGGT GCCGGTCAAA TGAATCGCGT CGGCGCGGCG
AAAATTGCGA TTGAACAAGC AGGGGAAAAA GCAAAAGGAG CCGTGCTTGC CTCTGACGCG
TTTTTCCCAA TGGATGATAC GGTGGAGGCG GCAGCGAAAG CAGGAATCAC AGCAATCATC
CAGCCAGGCG GCTCGATTCG CGACGCCGAT TCGATCAAAA AAGCAGATGA ATACGGAATT
GCCATGGTCT TTACAGGAAT CCGCCATTTT AAACATTAA
 
Protein sequence
MAVKRALLSV SNKEGIVSLA KQLVELGVEI ISTGGTKKTL EEAGVPVIGI SDVTGFPEIL 
DGRVKTLHPM IHGGLLAIRD NERHQSELRE HHITPIDLVV VNLYPFQQTI AKSDVTFAEA
IENIDIGGPT MLRAAAKNHQ YVTVVVDPAD YDTVVQELKK HGDVSVETKL KLAAKVFRHT
AAYDAMIAEY LTNKTGEEYP ESLTITFEKK QALRYGENPH QTAAFYKKPL GASFSIAQAM
QLHGKELSYN NINDANAALQ IVKEFTEPAA VAVKHMNPCG VGVGATIYEA FTKAYEADPT
SIFGGIIALN REVDKETAEK MHEIFLEIVI APSFSKEALD ILTQKKNIRL LTVDFTAPNT
KEKLLVSVQG GLLVQETDTH TLDDAEIKVV TKREPTEQEW EALRFAWKVV KHVKSNAIVL
AKDGMTIGIG AGQMNRVGAA KIAIEQAGEK AKGAVLASDA FFPMDDTVEA AAKAGITAII
QPGGSIRDAD SIKKADEYGI AMVFTGIRHF KH