Gene BCG9842_B4976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4976 
SymbolpurH 
ID7184182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp305320 
End bp306855 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content38% 
IMG OID643548104 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002443816 
Protein GI218895405 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones155 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC GTGCATTAGT AAGTGTTTCA GATAAAACAG GAGTAGTAGA ATTTGTTAAA 
GGGTTACTAG AACAAGGGAT TGAAGTTATT TCAACAGGTG GTACGAAAAA GTTACTAGAA
GCAAACGGCT TACAAGTAAT TGGTATTTCT GAAGTAACTG GTTTCCCTGA AATTATGGAT
GGCCGTGTGA AAACATTACA TCCAAATATT CATGGTGGTC TATTAGCAGT TCGTGATAAT
GAAACACACG TAATACAAAT GAATGAATTA GGTATTCAGC CAATTGACTT TGTTGTTGTT
AACTTATACC CATTTAAAGA AACAATCGCT AAGCCTGATG TAACATTTGC TGATGCAATT
GAAAATATTG ATATCGGTGG CCCGACAATG ATTCGCTCTG CTGCGAAAAA TCATCAATTC
GTATCTGTAA TTGTAGATCC AGTAGATTAT GATGTTGTAT TAGCAGAACT AAAAGAGAAC
GGCGAAGTAA TGGATGAAAC GAAACGTAAA CTAGCAGCGA AAGTATTCCG TCATACAGCA
GCATATGATG CGCTAATTTC TAACTATTTA ACAGAGCAAA TGGGTGAAGA AAGTCCAGAA
ACATTAACTG TGACATTCGA GAAAAAGCAA GACTTACGTT ATGGCGAGAA CCCACATCAA
AAGGCAACTT TCTATAAAGC GCCATTCGCA GCAACTTCTT CTGTTGCATA CGCAGAACAA
TTACACGGTA AAGAATTATC GTATAACAAT ATTAATGATG CAGACGCAGC GCTCAGCATC
GTAAAAGAAT TTACAGAACC AGCAGTAGTA GCAGTAAAAC ATATGAATCC ATGTGGTGTT
GGAGTAGGAA CTGATATTCA CGAAGCATAT ACTCGTGCTT ATGAAGCAGA TCCAGTATCA
ATCTTCGGCG GTATTATTGC AGCAAATCGT GAAATTGATA AAGCTACAGC AGAAAAGTTA
CACGAAATTT TCTTAGAAAT CATTATTGCA CCTTCTTTCT CAAAAGAAGC TTTAGAAGTA
TTGCAAAGTA AGAAAAACTT ACGTCTACTA ACTGTAAATA TTGAAAAAGC GACAAGTGCA
AGCAAAAAAC TAACTTCTGT ACAAGGTGGG CTTCTCGTTC AAGAGGAAGA TACGTTATCA
TTAGATGAAA GTACAATTTC AATTCCAACG AAACGTGAAC CTTCAGAGCA AGAATGGAAA
GATTTAAAAC TAGCTTGGAA AGTTGTAAAG CATGTGAAAT CAAATGCAAT TGTTTTAGCG
AAAGATGATA TGACAATTGG TGTCGGTGCA GGGCAGATGA ACCGTGTAGG TTCTGCAAAA
ATCGCAATTA CACAAGCTGG TGAAAAAGCA CAAGGTAGCG CACTTGCATC TGATGCTTTC
TTCCCAATGC CAGATACATT AGAAGAAGCA GCAAAAGCAG GAATTACAGC AATCATTCAA
CCGGGCGGAT CAATCCGTGA TGAAGATTCT ATTAAAGTGG CGGATACGTA TGGGATTGCT
ATGGTGTTCA CTGGCGTACG TCATTTCAAA CACTAA
 
Protein sequence
MKKRALVSVS DKTGVVEFVK GLLEQGIEVI STGGTKKLLE ANGLQVIGIS EVTGFPEIMD 
GRVKTLHPNI HGGLLAVRDN ETHVIQMNEL GIQPIDFVVV NLYPFKETIA KPDVTFADAI
ENIDIGGPTM IRSAAKNHQF VSVIVDPVDY DVVLAELKEN GEVMDETKRK LAAKVFRHTA
AYDALISNYL TEQMGEESPE TLTVTFEKKQ DLRYGENPHQ KATFYKAPFA ATSSVAYAEQ
LHGKELSYNN INDADAALSI VKEFTEPAVV AVKHMNPCGV GVGTDIHEAY TRAYEADPVS
IFGGIIAANR EIDKATAEKL HEIFLEIIIA PSFSKEALEV LQSKKNLRLL TVNIEKATSA
SKKLTSVQGG LLVQEEDTLS LDESTISIPT KREPSEQEWK DLKLAWKVVK HVKSNAIVLA
KDDMTIGVGA GQMNRVGSAK IAITQAGEKA QGSALASDAF FPMPDTLEEA AKAGITAIIQ
PGGSIRDEDS IKVADTYGIA MVFTGVRHFK H