Gene BCAH820_0330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_0330 
SymbolpurH 
ID7190462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp313342 
End bp314877 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content38% 
IMG OID643553741 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002449322 
Protein GI218901488 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones321 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC GTGCATTAGT AAGTGTTTCA GATAAAACAG GAGTAGTAGA ATTTGTTAAA 
GGTTTACTAG AACAAGGGAT CGAAGTTATA TCAACAGGTG GTACGAAAAA ATTACTAGAA
GAAAACGGCT TACAAGTAAT CGGTATTTCT GAAGTAACTG GTTTCCCAGA AATTATGGAT
GGTCGTGTGA AAACATTACA TCCAAATATT CATGGTGGTC TATTAGCAGT TCGTGATAAT
GAAACACATG TAGCGCAAAT GAATGAATTA GGTATGGAGC CGATTGATTT TGTTGTCGTT
AACTTATACC CATTCAAAGA AACGATCGCT AAGCCTGATG TAACATTTGC TGATGCAATT
GAAAATATTG ATATCGGTGG CCCGACAATG ATTCGCTCTG CTGCGAAAAA TCATAAATTC
GTATCTGTAA TTGTAGATCC AGTAGATTAT GATGTTGTAT TAGCTGAATT AAAAGAGAAC
GGTGAAGTAG CAGAGGAAAC AAAACGTAAA CTAGCAGCGA AAGTATTCCG TCATACAGCA
GCGTATGATG CGTTAATTTC TAACTACTTA ACAGAGCAAA TGGGTGAAGA AAGTCCAGAA
ACATTAACTG TGACATTTGA GAAAAAGCAA GACTTACGTT ATGGCGAGAA CCCACATCAA
AAAGCAACTT TCTATAAAGC GCCATTCGCA GCAACTTCTT CTGTTGCATA CGCAGAACAA
TTACATGGCA AGGAATTATC GTATAACAAT ATTAACGATG CAGATGCAGC GCTTAGTATC
GTGAAAGAAT TTACAGAACC AGCAGTAGTA GCGGTAAAAC ATATGAATCC ATGTGGTGTT
GGAGTAGGTA CGGATATTCA TGAAGCATAC ACACGTGCTT ATGAAGCGGA TCCAGTATCC
ATTTTTGGAG GAATTATTGC AGCGAATCGT GAAATTGATA AAGCTACAGC TGAAAAGTTA
CACGAAATTT TCTTAGAGAT TATTATCGCA CCTTCTTTCT CGAAAGAAGC TTTAGAAGTA
CTGCAAAGTA AGAAAAACTT ACGTTTATTA ACTGTAAATA TTGAAAAAGC GACAAGCGCA
AGCAAAAAAT TAACTTCTGT ACAAGGTGGC CTTCTCGTTC AAGAGGAAGA TACGTTATCA
TTAGATGAAA GTACAATTTC AATTCCAACG AAACGTGAAC CTTCAGAGCA AGAATGGAAA
GATTTAAAAC TAGCTTGGAA AGTTGTAAAG CATGTGAAAT CAAATGCAAT TGTTTTAGCA
AAAGATGATA TGACAATTGG TGTCGGTGCT GGACAGATGA ACCGTGTAGG TTCTGCAAAA
ATCGCAATTA CACAAGCTGG CGAAAAAGCA CAAGGTAGCG CACTTGCATC TGATGCTTTC
TTCCCAATGC CAGATACAGT AGAAGAAGCA GCAAAAGCAG GAATTACGGC AATCATTCAA
CCAGGCGGAT CAATCCGTGA CGAAGATTCT ATTAAAGTGG CGGATACGTA TGGGATTGCT
ATGGTGTTCA CTGGCGTACG TCATTTCAAA CACTAA
 
Protein sequence
MKKRALVSVS DKTGVVEFVK GLLEQGIEVI STGGTKKLLE ENGLQVIGIS EVTGFPEIMD 
GRVKTLHPNI HGGLLAVRDN ETHVAQMNEL GMEPIDFVVV NLYPFKETIA KPDVTFADAI
ENIDIGGPTM IRSAAKNHKF VSVIVDPVDY DVVLAELKEN GEVAEETKRK LAAKVFRHTA
AYDALISNYL TEQMGEESPE TLTVTFEKKQ DLRYGENPHQ KATFYKAPFA ATSSVAYAEQ
LHGKELSYNN INDADAALSI VKEFTEPAVV AVKHMNPCGV GVGTDIHEAY TRAYEADPVS
IFGGIIAANR EIDKATAEKL HEIFLEIIIA PSFSKEALEV LQSKKNLRLL TVNIEKATSA
SKKLTSVQGG LLVQEEDTLS LDESTISIPT KREPSEQEWK DLKLAWKVVK HVKSNAIVLA
KDDMTIGVGA GQMNRVGSAK IAITQAGEKA QGSALASDAF FPMPDTVEEA AKAGITAIIQ
PGGSIRDEDS IKVADTYGIA MVFTGVRHFK H