Gene Bcer98_0277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_0277 
SymbolpurH 
ID5344707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp318322 
End bp319857 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content39% 
IMG OID640837865 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001373635 
Protein GI152974118 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC GTGCATTAGT AAGTGTTTCT AATAAAACAG GAGTAGTAGA ATTCGTGAAA 
GGTTTGCTTG AACAAGGAAT TGAAGTGATT TCAACAGGTG GCACGAAAAA ATTATTAGAG
GAAAATGGCT TACAAGTAAT GGGGATTTCT GAAGTAACAG GTTTCCCAGA GATTATGGAT
GGTCGTGTCA AAACATTACA TCCCAATATT CATGGTGGAT TACTTGCAGT GCGTGATAAT
GAAGCGCATG TAGCAGAAAT GAGCGAATTA GGCATTCAGC TGATTGATTT TGTCGTTGTA
AATTTATACC CATTTAAAGA GACGATTGCT AAGCCTGATG TAACATTTGC TGATGCGATT
GAAAATATTG ATATCGGTGG TCCAACAATG ATTCGTTCAG CTGCAAAAAA TCATAAATTT
GTATCAGTAA TTGTAGATCC AGCAGACTAT GACGTTGTAT TAGCTGAATT AAAAGAAAAA
GGTGAGGTTA CAGACGAAAC AAAGCGTAAA TTAGCAGCGA AAGTATTCCG TCATACAGCG
GCATATGATG CATTAATCTC AAACTATTTG ACTGAACAAA TGGGAGAAGA AAGCCCAGAA
ATATTAACAG TAACATTCGA GAAAAAGCAA GATTTACGTT ACGGAGAAAA TCCGCATCAA
AAAGCAACAT TCTATAAAGC ACCATTTGCG GTGGCTTCTT CTGTTGCATA TGCAGAGCAA
TTGCACGGAA AAGAACTATC TTATAACAAC ATCAATGACG CAGATGCAGC GCTTAGCATT
GTGAAAGAAT TTACAGAACC AGCGGTAGTC GCCGTAAAAC ATATGAATCC ATGCGGAGTT
GGTGTTGGTA CGGATATCCA TGAAGCGTAT ACACGTGCTT ATGAGGCGGA TCCAGTATCA
ATCTTCGGCG GCATTATTGC AGCGAACCGT GAAATTGATA AACGTGTGGC AGAGAAATTA
CATGAAATCT TCTTAGAAAT TATTATTGCA CCTTCATTTT CGAAAGAGGC TTTAGAAGTA
TTGCAAAGTA AGAAAAACTT ACGTCTGTTA ACGGTAAATA TTGAGAAGAC AACAAGTGCA
AGTAAAAAAC TAACTTCTGT TCAAGGTGGA CTTCTTGTTC AAGAAGAAGA TACGTTAGCG
CTAAATGAAG AGACAATCAT AATTCCTACA AAACGTGAAC CAACAGAGCA AGAATGGAAC
GACTTAAAAT TAGCTTGGAA AGTTGTAAAA CATGTAAAAT CAAATGCAAT TGTACTCGCA
AAAGATAATA TGACAATCGG TGTTGGTGCT GGACAAATGA ATCGTGTTGG TTCTGCAAAA
ATTGCAATCT CGCAAGCAGG TAGCAAAGCG CAAGGTAGCG CCTTAGCATC CGATGCGTTC
TTCCCAATGC CAGACACAGT AGAAGAGGCC GCAAAAGCGG GGATTACAGC AATCATTCAA
CCAGGCGGGT CAATCCGTGA CGAAGATTCG ATTAAAAAAG CGGATGAATA TGGGATTACG
ATGGTGTTCA CGGGCGTACG TCATTTCAAA CATTAA
 
Protein sequence
MKKRALVSVS NKTGVVEFVK GLLEQGIEVI STGGTKKLLE ENGLQVMGIS EVTGFPEIMD 
GRVKTLHPNI HGGLLAVRDN EAHVAEMSEL GIQLIDFVVV NLYPFKETIA KPDVTFADAI
ENIDIGGPTM IRSAAKNHKF VSVIVDPADY DVVLAELKEK GEVTDETKRK LAAKVFRHTA
AYDALISNYL TEQMGEESPE ILTVTFEKKQ DLRYGENPHQ KATFYKAPFA VASSVAYAEQ
LHGKELSYNN INDADAALSI VKEFTEPAVV AVKHMNPCGV GVGTDIHEAY TRAYEADPVS
IFGGIIAANR EIDKRVAEKL HEIFLEIIIA PSFSKEALEV LQSKKNLRLL TVNIEKTTSA
SKKLTSVQGG LLVQEEDTLA LNEETIIIPT KREPTEQEWN DLKLAWKVVK HVKSNAIVLA
KDNMTIGVGA GQMNRVGSAK IAISQAGSKA QGSALASDAF FPMPDTVEEA AKAGITAIIQ
PGGSIRDEDS IKKADEYGIT MVFTGVRHFK H