Gene Mpe_A0485 details


Gene Information

Locus tag: Mpe_A0485
Symbol: purH
ID: 4787070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 525930
End bp: 527525
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 70%
IMG OID: 640089043
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001019682
Protein GI: 124265678
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
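
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(525930, 527525)
print(length, protein_length_aa(length))  # prints: 1596 531
```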


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.371937
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.063337
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACCG CCCTCCTCTC CGTTTCCGAC AAGACCGGCA TCGTCGAACT CGCCCGGTCC 
CTGCATGCGC TGGGCGTGAA GCTGCTCTCG ACCGGTGGCA CGGCCAGGCT GCTGGCCGAC
AGCGGCCTCC CGGTCACCGA GGTGGCCGAC CACACCGGCT TCCCCGAAAT GCTCGACGGT
CGCGTGAAGA CGCTGCACCC GACCATCCAC GGCGGCCTGC TGGCGCGCCG CGACCTGCCG
GCGCACATGG CCTCGCTGGC CGCGCACGGC ATCGAGACGA TCGACCTGCT GGTGGTCAAC
CTCTACCCCT TCGAGGCCAC AGTCGCGAAG CCCGGCTGCA CGCTGGAGGA CGCGATCGAG
AACATCGACA TCGGCGGACC GGCGATGGTG CGTTCGGCCG CCAAGAACTG GAAGGACGTG
GCGGTGCTGA CCGACGCCTC GCAGTACGCC GGCGTGCTGG CCGACCTGCA GCAGGACGGC
CGGGTGAGCG AGAGCACGCG CTTCGCGCTT GCGGTCGCGG CCTTCAACCG CATCAGCAAC
TACGACGCGG CCATCAGCGA CCACCTGTCG GCGCTGCGCC CCGACGGCAC GCGCGCCGAG
TTCCCGGCGC AAAGCAACGG CCGCTTCGTC AAGCTGCAGG ACCTGCGCTA CGGCGAGAAC
CCGCACCAGA GCGCCGCGTT CTACCGCGAC CTGCACCCGG CGCCTGGCTC GCTGGTGAGC
GCCGTGCAGC TGCAGGGCAA GGAGCTGTCG TACAACAACA TCGCCGACGC CGATGCGGCG
TGGGAGTGCG TGAAAGGCTT CGACGCTTCC GTCGACGGGC CGGCCTGCGT GATCGTCAAG
CACGCCAACC CCTGCGGCGT GGCCCTCGGC GCCAACGCGG CCGAGGCCTA TGGCAAGGCC
TTCCGCACCG ACCCGACCTC CGCGTTCGGC GGCATCATCG CCTTCAACGT TCCGGTCGAC
GGCGCGGCAG CGCAGGCGAT CGCGAAGCAG TTCGTCGAGG TGCTGATCGC CCCCGGCTAT
ACCGACGAGG CGCGCGCCGT GTTCGCCGCC AAGGCCAACA CGCGCGTGCT GCAGATCTCG
CTCGACGGCG TGCAGCGCGA CGCGCCCGAC GCCTGGTCGC GCGGCCTCAA TTCGCACGAC
ATCAAGCGCG TCGGCTCGGG TCTGCTGATC CAGAGCGCCG ACAACCACGT GCTCGGACTG
CAGGACCTGA AAGTCGTCAC GAAGCTGGCG CCGACCGACG GACAGCTGGC CGACCTGCTG
TTCGCGTGGA AGGTGGCCAA GTTCGTCAAG AGCAATGCCA TCGTGTTCTG CGGCGACGGC
ATGACGCTCG GCGTCGGCGC CGGCCAGATG AGCCGGCTCG ACAGCGCGCG CATCGCCAGC
ATCAAGGCCA GCCACGCCGA CCTGAGCCTG GCCGGCTCGG CGGTCGCGAG CGACGCCTTC
TTCCCGTTCC GCGACGGCCT CGACGTGCTG GCCGATGCCG GAGCGCGCAG CGTCATCCAG
CCCGGCGGCA GCCTGCGCGA CGACGAGGTG ATCGCCGCCG CCAACGAACG CGGCATCGCG
ATGGTGCTGA CAGGTGTGCG TCACTTCAGG CACTGA
 
Protein sequence
MATALLSVSD KTGIVELARS LHALGVKLLS TGGTARLLAD SGLPVTEVAD HTGFPEMLDG 
RVKTLHPTIH GGLLARRDLP AHMASLAAHG IETIDLLVVN LYPFEATVAK PGCTLEDAIE
NIDIGGPAMV RSAAKNWKDV AVLTDASQYA GVLADLQQDG RVSESTRFAL AVAAFNRISN
YDAAISDHLS ALRPDGTRAE FPAQSNGRFV KLQDLRYGEN PHQSAAFYRD LHPAPGSLVS
AVQLQGKELS YNNIADADAA WECVKGFDAS VDGPACVIVK HANPCGVALG ANAAEAYGKA
FRTDPTSAFG GIIAFNVPVD GAAAQAIAKQ FVEVLIAPGY TDEARAVFAA KANTRVLQIS
LDGVQRDAPD AWSRGLNSHD IKRVGSGLLI QSADNHVLGL QDLKVVTKLA PTDGQLADLL
FAWKVAKFVK SNAIVFCGDG MTLGVGAGQM SRLDSARIAS IKASHADLSL AGSAVASDAF
FPFRDGLDVL ADAGARSVIQ PGGSLRDDEV IAAANERGIA MVLTGVRHFR H
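
The two sequence blocks above can be cross-checked against each other. The sketch below is one possible way to do it, not the database's own tooling: it strips the ten-character grouping, computes the GC fraction, and translates codons with the amino acid assignments of translation table 11, which match the standard code. The helper names are illustrative, and only the first line of the gene sequence is used as input to keep the example short.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest (NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons until the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not mapped to M here;
    that does not matter for this gene, which starts with ATG.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCCACCG CCCTCCTCTC CGTTTCCGAC AAGACCGGCA TCGTCGAACT CGCCCGGTCC"
dna = clean_sequence(first_line)
print(translate(dna))  # prints: MATALLSVSDKTGIVELARS
print(round(gc_fraction(dna), 4))
```

Passing the full gene sequence through the same functions should reproduce the protein sequence listed above and give a GC fraction close to the reported 70%.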