Gene Information |
Locus tag | Mpe_A0485 |
Symbol | purH |
ID | 4787070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 525930 |
End bp | 527525 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640089043 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001019682 |
Protein GI | 124265678 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.371937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.063337 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACCG CCCTCCTCTC CGTTTCCGAC AAGACCGGCA TCGTCGAACT CGCCCGGTCC CTGCATGCGC TGGGCGTGAA GCTGCTCTCG ACCGGTGGCA CGGCCAGGCT GCTGGCCGAC AGCGGCCTCC CGGTCACCGA GGTGGCCGAC CACACCGGCT TCCCCGAAAT GCTCGACGGT CGCGTGAAGA CGCTGCACCC GACCATCCAC GGCGGCCTGC TGGCGCGCCG CGACCTGCCG GCGCACATGG CCTCGCTGGC CGCGCACGGC ATCGAGACGA TCGACCTGCT GGTGGTCAAC CTCTACCCCT TCGAGGCCAC AGTCGCGAAG CCCGGCTGCA CGCTGGAGGA CGCGATCGAG AACATCGACA TCGGCGGACC GGCGATGGTG CGTTCGGCCG CCAAGAACTG GAAGGACGTG GCGGTGCTGA CCGACGCCTC GCAGTACGCC GGCGTGCTGG CCGACCTGCA GCAGGACGGC CGGGTGAGCG AGAGCACGCG CTTCGCGCTT GCGGTCGCGG CCTTCAACCG CATCAGCAAC TACGACGCGG CCATCAGCGA CCACCTGTCG GCGCTGCGCC CCGACGGCAC GCGCGCCGAG TTCCCGGCGC AAAGCAACGG CCGCTTCGTC AAGCTGCAGG ACCTGCGCTA CGGCGAGAAC CCGCACCAGA GCGCCGCGTT CTACCGCGAC CTGCACCCGG CGCCTGGCTC GCTGGTGAGC GCCGTGCAGC TGCAGGGCAA GGAGCTGTCG TACAACAACA TCGCCGACGC CGATGCGGCG TGGGAGTGCG TGAAAGGCTT CGACGCTTCC GTCGACGGGC CGGCCTGCGT GATCGTCAAG CACGCCAACC CCTGCGGCGT GGCCCTCGGC GCCAACGCGG CCGAGGCCTA TGGCAAGGCC TTCCGCACCG ACCCGACCTC CGCGTTCGGC GGCATCATCG CCTTCAACGT TCCGGTCGAC GGCGCGGCAG CGCAGGCGAT CGCGAAGCAG TTCGTCGAGG TGCTGATCGC CCCCGGCTAT ACCGACGAGG CGCGCGCCGT GTTCGCCGCC AAGGCCAACA CGCGCGTGCT GCAGATCTCG CTCGACGGCG TGCAGCGCGA CGCGCCCGAC GCCTGGTCGC GCGGCCTCAA TTCGCACGAC ATCAAGCGCG TCGGCTCGGG TCTGCTGATC CAGAGCGCCG ACAACCACGT GCTCGGACTG CAGGACCTGA AAGTCGTCAC GAAGCTGGCG CCGACCGACG GACAGCTGGC CGACCTGCTG TTCGCGTGGA AGGTGGCCAA GTTCGTCAAG AGCAATGCCA TCGTGTTCTG CGGCGACGGC ATGACGCTCG GCGTCGGCGC CGGCCAGATG AGCCGGCTCG ACAGCGCGCG CATCGCCAGC ATCAAGGCCA GCCACGCCGA CCTGAGCCTG GCCGGCTCGG CGGTCGCGAG CGACGCCTTC TTCCCGTTCC GCGACGGCCT CGACGTGCTG GCCGATGCCG GAGCGCGCAG CGTCATCCAG CCCGGCGGCA GCCTGCGCGA CGACGAGGTG ATCGCCGCCG CCAACGAACG CGGCATCGCG ATGGTGCTGA CAGGTGTGCG TCACTTCAGG CACTGA
|
Protein sequence | MATALLSVSD KTGIVELARS LHALGVKLLS TGGTARLLAD SGLPVTEVAD HTGFPEMLDG RVKTLHPTIH GGLLARRDLP AHMASLAAHG IETIDLLVVN LYPFEATVAK PGCTLEDAIE NIDIGGPAMV RSAAKNWKDV AVLTDASQYA GVLADLQQDG RVSESTRFAL AVAAFNRISN YDAAISDHLS ALRPDGTRAE FPAQSNGRFV KLQDLRYGEN PHQSAAFYRD LHPAPGSLVS AVQLQGKELS YNNIADADAA WECVKGFDAS VDGPACVIVK HANPCGVALG ANAAEAYGKA FRTDPTSAFG GIIAFNVPVD GAAAQAIAKQ FVEVLIAPGY TDEARAVFAA KANTRVLQIS LDGVQRDAPD AWSRGLNSHD IKRVGSGLLI QSADNHVLGL QDLKVVTKLA PTDGQLADLL FAWKVAKFVK SNAIVFCGDG MTLGVGAGQM SRLDSARIAS IKASHADLSL AGSAVASDAF FPFRDGLDVL ADAGARSVIQ PGGSLRDDEV IAAANERGIA MVLTGVRHFR H
|
| |
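The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon, both of which agree with the values listed here. The function names are hypothetical helpers introduced only for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the sequence into 10-character groups."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon (unspliced CDS assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    length = gene_length(525930, 527525)
    print(length)                           # 1596, matches "Gene Length"
    print(expected_protein_length(length))  # 531, matches "Protein Length"
```

Passing the full "Gene sequence" field to `gc_percent` should give a value comparable to the listed GC content, and `len(clean_sequence(...))` applied to either sequence field gives its length without the grouping spaces.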