Gene Information |
Locus tag | ECH74115_5476 |
Symbol | purH |
ID | 6967779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5117860 |
End bp | 5119449 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643389122 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002273523 |
Protein GI | 209398930 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
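
The coordinate and length fields in this record agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length(5117860, 5119449)
print(length, protein_length(length))  # prints "1590 529"
```

Both results match the "Gene Length" (1590 bp) and "Protein Length" (529 aa) fields.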
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000710529 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA AGCCGGTATC GTCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC AGGGGGCACT GCCCGTCTGT TAGCAGAAAA AGGTCTGCCG GTAACCGAAG TTTCCGATTA CACCGGTTTC CCGGAGATGA TGGATGGACG CGTGAAGACC CTGCATCCGA AAGTACATGG TGGCATTCTG GGCCGTCGCG GCCAGGACGA TGCCATTATG GAAGAACATC AGATCCAGCC TATCGATATG GTGGTTGTTA ACCTGTATCC GTTCGCCCAG ACCGTGGCCC GTGAAGGTTG CTCGCTGGAA GATGCGGTTG AGAACATCGA TATCGGCGGC CCAACGATGG TGCGCTCCGC CGCCAAGAAC CATAAAGATG TCGCAATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG GATGACAACG AAGGATCGCT GACGCTTGCA ACCCGTTTCG ACCTCGCCAT CAAAGCCTTC GAACACACTG CCGCCTACGA CAGCATGATT GCCAACTACT TCGGCAGCAT GGTTCCGGCT TATCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCAC GCACGCTGAA CCTGAACTTC ATTAAGAAGC AGGATATGCG TTACGGCGAG AACAGCCACC AGCAGGCTGC CTTCTATATA GAAGAGAATG TGAAAGAAGC CTCCGTTGCT ACCGCAACCC AGGTTCAGGG TAAAGCCCTC TCTTATAACA ACATCGCCGA TACCGATGCG GCGCTGGAGT GCGTGAAAGA GTTCGCCGAG CCGGCATGTG TGATTGTGAA GCACGCCAAC CCTTGCGGCG TGGCTATCGG CAATTCAATT CTTGATGCTT ACGATCGCGC GTACAAAACC GACCCGACCT CCGCATTCGG CGGCATCATT GCCTTTAACC GCGAGCTGGA TGCGGAAACC GCACAGGCCA TCATTTCTCG TCAGTTTGTT GAAGTGATTA TTGCGCCGTC CGCCAGCGAA GAAGCCCTGA AAATCACCGC CGCCAAACAG AATGTACGCG TTCTGACCTG CGGTCAGTGG GGCGAGCGTG TTCCGGGTCT TGATTTCAAA CGCGTGAACG GCGGTCTGCT GGTTCAGGAT CGCGATCTGG GCATGGTCGG TGCGGAAGAA CTGCGCGTCG TCACCAAACG TCAGCCGACC GAACAGGAAC TGCGTGATGC GCTGTTCTGC TGGAAAGTGG CGAAGTTCGT GAAATCCAAT GCTATCGTCT ATGCCAAAAA CAATATGACC ATCGGTATTG GCGCGGGCCA GATGAGCCGC GTGTACTCCG CGAAAATCGC CGGTATTAAA GCTGCCGATG AAGGCCTGGA AGTGAAAGGT TCCTCGATGG CTTCTGACGC GTTCTTCCCG TTCCGCGACG GTATTGATGC CGCCGCCGCT GCGGGCGTGA CCTGCGTAAT CCAGCCTGGC GGTTCCATCC GTGATGACGA AGTGATTGCC GCCGCCGACG AGCACGGTAT TGCGATGCTC TTCACCGATA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQRRPVRRA LLSVSDKAGI VEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EEHQIQPIDM VVVNLYPFAQ TVAREGCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DDNEGSLTLA TRFDLAIKAF EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI EENVKEASVA TATQVQGKAL SYNNIADTDA ALECVKEFAE PACVIVKHAN PCGVAIGNSI LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASE EALKITAAKQ NVRVLTCGQW GERVPGLDFK RVNGGLLVQD RDLGMVGAEE LRVVTKRQPT EQELRDALFC WKVAKFVKSN AIVYAKNNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SSMASDAFFP FRDGIDAAAA AGVTCVIQPG GSIRDDEVIA AADEHGIAML FTDMRHFRH
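
The sequences above are shown in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The sketch below is one hedged way to do that in plain Python: it strips the spacing, computes GC content, and translates a CDS. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAA even though the gene is on the minus strand) and relies on the fact that translation table 11 uses the standard codon assignments for elongation. The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; table 11 shares these
# (it differs from the standard table only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGCAACAAC GTCGTCCAGT CCGCCGCGCT"
print(translate(prefix))  # prints "MQQRRPVRRA"
```

The translated prefix matches the first block of the protein sequence. Passing the full "Gene sequence" value to `gc_percent` and `translate` allows the same comparison against the "GC content" and "Protein sequence" fields.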