Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4550 |
Symbol | purH |
ID | 5588057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4542948 |
End bp | 4544537 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640928166 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001465502 |
Protein GI | 157156680 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000795546 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC GTCGTCCAAT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA AGCCGGTATC GTCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC AGGGGGCACT GCCCGTCTGT TAGCAGAAAA AGGTCTGCCG GTAACCGAAG TTTCCGATTA CACCGGTTTC CCGGAGATGA TGGATGGACG CGTGAAGACC CTGCATCCGA AAGTACATGG TGGCATTCTG GGCCGTCGCG GCCAGGACGA TGCCATTATG GAAGAACATC AGATCCAGCC TATCGATATG GTGGTTGTTA ACCTGTATCC GTTCGCCAAG ACCGTTGCCC GTGAAGGTTG CTCGCTGGAA GATGCGGTTG AGAACATCGA TATCGGCGGC CCAACGATGG TGCGCTCCGC CGCCAAGAAC CATAAAGATG TCGCAATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG GATGACAACG AAGGATCGCT GACGCTTGCA ACCCGTTTCG ACCTCGCCAT CAAAGCCTTC GAACACACTG CCGCCTACGA CAGCATGATT GCCAACTACT TCGGCAGCAT GGTTCCGGCT TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCAC GCACGCTGAA CCTGAACTTC ATTAAGAAGC AGGATATGCG TTACGGCGAG AACAGCCACC AGCAGGCTGC CTTCTATATA GAAGAGAATG TGAAAGAAGC CTCCGTTGCT ACCGCAACCC AGGTTCAGGG TAAAGCCCTC TCTTATAACA ACATCGCCGA TACCGATGCG GCGCTGGAGT GCGTGAAAGA GTTCGCCGAG CCGGCATGTG TGATTGTGAA GCACGCCAAC CCTTGCGGCG TGGCTATCAG CAATTCTATT CTTGATGCTT ACGATCGCGC GTACAAAACC GACCCAACCT CCGCATTCGG CGGCATCATT GCCTTTAACC GCGAGCTGGA TGCGGAAACC GCACAGGCCA TCATTTCTCG TCAGTTTGTC GAAGTGATTA TTGCGCCTTC CGCCAGCGAA GAAGCCCTGA AAATCACCGC CGCCAAGCAG AACGTACGCG TTCTGACCTG CGGTCAGTGG GGCGAGCGTG TTCCGGGTCT TGATTTCAAA CGCGTGAACG GCGGTCTGCT GGTTCAGGAT CGTGACCTGG GCATGGTCGG TGCGGAAGAA CTGCGCGTCG TCACCCAACG TCAGCCGACC GAACAGGAAC TGCGTGATGC GCTGTTCTGC TGGAAAGTGG CGAAGTTCGT GAAATCCAAC GCTATCGTCT ATGCCAAAAA CAATATGACC ATCGGGATTG GCGCGGGCCA GATGAGCCGC GTGTACTCCG CGAAAATCGC CGGTATTAAA GCTGCCGATG AAGGCCTGGA AGTGAAAGGT TCCTCGATGG CTTCTGACGC GTTCTTCCCG TTCCGCGACG GTATTGATGC CGCCGCCGCT GCAGGCGTGA CCTGCGTAAT CCAGCCTGGC GGTTCTATCC GTGATGACGA AGTGATTGCC GCCGCCGACG AGCACGGTAT TGCGATGCTC TTCACCGACA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQRRPIRRA LLSVSDKAGI VEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EEHQIQPIDM VVVNLYPFAK TVAREGCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DDNEGSLTLA TRFDLAIKAF EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI EENVKEASVA TATQVQGKAL SYNNIADTDA ALECVKEFAE PACVIVKHAN PCGVAISNSI LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASE EALKITAAKQ NVRVLTCGQW GERVPGLDFK RVNGGLLVQD RDLGMVGAEE LRVVTQRQPT EQELRDALFC WKVAKFVKSN AIVYAKNNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SSMASDAFFP FRDGIDAAAA AGVTCVIQPG GSIRDDEVIA AADEHGIAML FTDMRHFRH
|
| |