Gene EcE24377A_4550 details


Gene Information

Locus tag: EcE24377A_4550
Symbol: purH
ID: 5588057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4542948
End bp: 4544537
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 56%
IMG OID: 640928166
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001465502
Protein GI: 157156680
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
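
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends with one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(4542948, 4544537)
print(length_bp)                  # 1590
print(protein_length(length_bp))  # 529
```

Both values agree with the Gene Length and Protein Length fields of this record.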


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000795546
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACAAC GTCGTCCAAT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA AGCCGGTATC 
GTCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC AGGGGGCACT
GCCCGTCTGT TAGCAGAAAA AGGTCTGCCG GTAACCGAAG TTTCCGATTA CACCGGTTTC
CCGGAGATGA TGGATGGACG CGTGAAGACC CTGCATCCGA AAGTACATGG TGGCATTCTG
GGCCGTCGCG GCCAGGACGA TGCCATTATG GAAGAACATC AGATCCAGCC TATCGATATG
GTGGTTGTTA ACCTGTATCC GTTCGCCAAG ACCGTTGCCC GTGAAGGTTG CTCGCTGGAA
GATGCGGTTG AGAACATCGA TATCGGCGGC CCAACGATGG TGCGCTCCGC CGCCAAGAAC
CATAAAGATG TCGCAATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG
GATGACAACG AAGGATCGCT GACGCTTGCA ACCCGTTTCG ACCTCGCCAT CAAAGCCTTC
GAACACACTG CCGCCTACGA CAGCATGATT GCCAACTACT TCGGCAGCAT GGTTCCGGCT
TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCAC GCACGCTGAA CCTGAACTTC
ATTAAGAAGC AGGATATGCG TTACGGCGAG AACAGCCACC AGCAGGCTGC CTTCTATATA
GAAGAGAATG TGAAAGAAGC CTCCGTTGCT ACCGCAACCC AGGTTCAGGG TAAAGCCCTC
TCTTATAACA ACATCGCCGA TACCGATGCG GCGCTGGAGT GCGTGAAAGA GTTCGCCGAG
CCGGCATGTG TGATTGTGAA GCACGCCAAC CCTTGCGGCG TGGCTATCAG CAATTCTATT
CTTGATGCTT ACGATCGCGC GTACAAAACC GACCCAACCT CCGCATTCGG CGGCATCATT
GCCTTTAACC GCGAGCTGGA TGCGGAAACC GCACAGGCCA TCATTTCTCG TCAGTTTGTC
GAAGTGATTA TTGCGCCTTC CGCCAGCGAA GAAGCCCTGA AAATCACCGC CGCCAAGCAG
AACGTACGCG TTCTGACCTG CGGTCAGTGG GGCGAGCGTG TTCCGGGTCT TGATTTCAAA
CGCGTGAACG GCGGTCTGCT GGTTCAGGAT CGTGACCTGG GCATGGTCGG TGCGGAAGAA
CTGCGCGTCG TCACCCAACG TCAGCCGACC GAACAGGAAC TGCGTGATGC GCTGTTCTGC
TGGAAAGTGG CGAAGTTCGT GAAATCCAAC GCTATCGTCT ATGCCAAAAA CAATATGACC
ATCGGGATTG GCGCGGGCCA GATGAGCCGC GTGTACTCCG CGAAAATCGC CGGTATTAAA
GCTGCCGATG AAGGCCTGGA AGTGAAAGGT TCCTCGATGG CTTCTGACGC GTTCTTCCCG
TTCCGCGACG GTATTGATGC CGCCGCCGCT GCAGGCGTGA CCTGCGTAAT CCAGCCTGGC
GGTTCTATCC GTGATGACGA AGTGATTGCC GCCGCCGACG AGCACGGTAT TGCGATGCTC
TTCACCGACA TGCGCCACTT CCGCCATTAA
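
The gene sequence above is printed in blocks of ten bases separated by spaces, so the layout whitespace has to be removed before the sequence is used. The following is a minimal sketch of that cleanup together with a length and GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


first_line = "ATGCAACAAC GTCGTCCAAT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA AGCCGGTATC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # 0.55
```

Applied to the full sequence, the same two functions can be compared against the Gene Length and GC content fields of the record.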
 
Protein sequence
MQQRRPIRRA LLSVSDKAGI VEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF 
PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EEHQIQPIDM VVVNLYPFAK TVAREGCSLE
DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DDNEGSLTLA TRFDLAIKAF
EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI
EENVKEASVA TATQVQGKAL SYNNIADTDA ALECVKEFAE PACVIVKHAN PCGVAISNSI
LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASE EALKITAAKQ
NVRVLTCGQW GERVPGLDFK RVNGGLLVQD RDLGMVGAEE LRVVTQRQPT EQELRDALFC
WKVAKFVKSN AIVYAKNNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SSMASDAFFP
FRDGIDAAAA AGVTCVIQPG GSIRDDEVIA AADEHGIAML FTDMRHFRH