Gene PA14_64200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_64200 
SymbolpurH 
ID4383323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp5719711 
End bp5721318 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content66% 
IMG OID639327782 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_793320 
Protein GI116053002 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.195838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC AAACCACCCG CCTGCCCATC CGCCGCGCGC TGATCAGCGT TTCCGACAAG 
ACCGGCGTCG TCGACTTCGC CCGTGAGCTG GTCGCCCTCG GCGTGGAAAT CCTTTCCACC
GGCGGCACCT ACAAGCTGCT CCGGGACAAC GGCATCTCTG CCGTGGAAGT GGCCGACTAC
ACCGGCTTCC CGGAAATGAT GGACGGTCGG GTGAAGACCC TGCACCCGAA GGTGCATGGC
GGTATCCTCG GTCGCCGCGA CCTCGACGGC GCGGTCATGG AGCAGCACGG CATCAAGCCG
ATCGACCTGG TGGCGGTCAA CCTGTATCCC TTCGAGGCCA CGGTGGTCAG GCCCGACTGC
GACCTGCCTA CCGCCATCGA GAACATCGAT ATCGGCGGGC CGACCATGGT CCGTTCGGCG
GCGAAGAACC ACAAGGATGT CGCCATCGTG GTCAATGCCG GCGACTACGC CGCCGTGATC
GAATCCCTCA AGGCCGGCGG CCTGACCTAC GCCCAGCGTT TCGACCTGGC CCTCAAGGCG
TTCGAGCACA CCTCCGCCTA CGACGGCATG ATCGCCAACT ACCTGGGTAC CATCGACCAG
ACCCGCGACA CCCTCGGCAC CGCCGACCGC GGCGCCTTCC CGCGCACCTT CAACAGCCAG
TTCGTCAAGG CCCAGGAAAT GCGCTACGGC GAGAACCCGC ACCAGAGCGC GGCGTTCTAC
GTCGAGGCGA AGAAGGGCGA GGCCAGCGTC TCCACCGCCA TCCAGTTGCA AGGCAAGGAG
TTGTCGTTCA ACAACGTCGC CGACACCGAC GCCGCCCTGG AATGCGTGAA GAGCTTCCTC
AAGCCGGCCT GCGTGATCGT CAAGCATGCC AACCCCTGCG GCGTCGCCGT GGTGCCGGAA
GACGAAGGCG GCATCCGCAA GGCCTACGAC CTGGCCTACG CCACCGATAG CGAATCGGCG
TTCGGCGGCA TCATCGCCTT CAACCGCGAG CTGGACGGCG AAACCGCCAA GGCCATCGTC
GAGCGCCAGT TCGTCGAGGT GATCATCGCG CCGAAGATTT CCGCCGCCGC CCGCGAGGTG
GTCGCCGCCA AGGCCAACGT ACGCCTGCTC GAATGCGGCG AATGGCCAGC CGAGCGCGCC
CCGGGCTGGG ACTTCAAGCG GGTCAACGGC GGCCTGCTGG TACAGAGCCG CGACATCGGC
ATGATCAAGG CCGAGGACCT GAAGATCGTC ACTCGCCGCG CACCCACCGA GCAGGAGATC
CACGACCTGA TCTTCGCCTG GAAGGTGGCC AAGTTCGTCA AGTCCAACGC CATCGTCTAC
GCCAGGAACC GCCAGACCGT CGGCGTCGGC GCCGGCCAGA TGAGCCGGGT CAACTCCGCA
CGGATCGCCG CGATCAAGGC CGAGCACGCC GGCCTGGAAG TGAAAGGCGC GGTGATGGCC
TCGGACGCCT TCTTCCCGTT CCGCGACGGC ATCGACAACG CGGCCAAGGC CGGCATCACC
GCGGTGATCC AGCCGGGCGG CTCGATGCGC GACAGCGAAG TGATCGCGGC GGCCGACGAG
GCGGATATCG CGATGGTGTT CACTGGCATG CGCCATTTCC GCCATTGA
 
Protein sequence
MTDQTTRLPI RRALISVSDK TGVVDFAREL VALGVEILST GGTYKLLRDN GISAVEVADY 
TGFPEMMDGR VKTLHPKVHG GILGRRDLDG AVMEQHGIKP IDLVAVNLYP FEATVVRPDC
DLPTAIENID IGGPTMVRSA AKNHKDVAIV VNAGDYAAVI ESLKAGGLTY AQRFDLALKA
FEHTSAYDGM IANYLGTIDQ TRDTLGTADR GAFPRTFNSQ FVKAQEMRYG ENPHQSAAFY
VEAKKGEASV STAIQLQGKE LSFNNVADTD AALECVKSFL KPACVIVKHA NPCGVAVVPE
DEGGIRKAYD LAYATDSESA FGGIIAFNRE LDGETAKAIV ERQFVEVIIA PKISAAAREV
VAAKANVRLL ECGEWPAERA PGWDFKRVNG GLLVQSRDIG MIKAEDLKIV TRRAPTEQEI
HDLIFAWKVA KFVKSNAIVY ARNRQTVGVG AGQMSRVNSA RIAAIKAEHA GLEVKGAVMA
SDAFFPFRDG IDNAAKAGIT AVIQPGGSMR DSEVIAAADE ADIAMVFTGM RHFRH