Gene Ppha_2259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_2259 
SymbolpurH 
ID6462198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp2343121 
End bp2344698 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content52% 
IMG OID642728447 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002019071 
Protein GI194337277 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00898643 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATC CTGTCATCAA GCGGGCGCTG GTCTCTGTAT CTGATAAAAC CGGTATTGTT 
GAATTTTGCC GGGAGTTGAG TGGCATGGGC GTTGAAATTT TCTCAACAGG GGGTACCTTG
AAGTCGCTTC AGGATTCAGG AGTCAGCGCA TCCTCCATCT CCACCATTAC CGGATTTCCG
GAAATCATGG ATGGACGGGT CAAAACCCTG CACCCGAAAA TACATGGTGG ACTGCTTGCC
GTAAGGGAGA ATCCGGAGCA TGTCAAACAG GCTACGGAGA ACGGTATCAG CTTCATTGAT
CTTGTTGTCG TCAACCTTTA TCCTTTCGAG GCTACAGTGG CAAGGCCGGA TGTAACCTTC
GAGGATGCTA TAGAGAATAT TGATATTGGT GGTCCATCCA TGCTGCGCAG TGCAGCCAAG
AACAACGAAT CGGTAACGGT GGTAACCGAT AGTGCCGACT ACGCTCTTGT GCTGCAGGAG
ATGCGTGAGC ATAACGGTGC GACAAAAAGA ACGACCCGTC TGACGCTTGC CCTGAAAGTA
TTTGAACTCA CCTCCCGTTA TGACCGTGCC ATTGCCTCTT ACCTTGCCGG AGCAGTCGCA
GGAGAGCAGC AGGGTGCGGC CTCAAAGATG ACGGTCACTC TTGAGCGTGA GCTCGATATG
CGTTACGGTG AAAATCCGCA CCAGAGCGCA GGGCTTTACC GCCTGACCGA TGAGAACGGA
ACACGCTCCT TTGGCGACTT TTTCGAGAAG CTGCATGGCA AGGAGCTCTC CTACAATAAT
ATGCTCGACA TCGCTGCAGC AGTCTCCCTG ATTGAGGAGT TCCGTGGAGA GGAGCCGACA
GTGGTCATTG TCAAACACAC CAACCCCTGT GGTGTCGCTC AGGCCCCGAC CCTTGCCGAA
GCCTACCGCA GGGCATTTTC AACCGATACC CAGGCTCCTT TTGGTGGAAT TATCTCCTTT
AACCGTCCTC TCGATATGGA GGCAGCAAAG GCGGTCAATG AAATTTTCAC CGAGATTCTC
ATTGCTCCCG CTTTTGAGGA TGGCGTGCTT GAGATGCTGA TGAAGAAAAA AGATCGCAGG
CTGGTGCTGC AGACGAACGC TTTGCCCAAA GGTGGCTGGG AGTTCAAGTC AACCCCGTTC
GGGATGCTTG TTCAGGAACG TGACAGCAAA ATCGTTGCAA AAGAGGATCT GACGGTTGTA
ACCAAACGGC AGCCGACAGA AGAGGAGATT GCCGACCTGA TGTTTGCCTG GAAAATCTGC
AAGCATATCA AGTCGAACAC CATTCTCTAT GTCAAGAATC GTCAGACATA CGGCGTCGGC
GCTGGACAGA TGTCGCGCGT TGACTCCTCC AAAATTGCAC GTTGGAAGGC CTCTGAAGTT
AGTCTCGACC TGCATGGATC GGTTGTTGCT TCGGATGCGT TTTTCCCCTT CGCTGATGGC
CTGCTTGCCG CTGCCGAAGC TGGTGTTACC GCAGTCATTC AGCCTGGTGG CTCCATTCGC
GATAACGAGG TGATTGAAGC AGCCGATGCC AACAACCTTG CGATGGTCTT TACCGGAATG
CGTCACTTCA AGCATTGA
 
Protein sequence
MSDPVIKRAL VSVSDKTGIV EFCRELSGMG VEIFSTGGTL KSLQDSGVSA SSISTITGFP 
EIMDGRVKTL HPKIHGGLLA VRENPEHVKQ ATENGISFID LVVVNLYPFE ATVARPDVTF
EDAIENIDIG GPSMLRSAAK NNESVTVVTD SADYALVLQE MREHNGATKR TTRLTLALKV
FELTSRYDRA IASYLAGAVA GEQQGAASKM TVTLERELDM RYGENPHQSA GLYRLTDENG
TRSFGDFFEK LHGKELSYNN MLDIAAAVSL IEEFRGEEPT VVIVKHTNPC GVAQAPTLAE
AYRRAFSTDT QAPFGGIISF NRPLDMEAAK AVNEIFTEIL IAPAFEDGVL EMLMKKKDRR
LVLQTNALPK GGWEFKSTPF GMLVQERDSK IVAKEDLTVV TKRQPTEEEI ADLMFAWKIC
KHIKSNTILY VKNRQTYGVG AGQMSRVDSS KIARWKASEV SLDLHGSVVA SDAFFPFADG
LLAAAEAGVT AVIQPGGSIR DNEVIEAADA NNLAMVFTGM RHFKH