Gene YpsIP31758_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3843 
SymbolpurH 
ID5386555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4329535 
End bp4331124 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content51% 
IMG OID640866868 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001402794 
Protein GI153949966 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000026179 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACAAC GCCGTCCAAT CCGCCGTGCT CTACTCAGTG TGTCTGACAA AGCAGGTATC 
ATCGAATTCG CCCAAGCACT TTCTCAACGC GGTATCGAGT TACTTTCCAC CGGTGGGACT
GCCCGCCTGC TGGCTGATGC TGGTTTACCC GTTACCGAAG TGTCTGACTA CACCGGCTTC
CCGGAAATGA TGGATGGACG TGTGAAGACT TTGCATCCAA AGGTGCATGG TGGGATTTTA
GGTCGTCGTG GCCAAGATGA TGGCATTATG GCTCAACATG GCATTCAACC AATTGATATT
GTCGTCGTTA ATTTATATCC CTTCGCCCAG ACGGTTGCCC GCCCGGATTG CTCGCTGGAA
GATGCGGTTG AGAATATTGA TATTGGTGGC CCAACCATGG TTCGCTCTGC GGCCAAGAAC
CATAAAGATG TCGCCATCGT GGTGAAGAGT AGCGACTACC CCGCCATTAT TACTGAGCTT
GATAATAATG ATGGTTCGTT GACTTACCCC ACCCGTTTCA ATCTGGCCAT TAAAGCTTTC
GAACACACCG CCGCCTATGA CAGCATGATC GCCAACTACT TCGGTACGCT GGTGCCACCT
TATCATGGTG ATACGGAACA GCCTTCCGGC CACTTCCCTC GCACCCTAAA TCTTAACTAT
ATAAAGAAGC AGGATATGCG TTACGGTGAA AACAGCCACC AGCAAGCTGC CTTCTATATA
GAAGAAGATG TCAAAGAGGC ATCCGTTGCC ACTGCCCAGC AATTACAAGG GAAAGCCCTC
TCTTATAACA ATATTGCGGA TACCGATGCC GCGCTGGAAT GCGTGAAAGA GTTCAGTGAA
CCAGCCTGTG TGATCGTTAA ACATGCCAAC CCATGCGGTG TGGCTATCGG TGATTCTATT
CTTGCCGCTT ATGAACGTGC CTATCAAACC GATCCAACCT CAGCTTTCGG TGGCATCATC
GCCTTTAACC GTGAATTGGA TGCAGCAACG GCCAGCGCGA TCATCAGCCG CCAGTTTGTC
GAAGTGATCA TTGCGCCAAC AGTCAGCTCT GATGCATTGG CATTGCTTGC AGCTAAACAA
AATGTCCGAG TCCTGACTTG TGGCCAGTGG CAAGCACGTT CAGCAGGTTT AGATTTCAAA
CGTGTTAATG GGGGTTTGCT GGTACAAGAA CGCGATTTAG GTATGGTGAC GGCGGCCGAC
CTTCGCGTGG TTTCCAAGCG TCAGCCTACC GAACAGGAAC TGCGTGATGC GCTGTTCTGC
TGGAAAGTGG CTAAGTTTGT TAAATCCAAT GCGATTGTCT ATGCCCGCGA TAACATGACA
ATCGGTATAG GTGCCGGCCA AATGAGCCGC GTGTACTCTG CGAAAATAGC CGGTATCAAG
GCCGCAGATG AAGGGCTGGA AGTGGCTGGC TCAGCCATGG CCTCTGATGC CTTCTTCCCG
TTCCGTGATG GTATTGATGC CGCCGCGGCT GTGGGCATTA CTTGTGTCAT CCAACCGGGC
GGCTCAATTC GTGATGATGA AGTCATCGCG GCTGCTGATG AACACAGTAT TGCCATGATC
TTCACCGACA TGCGCCATTT CCGTCATTAA
 
Protein sequence
MQQRRPIRRA LLSVSDKAGI IEFAQALSQR GIELLSTGGT ARLLADAGLP VTEVSDYTGF 
PEMMDGRVKT LHPKVHGGIL GRRGQDDGIM AQHGIQPIDI VVVNLYPFAQ TVARPDCSLE
DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYPAIITEL DNNDGSLTYP TRFNLAIKAF
EHTAAYDSMI ANYFGTLVPP YHGDTEQPSG HFPRTLNLNY IKKQDMRYGE NSHQQAAFYI
EEDVKEASVA TAQQLQGKAL SYNNIADTDA ALECVKEFSE PACVIVKHAN PCGVAIGDSI
LAAYERAYQT DPTSAFGGII AFNRELDAAT ASAIISRQFV EVIIAPTVSS DALALLAAKQ
NVRVLTCGQW QARSAGLDFK RVNGGLLVQE RDLGMVTAAD LRVVSKRQPT EQELRDALFC
WKVAKFVKSN AIVYARDNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVAG SAMASDAFFP
FRDGIDAAAA VGITCVIQPG GSIRDDEVIA AADEHSIAMI FTDMRHFRH