Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_3843 |
Symbol | purH |
ID | 5386555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 4329535 |
End bp | 4331124 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640866868 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001402794 |
Protein GI | 153949966 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000000026179 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC GCCGTCCAAT CCGCCGTGCT CTACTCAGTG TGTCTGACAA AGCAGGTATC ATCGAATTCG CCCAAGCACT TTCTCAACGC GGTATCGAGT TACTTTCCAC CGGTGGGACT GCCCGCCTGC TGGCTGATGC TGGTTTACCC GTTACCGAAG TGTCTGACTA CACCGGCTTC CCGGAAATGA TGGATGGACG TGTGAAGACT TTGCATCCAA AGGTGCATGG TGGGATTTTA GGTCGTCGTG GCCAAGATGA TGGCATTATG GCTCAACATG GCATTCAACC AATTGATATT GTCGTCGTTA ATTTATATCC CTTCGCCCAG ACGGTTGCCC GCCCGGATTG CTCGCTGGAA GATGCGGTTG AGAATATTGA TATTGGTGGC CCAACCATGG TTCGCTCTGC GGCCAAGAAC CATAAAGATG TCGCCATCGT GGTGAAGAGT AGCGACTACC CCGCCATTAT TACTGAGCTT GATAATAATG ATGGTTCGTT GACTTACCCC ACCCGTTTCA ATCTGGCCAT TAAAGCTTTC GAACACACCG CCGCCTATGA CAGCATGATC GCCAACTACT TCGGTACGCT GGTGCCACCT TATCATGGTG ATACGGAACA GCCTTCCGGC CACTTCCCTC GCACCCTAAA TCTTAACTAT ATAAAGAAGC AGGATATGCG TTACGGTGAA AACAGCCACC AGCAAGCTGC CTTCTATATA GAAGAAGATG TCAAAGAGGC ATCCGTTGCC ACTGCCCAGC AATTACAAGG GAAAGCCCTC TCTTATAACA ATATTGCGGA TACCGATGCC GCGCTGGAAT GCGTGAAAGA GTTCAGTGAA CCAGCCTGTG TGATCGTTAA ACATGCCAAC CCATGCGGTG TGGCTATCGG TGATTCTATT CTTGCCGCTT ATGAACGTGC CTATCAAACC GATCCAACCT CAGCTTTCGG TGGCATCATC GCCTTTAACC GTGAATTGGA TGCAGCAACG GCCAGCGCGA TCATCAGCCG CCAGTTTGTC GAAGTGATCA TTGCGCCAAC AGTCAGCTCT GATGCATTGG CATTGCTTGC AGCTAAACAA AATGTCCGAG TCCTGACTTG TGGCCAGTGG CAAGCACGTT CAGCAGGTTT AGATTTCAAA CGTGTTAATG GGGGTTTGCT GGTACAAGAA CGCGATTTAG GTATGGTGAC GGCGGCCGAC CTTCGCGTGG TTTCCAAGCG TCAGCCTACC GAACAGGAAC TGCGTGATGC GCTGTTCTGC TGGAAAGTGG CTAAGTTTGT TAAATCCAAT GCGATTGTCT ATGCCCGCGA TAACATGACA ATCGGTATAG GTGCCGGCCA AATGAGCCGC GTGTACTCTG CGAAAATAGC CGGTATCAAG GCCGCAGATG AAGGGCTGGA AGTGGCTGGC TCAGCCATGG CCTCTGATGC CTTCTTCCCG TTCCGTGATG GTATTGATGC CGCCGCGGCT GTGGGCATTA CTTGTGTCAT CCAACCGGGC GGCTCAATTC GTGATGATGA AGTCATCGCG GCTGCTGATG AACACAGTAT TGCCATGATC TTCACCGACA TGCGCCATTT CCGTCATTAA
|
Protein sequence | MQQRRPIRRA LLSVSDKAGI IEFAQALSQR GIELLSTGGT ARLLADAGLP VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDGIM AQHGIQPIDI VVVNLYPFAQ TVARPDCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYPAIITEL DNNDGSLTYP TRFNLAIKAF EHTAAYDSMI ANYFGTLVPP YHGDTEQPSG HFPRTLNLNY IKKQDMRYGE NSHQQAAFYI EEDVKEASVA TAQQLQGKAL SYNNIADTDA ALECVKEFSE PACVIVKHAN PCGVAIGDSI LAAYERAYQT DPTSAFGGII AFNRELDAAT ASAIISRQFV EVIIAPTVSS DALALLAAKQ NVRVLTCGQW QARSAGLDFK RVNGGLLVQE RDLGMVTAAD LRVVSKRQPT EQELRDALFC WKVAKFVKSN AIVYARDNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVAG SAMASDAFFP FRDGIDAAAA VGITCVIQPG GSIRDDEVIA AADEHSIAMI FTDMRHFRH
|
| |