Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A4387 |
Symbol | purH |
ID | 6516108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 4258468 |
End bp | 4260057 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642749338 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002117077 |
Protein GI | 194738357 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00813885 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.752552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC ATCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAAC TGCTGTCTAC GGGGGGCACC GCCCGCCTGT TAGCAGAAAA AGGCCTGCCG GTGACCGAAG TTTCCGATTA CACCGGTTTC CCGGAAATGA TGGATGGACG CGTAAAGACC CTGCATCCAA AAGTACACGG CGGCATCCTC GGCCGTCGCG GCCAGGACGA TGCCATTATG GAACAGCACC ACATCGCCCC TATCGATATG GTTGTCGTTA ACCTGTATCC GTTCGCCGAA ACCGTTGCAC GCGTTGGTTG CTCGCTGGAA GATGCAGTAG AGAACATCGA TATCGGCGGC CCGACCATGG TGCGCTCCGC CGCGAAGAAC CATAAAGACG TGGCCATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG GACGCTAACG AAGGTTCTTT GACCCTCGAC ACCCGTTTCG ATCTCGCGAT TAAAGCCTTC GAACACACCG CCGCCTACGA CAGCATGATC GCTAACTACT TCGGCAGCAT GGTTCCGGCC TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCGC GTACGCTGAA CCTGAACTTC ATTAAGAAGC AGGATATGCG CTACGGCGAG AACAGCCACC AGCAAGCTGC CTTCTATATA GAAGAGAATG TGAAAGAAGC GTCCGTTGCC ACCGCACAAC AGGTGCAGGG CAAAGCGCTC TCTTATAACA ACATCGCTGA TACCGACGCG GCGCTGGAGT GTGTGAAAGA GTTCAACGAG CCAGCCTGCG TAATCGTCAA GCACGCTAAC CCGTGCGGCG TGGCGGTAAG TACCTCTATT CTCGACGCTT ACGATCGCGC CTATAAAACC GACCCGACCT CCGCGTTCGG CGGCATTATC GCCTTTAACC GTGAGCTGGA TGCGGAAACC GCACAGGCCA TCATTTCTCG TCAGTTTGTC GAAGTGATCA TCGCCCCATC CGCAAGCGAA GAAGCGCTGA AAATCACCGC TGCCAAGCAG AACGTCCGTG TTCTGACCTG CGGCCAATGG GCAAGCCGCG TTCCGGGCCT GGATTTCAAA CGCGTTAACG GTGGCCTGCT GGTTCAGGAC AGGGATCTGG GTATGGTGAG TGAAGCTGAA CTGCGCGTGG TGTCCAAACG CCAGCCGACC GAGCAGGAGC TGCGTGACGC GCTGTTCTGC TGGAAGGTAG CCAAGTTCGT GAAATCCAAC GCTATTGTGT ATGCCAAAGA GAACATGACC ATCGGTATAG GCGCAGGCCA GATGAGCCGC GTGTACTCCG CCAAAATCGC CGGGATTAAA GCCGCTGATG AAGGTCTGGA AGTGAAAGGC TCAGCCATGG CTTCCGACGC GTTCTTCCCG TTCCGCGATG GTATTGATGC CGCTGCCGCT GTCGGCGTGA GCTGCGTTAT CCAGCCAGGC GGTTCTATCC GTGATGATGA AGTCATTGCC GCTGCCGATG AACACGGTAT TGCGATGATC TTCACCGACA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQRRPVRRA LLSVSDKAGI IEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EQHHIAPIDM VVVNLYPFAE TVARVGCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DANEGSLTLD TRFDLAIKAF EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI EENVKEASVA TAQQVQGKAL SYNNIADTDA ALECVKEFNE PACVIVKHAN PCGVAVSTSI LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASE EALKITAAKQ NVRVLTCGQW ASRVPGLDFK RVNGGLLVQD RDLGMVSEAE LRVVSKRQPT EQELRDALFC WKVAKFVKSN AIVYAKENMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP FRDGIDAAAA VGVSCVIQPG GSIRDDEVIA AADEHGIAMI FTDMRHFRH
|
| |