Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4583 |
Symbol | purH |
ID | 6870984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4422902 |
End bp | 4424491 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642787490 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002218092 |
Protein GI | 198244601 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00178575 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC ATCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC GGGGGGCACC GCCCGCCTGT TAGCAGAAAA AGGCCTGGCG GTGACCGAAG TTTCCGATTA CACCGGTTTC CCGGAAATGA TGGATGGACG CGTAAAGACC CTGCATCCAA AAGTCCACGG CGGCATCCTC GGTCGTCGCG GCCAGGACGA TGCCATTATG GAACAGCACC ACATCGCCCC TATCGATATG GTTGTCGTTA ACCTGTATCC GTTCGCCGAA ACCGTGGCAC GCGAAGGCTG CTCGCTGGAA GATGCGGTAG AGAACATTGA TATCGGCGGC CCGACCATGG TGCGCTCTGC TGCTAAGAAC CATAAAGACG TCGCCATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG GATGCTAACG AAGGTTCTCT GACCCTCGAC ACCCGTTTCG ATCTCGCGAT TAAAGCCTTC GAACACACCG CCGCCTACGA CAGCATGATC GCTAACTACT TCGGCAGCAT GGTTCCGGCC TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCGC GTACGCTGAA TCTGAACTTC ATTAAGAAGC AGGATATGCG CTATGGCGAG AACAGCCACC AGCAGGCAGC CTTCTATATA GAAGAAAATG TGAAAGAAGC GTCCGTTGCC ACCGCACAGC AGGTTCAGGG CAAAGCGCTT TCTTACAACA ACATCGCTGA TACCGACGCG GCGCTGGAGT GTGTGAAAGA GTTCAACGAG CCAGCCTGCG TAATCGTCAA GCACGCTAAC CCGTGCGGCG TGGCGGTGAG CACCACTATT CTCGACGCTT ACGACCGTGC GTATAAAACC GACCCAACCT CCGCGTTCGG CGGCATTATC GCCTTCAACC GCGAACTGGA TGCCGAAACC GCGCAGGCCA TCATCTCCCG CCAGTTCGTG GAAGTGATCA TCGCCCCGTC CGCGACCGAA GATGCGCTGA AAATCACGGC TGCCAAGCAG AATGTGCGCG TACTGACCTG TGGTCAGTGG GCACAGCGCG TACCTGGTCT GGACTTCAAA CGCGTTAACG GCGGCCTGCT GGTTCAGGAT CGTGACCTGG GTATGGTGAG CGAAGCTGAA CTGCGCGTGG TTTCCAAACG CCAGCCGACC GAGCAGGAGC TGCGTGACGC GCTGTTCTGC TGGAAGGTAG CCAAGTTCGT GAAATCCAAC GCCATTGTGT ATGCCAAAGA GAACATGACT ATCGGCATAG GCGCAGGCCA GATGAGCCGC GTGTACTCCG CGAAAATCGC TGGCATTAAA GCGGCTGACG AAGGTCTGGA AGTGAAAGGC TCCGCGATGG CCTCTGACGC CTTCTTCCCG TTCCGTGACG GTATTGATGC CGCTGCCGCT GTCGGCGTGA GCTGCGTTAT CCAGCCTGGC GGCTCTATCC GTGATGATGA AGTCATTGCC GCCGCCGACG AACACGGCAT TGCGATGATC TTCACCGACA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQRRPVRRA LLSVSDKAGI IEFAQALSAR GVELLSTGGT ARLLAEKGLA VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EQHHIAPIDM VVVNLYPFAE TVAREGCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DANEGSLTLD TRFDLAIKAF EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI EENVKEASVA TAQQVQGKAL SYNNIADTDA ALECVKEFNE PACVIVKHAN PCGVAVSTTI LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSATE DALKITAAKQ NVRVLTCGQW AQRVPGLDFK RVNGGLLVQD RDLGMVSEAE LRVVSKRQPT EQELRDALFC WKVAKFVKSN AIVYAKENMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP FRDGIDAAAA VGVSCVIQPG GSIRDDEVIA AADEHGIAMI FTDMRHFRH
|
| |