Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B4420 |
Symbol | purH |
ID | 6792746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 4310655 |
End bp | 4312244 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642778514 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002149084 |
Protein GI | 197248309 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000744921 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC ATCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC GGGGGGCACC GCCCGCCTGT TAGCAGAAAA AGGCCTGCCG GTGACCGAAG TTTCCGATTA CACCGGTTTT CCGGAAATGA TGGATGGACG CGTAAAGACC CTGCATCCAA AAGTCCACGG CGGCATCCTC GGTCGCCGCG GCCAGGACGA TGCCATTATG GAACAGCACC ACATCGCCCC TATCGATATG GTTGTCGTTA ACCTGTATCC GTTCGCCGAA ACCGTAGCAC GCGAAGGCTG CTCGCTGGAA GATGCGGTAG AGAACATCGA TATCGGCGGC CCGACCATGG TGCGCTCTGC TGCGAAGAAC CATAAAGACG TCGCCATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG GATGCTAACG AAGGTTCTCT GACCCTCGAC ACCCGTTTCG ATCTCGCGAT TAAAGCCTTC GAACACACCG CCGCCTACGA CAGCATGATC GCTAACTACT TCGGCAGCAT GGTTCCGGCC TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCGC GTACGCTGAA TCTGAACTTC ATTAAGAAGC AGGATATGCG CTACGGCGAG AACAGCCACC AGCAGGCTGC CTTCTATATA GAAGAGAATG TGAAAGAAGC GTCCGTTGCC ACCGCACAAC AGGTTCAGGG CAAAGCGCTT TCCTACAACA ACATCGCCGA TACCGACGCG GCGCTGGAGT GTGTGAAAGC GTTCAACGAG CCAGCCTGCG TAATCGTCAA GCACGCTAAC CCGTGCGGCG TGGCGGTAAG TACCTCTATT CTCGACGCTT ACGATCGCGC CTATAAAACC GACCCGACCT CCGCGTTCGG CGGCATCATC GCCTTTAACC GCGAGCTGGA TGCTGAAACC GCGCAGGCCA TCATCTCCCG CCAGTTCGTG GAAGTGATCA TCGCCCCATC TGCGACCGAA GAAGCGCTGA AGATCACTGC CGCTAAACAG AACGTTCGCG TCCTGACCTG TGGCCAGTGG GCACAGCGCG TACCGGGCCT GGATTTCAAA CGCGTTAACG GCGGCCTGCT GGTTCAGGAC AGGGATCTGG GTATGGTGAG CGAAGCTGAA CTGCGCGTGG TCTCTAAACG CCAGCCGACC GAGCAGGAAC TGCGCGATGC GCTGTTCTGC TGGAAGGTAG CCAAGTTCGT GAAATCCAAC GCCATTGTGT ATGCCAAAGA GAACATGACT ATCGGCATTG GCGCAGGCCA GATGAGCCGC GTGTACTCCG CGAAAATCGC CGGGATTAAA GCGGCTGACG AAGGTCTGGA AGTGAAAGGC TCCGCGATGG CCTCTGACGC CTTCTTCCCG TTCCGTGACG GTATTGATGC CGCTGCCGCT GTCGGCGTGA GCTGCGTTAT CCAGCCTGGC GGTTCGATTC GCGACGAAGA GGTGATTGCC GCCGCCGACG AACACGGCAT TGCGATGATC TTCACCGACA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQRRPVRRA LLSVSDKAGI IEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EQHHIAPIDM VVVNLYPFAE TVAREGCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DANEGSLTLD TRFDLAIKAF EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI EENVKEASVA TAQQVQGKAL SYNNIADTDA ALECVKAFNE PACVIVKHAN PCGVAVSTSI LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSATE EALKITAAKQ NVRVLTCGQW AQRVPGLDFK RVNGGLLVQD RDLGMVSEAE LRVVSKRQPT EQELRDALFC WKVAKFVKSN AIVYAKENMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP FRDGIDAAAA VGVSCVIQPG GSIRDEEVIA AADEHGIAMI FTDMRHFRH
|
| |