Gene Information |
Locus tag | SeHA_C4507 |
Symbol | purH |
ID | 6490266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 4387866 |
End bp | 4389455 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642744581 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002048161 |
Protein GI | 194449397 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
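The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions fit the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4387866, 4389455)
print(length_bp)                  # 1590
print(protein_length(length_bp))  # 529
```

Both results match the Gene Length (1590 bp) and Protein Length (529 aa) fields.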
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0348151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.014905 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC ATCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC GGGGGGCACC GCCCGCCTGT TAGCAGAAAA AGGCCTGCCG GTGACCGAAG TTTCCGATTA CACCGGTTTC CCGGAAATGA TGGATGGACG CGTAAAGACC CTGCATCCAA AAATCCACGG CGGCATCCTC GGTCGTCGCG GCCAGGACGA TGCCATTATG GAACAGCACC ACATCGCCCC TATCGATATG GTTGTCGTTA ACCTGTATCC GTTCGCCGAG ACCGTGGCAC GCGAAGGCTG CTCGCTGGAA GATGCGGTAG AGAACATTGA TATCGGCGGC CCGACCATGG TGCGCTCTGC TGCTAAGAAC CATAAAGACG TGGCCATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG GATGCTAACG AAGGTTCTCT GACCCTCGAC ACCCGTTTCG ATCTCGCGAT TAAAGCCTTC GAACACACCG CCGCCTACGA CAGCATGATC GCTAACTACT TCGGCAGCAT GGTTCCGGCC TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCGC GTACGCTGAA TCTGAACTTC ATTAAGAAGC AGGATATGCG CTATGGCGAG AACAGCCACC AGCAGGCAGC CTTCTATATA GAAGAAAATG TGAAAGAAGC GTCCGTTGCC ACCGCACAGC AGGTTCAGGG CAAAGCGCTT TCTTACAACA ACATCGCTGA TACCGACGCG GCGCTGGAGT GTGTGAAAGA GTTCAACGAG CCAGCCTGCG TAATCGTCAA GCACGCTAAC CCGTGCGGCG TGGCGGTGAG CACCACTATT CTCGACGCTT ACGACCGTGC GTATAAAACC GACCCAACCT CCGCGTTCGG CGGCATTATC GCCTTCAACC GCGAACTGGA TGCCGAAACC GCGCAGGCCA TTATCTCCCG CCAGTTCGTG GAAGTGATCA TCGCCCCGTC CGCAACCGAA GAGGCGCTGA AAATCACCGC CGCCAAGCAG AATGTGCGCG TACTGACCTG TGGTCAGTGG GCACAGCGCG TACCTGGTCT GGACTTCAAA CGCGTTAACG GCGGCCTGCT GGTTCAGGAC AGGGACCTGG GTATGGTGAG CGAAGCTGAA CTGCGCGTGG TTTCCAAACG CCAGCCGACC GAGCAAGAGC TGCGCGATGC GCTGTTCTGC TGGAAAGTGG CAAAGTTCGT GAAATCTAAC GCCATTGTTT ACGCCAAAGA GAATATGACC ATCGGCATAG GCGCAGGCCA GATGAGCCGC GTCTACTCCG CGAAAATCGC TGGCATTAAA GCGGCTGACG AAGGTCTGGA AGTGAAAGGT TCCGCGATGG CCTCTGACGC CTTCTTCCCG TTCCGCGATG GTATTGATGC CGCTGCCGCC GTGGGCGTGA GCTGCGTGAT CCAGCCTGGC GGTTCTATCC GTGATGATGA AGTCATTGCC GCCGCCGACG AACACGGTAT TGCGATGATC TTCACCGACA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQRRPVRRA LLSVSDKAGI IEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF PEMMDGRVKT LHPKIHGGIL GRRGQDDAIM EQHHIAPIDM VVVNLYPFAE TVAREGCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DANEGSLTLD TRFDLAIKAF EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI EENVKEASVA TAQQVQGKAL SYNNIADTDA ALECVKEFNE PACVIVKHAN PCGVAVSTTI LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSATE EALKITAAKQ NVRVLTCGQW AQRVPGLDFK RVNGGLLVQD RDLGMVSEAE LRVVSKRQPT EQELRDALFC WKVAKFVKSN AIVYAKENMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP FRDGIDAAAA VGVSCVIQPG GSIRDDEVIA AADEHGIAMI FTDMRHFRH
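The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch, using only the Python standard library, of how such a block-formatted gene sequence could be cleaned, checked for GC content, and translated. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand) and contains only unambiguous A/C/G/T bases. The codon assignments are those of NCBI translation table 11; the table's alternative start codons are not handled, which does not matter here because the sequence starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in the conventional
# T, C, A, G ordering of the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two display blocks of the gene sequence above.
print(translate("ATGCAACAAC GTCGTCCAGT"))  # MQQRRP
```

The six residues printed match the start of the protein sequence (MQQRRP...). Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be cross-checked in the same way.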