Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4509 |
Symbol | purH |
ID | 6485735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4382603 |
End bp | 4384192 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642739738 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002043424 |
Protein GI | 194444373 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000231662 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.200878 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC ATCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC GGGGGGCACC GCCCGCCTGT TAGCAGAAAA AGGCCTGCCG GTGACCGAAG TTTCCGATTA CACCGGTTTT CCGGAAATGA TGGATGGACG CGTAAAGACC CTGCATCCAA AAGTCCACGG CGGCATCCTC GGTCGTCGCG GCCAGGACGA TGCCATTATG GAACAGCACC ACATCGCCCC TATCGATATG GTTGTCGTTA ACCTGTATCC GTTCGCCGAA ACCGTAGCAC GCGAAGGCTG CTCGCTGGAA GATGCGGTAG AGAACATCGA TATCGGCGGC CCGACCATGG TGCGCTCTGC TGCGAAGAAC CATAAAGACG TCGCTATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG GATGCTAACG AAGGTTCTCT GACCCTCGAC ACCCGTTTCG ATCTCGCGAT TAAAGCCTTC GAACACACCG CCGCCTACGA CAGCATGATT GCCAACTACT TCGGCAGCAT GGTTCCGGCC TACCACGGTG AGAGCAAAGA AGCCGCCGGT CGCTTCCCGC GTACGCTGAA CCTGAACTTC ATTAAGAAGC AGGATATGCG CTACGGCGAG AACAGCCACC AGCAGGCTGC CTTCTATATA GAAGAAAATG TGAAAGAAGC GTCCGTTGCC ACCGCAACCC AGGTTCAGGG TAAAGCCCTC TCTTATAACA ACATCGCCGA TACCGATGCG GCGCTGGAAT GCGTGAAAGA GTTCAACGAG CCGGCCTGCG TGATCGTCAA GCACGCCAAC CCGTGCGGCG TGGCGGTAAG TACCTCTATT CTCGACGCTT ACGACCGCGC ATACAAAACC GACCCGACCT CCGCCTTCGG CGGCATCATC GCCTTCAACC GCGAGCTGGA TGCCGAAACC GCGCAGGCCA TCATCTCCCG CCAGTTCGTA GAAGTGATCA TCGCCCCATC CGCCAGCGGA GAAGCGCTGA AAATCACCGC CGCCAAGCAG AACGTGCGCG TTCTGACCTG CGGTCAGTGG GCACAGCGTG TGCCGGGTCT GGACTTCAAG CGTGTTAACG GCGGCCTGCT GGTTCAGGAT CGTGACCTGG GTATGGTGAG CGAAGCTGAA CTGCGCGTGG TTTCCAAACG CCAGCCGACC GAGCAAGAGC TGCGCGATGC GCTGTTCTGC TGGAAAGTGG CAAAGTTCGT GAAATCTAAC GCCATTGTTT ACGCCAAAGA GAATATGACC ATCGGCATAG GCGCAGGCCA GATGAGCCGC GTCTACTCCG CGAAAATCGC CGGGATTAAA GCGGCTGACG AAGGTCTGGA AGTGAAAGGT TCCGCGATGG CTTCCGACGC CTTCTTCCCG TTCCGTGACG GTATTGATGC CGCTGCCGCT GTCGGCGTGA GCTGCGTTAT CCAGCCTGGC GGTTCTATCC GTGATGATGA AGTCATTGCC GCTGCCGATG AACACGGCAT TGCGATGATC TTCACCGACA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQRRPVRRA LLSVSDKAGI IEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EQHHIAPIDM VVVNLYPFAE TVAREGCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DANEGSLTLD TRFDLAIKAF EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI EENVKEASVA TATQVQGKAL SYNNIADTDA ALECVKEFNE PACVIVKHAN PCGVAVSTSI LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASG EALKITAAKQ NVRVLTCGQW AQRVPGLDFK RVNGGLLVQD RDLGMVSEAE LRVVSKRQPT EQELRDALFC WKVAKFVKSN AIVYAKENMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP FRDGIDAAAA VGVSCVIQPG GSIRDDEVIA AADEHGIAMI FTDMRHFRH
|
| |