Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4498 |
Symbol | purH |
ID | 6268518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4206779 |
End bp | 4208368 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641728290 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001882692 |
Protein GI | 187733909 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000000358321 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA AGCCGGTATC GTCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC AGGGGGCACT GCCCGTCTGT TAGCAGAAAA AGGTCTGCCG GTAACCGAAG TTTCCGATTA CACCGGTTTC CCGGAGATGA TGGATGGACG CGTGAAGACC CTGCATCCGA AAGTACATGG TGGCATTCTG GGCCGTCGCG GCCAGGACGA TGCCATTATG GAAGAACATC AGATCCAGCC TATCGATATG GTGGTTGTTA ACCTGTATCC GTTCGCCCAG ACTGTTGCTC GTGAAGGCTG CTCGCTGGAA GATGCGGTTG AGAACATCGA TATCGGCGGC CCGACCATGG TGCGCTCCGC CGCTAAGAAC CATAAAGATG TCGCAATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG GATGACAACG AAGGATCGCT GACGCTTGCA ACCCGTTTCG ACCTCGCCAT CAAAGCCTTC GAACACACTG CCGCCTACGA CAGCATGATT GCCAACTACT TCGGCAGCAT GGTTCCGGCG TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCAC GCACGCTGAA CCTGAACTTC ATTAAGAAGC AGGATATGCG TTACGGCGAG AACAGCCACC AGCAGGCTGC CTTCTATATA GAAGAGAATG TGAAAGAAGC CTCCGTTGCT ACCGCAACCC AGGTTCAGGG TAAAGCCCTC TCTTATAACA ACATCGCCGA TACCGATGCG GCGCTGGAGT GCGTGAAAGA GTTCGCCGAG CCGGCATGTG TGATTGTGAA GCACGCCAAC CCTTGCGGTG TAGCTATCGG CAATTCCATT CTTGATGCTT ACGATCGCGC GTACAAAACC GACCCAACCT CCGCATTCGG CGGCATCATT GCCTTTAACC GCGAGCTAGA TGCGGAAACC GCACAGGCCA TCATTTCTCG TCAGTTTGTC GAAGTGATTA TTGCGCCTTC CGCCAGCGAA GAAGCCCTGA AAATCACCGC CGCCAAGCAG AACGTACGCG TTCTGACCTG CGGTCAGTGG GGCGAGCGTG TTCCGGGTCT TGATTTCAAA CGCGTGAACG GCGGTCTGCT GGTTCAGGAT CGTGACCTGG GGATGGTCGG TGCGGAAGAA CTGCGCGTCG TCACCCAACG TCAGCCGACC GAACAGGAAC TGCGCGATGC GCTGTTCTGC TGGAAAGTGG CGAAGTTCGT GAAATCCAAT GCTATCGTCT ATGCCAAAAA CAATATGACC ATCGGTATTG GCGCGGGCCA GATGAGCCGC GTGTACTCCG CGAAAATCGC CGGTATTAAA GCTGCCGATG AAGGCCTGGA AGTGAAAGGT TCCTCGATGG CTTCTGACGC GTTCTTCCCG TTCCGCGACG GTATTGATGC CGCCGCCGCT GCGGGCGTGA CCTGCGTAAT CCAGCCAGGC GGTTCCATCC GTGATGACGA AGTGATTGCC GCCGCCGACG AGCACGGTAT TGCGATGCTC TTCACCGACA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQRRPVRRA LLSVSDKAGI VEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EEHQIQPIDM VVVNLYPFAQ TVAREGCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DDNEGSLTLA TRFDLAIKAF EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI EENVKEASVA TATQVQGKAL SYNNIADTDA ALECVKEFAE PACVIVKHAN PCGVAIGNSI LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASE EALKITAAKQ NVRVLTCGQW GERVPGLDFK RVNGGLLVQD RDLGMVGAEE LRVVTQRQPT EQELRDALFC WKVAKFVKSN AIVYAKNNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SSMASDAFFP FRDGIDAAAA AGVTCVIQPG GSIRDDEVIA AADEHGIAML FTDMRHFRH
|
| |