Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_0293 |
Symbol | purH |
ID | 5607182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 334269 |
End bp | 335858 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640935792 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001476531 |
Protein GI | 157368542 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000389639 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0060748 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAACAAC CTCGTCCAAT CCGCCGGGCC CTGCTCAGCG TCTCTGACAA AGCCGGTATC GTTGAATTCG CCGAAGCGCT GTCCCAGCGT GGCGTTGAAC TGCTCTCCAC CGGTGGCACC GCCCGCCTGC TGGCAGATGC CGGCCTGCCT GTTACCGAAG TTTCCGACTA CACCGGCTTC CCGGAAATGA TGGACGGACG AGTAAAGACC CTGCACCCCA AAGTACACGG CGGTATTCTC GGCCGCCGCG GCCAGGACGA CGCCATCATG GGTCAGCATG ACATCAAGCC GATCGACATG GTGGTGGTAA ACCTCTATCC GTTCGCCCAG ACCGTGGCGC GCCCGAACTG CTCACTGGAA GACGCGGTCG AGAACATCGA CATCGGCGGC CCAACCATGG TGCGTTCCGC GGCCAAGAAC CACAAAGACG TCGCCATCGT GGTAAAGAGC AGCGACTACG CCGCTATTAT TACCGAGATG GATAACAACG ACGGTTCACT GCAATACACC ACCCGTTTCG ATCTGGCCAT CAAAGCCTTC GAGCACACCG CCGCTTACGA CAGCATGATC GCCAACTACT TCGGCGCGCT GGTTCCGGCC TACCACGGCG ATACCGAACA ACCTGCCGGT CGTTTCCCTC GCACCCTGAA CCTCAACTAT ATAAAGAAGC AGGATATGCG CTACGGTGAG AACAGCCACC AGCAAGCAGC CTTCTATATA GAAGAGAACG TTCAGGAAGC CTCTGTCGCC ACCGCGGAAC AACTGCAAGG CAAAGCGCTG TCCTACAACA ACATCGCCGA CACCGACGCC GCACTGGAAT GTGTGAAGGA ATTCGCCGAG CCGGCCTGCG TGATCGTCAA GCACGCCAAC CCATGCGGTG TGGCGATCGG CGATGATATT CTGTCTGCCT ATGAGCGCGC CTATCAAACC GACCCGACCT CTGCTTTCGG CGGCATCATC GCCTTTAACC GCGAACTGGA CGCCGCTACC GCACAGGCCA TTATCAGCCG TCAGTTTGTG GAAGTGATTA TCGCGCCGAG CATCAGTCAG GAAGCTCGCT CCCTGTTGGC AGCCAAACAG AACGTGCGCG TACTGGCCTG CGGCCAATGG CAGCAACGTA TTGCCGCTCT CGACTTCAAA CGTGTCAACG GTGGCCTGCT GGTGCAAGAC CGCGATCTGG GTATGGTGAG CGAAGGCGAC CTGCGCGTGG TATCTGAACG TCAGCCGACC GCGCAGGAAC TGCGTGATGC GCTGTTCTGT TGGAAAGTCG CCAAGTTCGT GAAGTCCAAC GCTATCGTCT ATGCACGTGA CAACATGACC ATCGGCATAG GCGCCGGGCA AATGAGCCGC GTTTACTCTG CCAAGATCGC CGGGATCAAA GCCGCGGACG AAGGCCTGGA AGTCAAAGGC TCCGCCATGG CGTCTGACGC TTTCTTCCCG TTCCGTGATG GCATCGATGC CGCCGCAGCG GTGGGCATCA GCTGCGTGAT CCAGCCAGGC GGTTCGATCC GCGATGATGA AGTGATTGCC GCCGCCAATG AGCACGGCAT CGCAATGATC TTCACCGACA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQPRPIRRA LLSVSDKAGI VEFAEALSQR GVELLSTGGT ARLLADAGLP VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM GQHDIKPIDM VVVNLYPFAQ TVARPNCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYAAIITEM DNNDGSLQYT TRFDLAIKAF EHTAAYDSMI ANYFGALVPA YHGDTEQPAG RFPRTLNLNY IKKQDMRYGE NSHQQAAFYI EENVQEASVA TAEQLQGKAL SYNNIADTDA ALECVKEFAE PACVIVKHAN PCGVAIGDDI LSAYERAYQT DPTSAFGGII AFNRELDAAT AQAIISRQFV EVIIAPSISQ EARSLLAAKQ NVRVLACGQW QQRIAALDFK RVNGGLLVQD RDLGMVSEGD LRVVSERQPT AQELRDALFC WKVAKFVKSN AIVYARDNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP FRDGIDAAAA VGISCVIQPG GSIRDDEVIA AANEHGIAMI FTDMRHFRH
|
| |