Gene Information |
Locus tag | Acid345_4470 |
Symbol | purH |
ID | 4070953 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5304131 |
End bp | 5305702 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986509 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_593544 |
Protein GI | 94971496 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
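The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5304131
end_bp = 5305702
gene_length_bp = 1572
protein_length_aa = 523

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes exactly one terminal stop codon,
# which does not contribute a residue to the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1572 0 523
```

Under these assumptions the span matches the reported gene length, is a multiple of three, and gives the reported protein length.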
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAAA TTCGTCGCGC TATTCTCTCC GTTACTGACA AAATCGGCCT GTCCGACTTC GCACGCACGT TGGCGAAGCA TGGCGTTGAA CTGATCTCCA CCGGCGGCAC GGCGAAGATG CTGCGCGACG CCGGCATCCC CGTCAAAGAC ATCTCCGAAT TGACCGGCTT TCCCGAGATG CTCGATGGCC GCGTGAAGAC CCTTCACCCC AAGGTGCATG GCGGCATCCT GCACCTGCGC GCCAACGAAG AGCACGTGGC GACGGTGAAA GAGCACGGCA TCCAGCCAAT CGACATGGTG GTGGTGAACC TGTACGCGTT CGAAAAGACG GCATCGAAGC CCGGGGCGCA CTTCGAAGAA ATCATCGAGA ACATCGATAT CGGCGGGCCG AGCATGGTGC GCTCGGCGGG CAAGAATTTC CAGGACGTGG CGATCGTCAC TTCGCCCGAC CAGTACGCGC AGGTCGCTGA AGAGATGGAC AAGAGCGGCG GTTCGGTTTC CAAGCAGATG CATTGGAAGC TGGCGCAGCG CGCGTTCGCC ACGACCGCCG CGTACGACTC AGCAATTGCT TCGGCGCTGG AGCGCGTGAT GGTGGACGAT GCCGGAAAGT TCGACATATC GAACATCCAC GGCGGCACTG GTTTCCCTGA GATCTTGCGA CTATTGTTCC GCAAATCCAT GGATCTTCGC TACGGCGAGA ACCCGCACCA GAAGGCGGCG CTCTACTCCA ACGGCACCGA TCTTGGCGTC GCCAACGGCA AGCAGCTCCA GGGCAAGGAG CTTTCGTACA ACAATATCGT CGATCTGCAG GCAGCGTGGG ACCTGGCGCA GGAGTTCGAT GAGCCCGTCT GCGCGATCAT CAAGCACACC AATCCGTGCG GCACGGCAGT CAGTTCGATA CTTGTCGAAG CGTATAAACG TGCTCTCGAA GCCGATCCGG TTTCGGCGTT CGGGGGTGTG ATTGGCGTAA ACCGCGAGAT CGACGAAGCA ACAGCGGAAG AAATGGCGAA GCTTTTCCTC GAAGTGATCG CCGCTCCGAG TTTCAGCGAG GGAGCGAAGG CGCGCTTCGC CGCGAAGAAG AACTTGCGGC TCGTCGAAGT AAAGGCGCTC GACCAGAAAT ACACGCTGAA GAATGTATCC GGCGGCGTGC TGGTGCAGGA CAACGACATT CGTCCGCTGA CCGACGCAGA TTTGAAAGTT GTCAGCGAGC GCAAGCCCAC TGAATCCGAG ATGAAGGACC TGCTCTTCGC GTGGAAGGTC TGCAAGCATG TGAAATCGAA TGCGATCCTC TACGCGAAAG ACGGCCGCAG CGTGGGCGTG GGCGCCGGCC AGATGAGCCG CGTGGATTCA GCGCGCATCG GTGCGATGAA AGCCGTGTTG CCTCTGAAGG GTTGCGTCGC GGCGAGCGAT GCGTTCTTCC CGTTCCCTGA TGGAGTCGAA GTCATCGCCG AAGCTGGAGC GACGGCGATC ATCCAGCCTG GCGGATCAGT GAAAGACCAG GAAGTGATTG ACACCGCGAA CCGGTTGGGA CTGGCAATGG TGCTCACGGG TGTGCGGCAC TTCCGGCACT AA
|
Protein sequence | MAKIRRAILS VTDKIGLSDF ARTLAKHGVE LISTGGTAKM LRDAGIPVKD ISELTGFPEM LDGRVKTLHP KVHGGILHLR ANEEHVATVK EHGIQPIDMV VVNLYAFEKT ASKPGAHFEE IIENIDIGGP SMVRSAGKNF QDVAIVTSPD QYAQVAEEMD KSGGSVSKQM HWKLAQRAFA TTAAYDSAIA SALERVMVDD AGKFDISNIH GGTGFPEILR LLFRKSMDLR YGENPHQKAA LYSNGTDLGV ANGKQLQGKE LSYNNIVDLQ AAWDLAQEFD EPVCAIIKHT NPCGTAVSSI LVEAYKRALE ADPVSAFGGV IGVNREIDEA TAEEMAKLFL EVIAAPSFSE GAKARFAAKK NLRLVEVKAL DQKYTLKNVS GGVLVQDNDI RPLTDADLKV VSERKPTESE MKDLLFAWKV CKHVKSNAIL YAKDGRSVGV GAGQMSRVDS ARIGAMKAVL PLKGCVAASD AFFPFPDGVE VIAEAGATAI IQPGGSVKDQ EVIDTANRLG LAMVLTGVRH FRH
|
| |
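The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the method used to produce this record, of how the protein could be re-derived from the gene sequence and how a GC fraction could be computed. It assumes the codon assignments of translation table 11 match the standard genetic code (table 11 differs mainly in which start codons are permitted), and it does not handle alternative start codons or ambiguous bases. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGGCAAAAA TTCGTCGCGC TATTCTCTCC"
print(translate(first_blocks))  # MAKIRRAILS
```

The printed residues match the first block of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.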