Gene Information |
Locus tag | Mlut_04500 |
Symbol | purH |
ID | 7984564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Micrococcus luteus NCTC 2665 |
Kingdom | Bacteria |
Replicon accession | NC_012803 |
Strand | + |
Start bp | 487373 |
End bp | 489064 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644805424 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002956545 |
Protein GI | 239916987 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
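The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the listed values agree under those assumptions. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(487373, 489064)
print(length, protein_length(length))  # prints: 1692 563
```

Both results match the Gene Length (1692 bp) and Protein Length (563 aa) fields.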
| |
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCTCTG CCCAGCCCAC CACCACCGTC CTGGACACGG TGCCCCTGAA GCGGGCCCTG ATCTCCGTGT ACGACAAGAC CGGCCTCGAG GAGCTCGCCA CCGGGCTCCA CGCCGCGGGC GTGCAGATCG TCTCGACCGG CTCCACCGCC CAGCGCATCG CCGCCGCCGG CGTGCCCGTC ACCGAGGTCG CCGAGGTCAC CGGGTTCCAG GAGTGCCTGG ACGGCCGCGT GAAGACGCTG CACCCGCGCG TGCACGCGGG CATCCTGGCG GACCGTCGTC GCGAGGACCA CGTGACCCAG CTGCGCGAGC TCGAGGTGGA GCCGTTCGAC CTCGTCGTCG TGAACCTCTA CCCGTTCGTG GACACCGTGA ACTCGGGCGC CGCGGAGGAC GCCGTCGTCG AGCAGATCGA CATCGGCGGG CCGTCCATGG TGCGCGCGGC CGCGAAGAAC CATGCATCCG TGGCGATCGT CGTGGACCCG GCCCGCTACG GCGAGGTCGT CCAGGCCGCG CAGTCCGGCG GCTTCGACCT GCGCGCCCGC CAGCGCCTGG CCGCCCTGGC CTTCGCCCAC ACCGCGGCGT ACGACAACGC CGTGGCTGCC TGGACCGCCG CCCACTTCGG CGAGGACATC CACGCCGACG AGATCCCCGT GTTCCCGCCC TACGCCGGCT TCTCCCTCGA GCGCGCGCAG ATCCTGCGCT ACGGCGAGAA CCCGCACCAG CCCGCGGCGC TGTACCTGGA CTCCTCGGCG GCCCCCGGCA TCGCGCAGGC CGAGCTGCTG CACGGCAAGC CGATGAGCTA CAACAACTAC GTGGACGCCG ACGCCGCGGT GCGCGCCGCC TTCGACCACC CGGTCCCGGC CGTGGCGATC GTCAAGCACG CCAACCCGTG CGGCGTGGCC GTCACGGACG CGGGCACGGA CATCGCGCAG GCGCACGCCA AGGCCCACGC GTGCGACCCG GTCTCCGCGT TCGGCGGCGT GATCGCGGCC AACCGCCCCG TCACCGACGC CATGGCCGCC CAGGTGAAGG ACGTGTTCAC GGAGGTCGTC GTGGCCCCGG CGTTCGAGCC CGAGGCCCTG GAGATCCTCT CCGCCAAGAA GAACCTGCGC CTGCTGAGCC TGCCCGAGGG CTTCCTGCGG GACGCCGTGG AGGCCAAGCA GGTCTCCGGC GGCATGCTGC TGCAGATCGC GGACGCGGTG GACGCGGACG GCGACGACCC GGCCACCTGG ACCCTCGCCG CCGGCCCGGC CGCCGACGAG GCCGTCCTGG CCGACCTGGC CTTCGCGTGG CGCGCCGTGC GCGCGGCCAA GTCCAACGCC GTGCTGCTGG CCCACGACGG CGCCACGGTC GGCGTGGGCA TGGGCCAGGT CAACCGCCTC GACTCCTGCC GCCTGGCCGT CGAGCGCGCC AACACCCTGG GCGCGGCGCA GACCGGCGGG CAGGACGTGA ACAGCGCCGG CGGCGCGGAG AACGTCTCCG GGGAGGGCGC CCCCGAGCGG GCCCGCGGGT CCGTGGCCGC CTCGGACGCG TTCTTCCCGT TCGCGGACGG GCTGCAGATC CTCATCGACG CCGGCGTGAA GGCCGTCGTC CAGCCGGGCG GCTCCGTCCG GGATGAGGAG GTCGTGGCCG CGGCCGAGGC CGCCGGCGTG ACCCTGTACC TGACCGGGGC GCGCCACTTC TTCCACGGCT GA
Protein sequence | MISAQPTTTV LDTVPLKRAL ISVYDKTGLE ELATGLHAAG VQIVSTGSTA QRIAAAGVPV TEVAEVTGFQ ECLDGRVKTL HPRVHAGILA DRRREDHVTQ LRELEVEPFD LVVVNLYPFV DTVNSGAAED AVVEQIDIGG PSMVRAAAKN HASVAIVVDP ARYGEVVQAA QSGGFDLRAR QRLAALAFAH TAAYDNAVAA WTAAHFGEDI HADEIPVFPP YAGFSLERAQ ILRYGENPHQ PAALYLDSSA APGIAQAELL HGKPMSYNNY VDADAAVRAA FDHPVPAVAI VKHANPCGVA VTDAGTDIAQ AHAKAHACDP VSAFGGVIAA NRPVTDAMAA QVKDVFTEVV VAPAFEPEAL EILSAKKNLR LLSLPEGFLR DAVEAKQVSG GMLLQIADAV DADGDDPATW TLAAGPAADE AVLADLAFAW RAVRAAKSNA VLLAHDGATV GVGMGQVNRL DSCRLAVERA NTLGAAQTGG QDVNSAGGAE NVSGEGAPER ARGSVAASDA FFPFADGLQI LIDAGVKAVV QPGGSVRDEE VVAAAEAAGV TLYLTGARHF FHG
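The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes translation table 11 as stated in the record, whose codon-to-amino-acid assignments are the same as the standard code, and it reads an alternative initiation codon such as the GTG at the start of this gene as methionine. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, and only the first 30 bases of the gene are used here for brevity.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout, table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons listed for table 11; all are read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS on the + strand, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    if protein and seq[:3] in START_CODONS:
        protein[0] = "M"  # initiator codon is translated as Met
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGATCTCTG CCCAGCCCAC CACCACCGTC"
print(translate_cds(prefix))  # prints: MISAQPTTTV
print(round(gc_fraction(prefix), 2))
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would be one way to compare against the complete protein sequence and the GC content field.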