Gene Mlut_04500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlut_04500 
SymbolpurH 
ID7984564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMicrococcus luteus NCTC 2665 
KingdomBacteria 
Replicon accessionNC_012803 
Strand
Start bp487373 
End bp489064 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content74% 
IMG OID644805424 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002956545 
Protein GI239916987 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCTCTG CCCAGCCCAC CACCACCGTC CTGGACACGG TGCCCCTGAA GCGGGCCCTG 
ATCTCCGTGT ACGACAAGAC CGGCCTCGAG GAGCTCGCCA CCGGGCTCCA CGCCGCGGGC
GTGCAGATCG TCTCGACCGG CTCCACCGCC CAGCGCATCG CCGCCGCCGG CGTGCCCGTC
ACCGAGGTCG CCGAGGTCAC CGGGTTCCAG GAGTGCCTGG ACGGCCGCGT GAAGACGCTG
CACCCGCGCG TGCACGCGGG CATCCTGGCG GACCGTCGTC GCGAGGACCA CGTGACCCAG
CTGCGCGAGC TCGAGGTGGA GCCGTTCGAC CTCGTCGTCG TGAACCTCTA CCCGTTCGTG
GACACCGTGA ACTCGGGCGC CGCGGAGGAC GCCGTCGTCG AGCAGATCGA CATCGGCGGG
CCGTCCATGG TGCGCGCGGC CGCGAAGAAC CATGCATCCG TGGCGATCGT CGTGGACCCG
GCCCGCTACG GCGAGGTCGT CCAGGCCGCG CAGTCCGGCG GCTTCGACCT GCGCGCCCGC
CAGCGCCTGG CCGCCCTGGC CTTCGCCCAC ACCGCGGCGT ACGACAACGC CGTGGCTGCC
TGGACCGCCG CCCACTTCGG CGAGGACATC CACGCCGACG AGATCCCCGT GTTCCCGCCC
TACGCCGGCT TCTCCCTCGA GCGCGCGCAG ATCCTGCGCT ACGGCGAGAA CCCGCACCAG
CCCGCGGCGC TGTACCTGGA CTCCTCGGCG GCCCCCGGCA TCGCGCAGGC CGAGCTGCTG
CACGGCAAGC CGATGAGCTA CAACAACTAC GTGGACGCCG ACGCCGCGGT GCGCGCCGCC
TTCGACCACC CGGTCCCGGC CGTGGCGATC GTCAAGCACG CCAACCCGTG CGGCGTGGCC
GTCACGGACG CGGGCACGGA CATCGCGCAG GCGCACGCCA AGGCCCACGC GTGCGACCCG
GTCTCCGCGT TCGGCGGCGT GATCGCGGCC AACCGCCCCG TCACCGACGC CATGGCCGCC
CAGGTGAAGG ACGTGTTCAC GGAGGTCGTC GTGGCCCCGG CGTTCGAGCC CGAGGCCCTG
GAGATCCTCT CCGCCAAGAA GAACCTGCGC CTGCTGAGCC TGCCCGAGGG CTTCCTGCGG
GACGCCGTGG AGGCCAAGCA GGTCTCCGGC GGCATGCTGC TGCAGATCGC GGACGCGGTG
GACGCGGACG GCGACGACCC GGCCACCTGG ACCCTCGCCG CCGGCCCGGC CGCCGACGAG
GCCGTCCTGG CCGACCTGGC CTTCGCGTGG CGCGCCGTGC GCGCGGCCAA GTCCAACGCC
GTGCTGCTGG CCCACGACGG CGCCACGGTC GGCGTGGGCA TGGGCCAGGT CAACCGCCTC
GACTCCTGCC GCCTGGCCGT CGAGCGCGCC AACACCCTGG GCGCGGCGCA GACCGGCGGG
CAGGACGTGA ACAGCGCCGG CGGCGCGGAG AACGTCTCCG GGGAGGGCGC CCCCGAGCGG
GCCCGCGGGT CCGTGGCCGC CTCGGACGCG TTCTTCCCGT TCGCGGACGG GCTGCAGATC
CTCATCGACG CCGGCGTGAA GGCCGTCGTC CAGCCGGGCG GCTCCGTCCG GGATGAGGAG
GTCGTGGCCG CGGCCGAGGC CGCCGGCGTG ACCCTGTACC TGACCGGGGC GCGCCACTTC
TTCCACGGCT GA
 
Protein sequence
MISAQPTTTV LDTVPLKRAL ISVYDKTGLE ELATGLHAAG VQIVSTGSTA QRIAAAGVPV 
TEVAEVTGFQ ECLDGRVKTL HPRVHAGILA DRRREDHVTQ LRELEVEPFD LVVVNLYPFV
DTVNSGAAED AVVEQIDIGG PSMVRAAAKN HASVAIVVDP ARYGEVVQAA QSGGFDLRAR
QRLAALAFAH TAAYDNAVAA WTAAHFGEDI HADEIPVFPP YAGFSLERAQ ILRYGENPHQ
PAALYLDSSA APGIAQAELL HGKPMSYNNY VDADAAVRAA FDHPVPAVAI VKHANPCGVA
VTDAGTDIAQ AHAKAHACDP VSAFGGVIAA NRPVTDAMAA QVKDVFTEVV VAPAFEPEAL
EILSAKKNLR LLSLPEGFLR DAVEAKQVSG GMLLQIADAV DADGDDPATW TLAAGPAADE
AVLADLAFAW RAVRAAKSNA VLLAHDGATV GVGMGQVNRL DSCRLAVERA NTLGAAQTGG
QDVNSAGGAE NVSGEGAPER ARGSVAASDA FFPFADGLQI LIDAGVKAVV QPGGSVRDEE
VVAAAEAAGV TLYLTGARHF FHG