Gene Achl_1167 details


Gene Information

Locus tag: Achl_1167
Symbol: purH
ID: 7292612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 1282421
End bp: 1284106
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 68%
IMG OID: 643589572
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002487247
Protein GI: 220911938
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
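
The coordinate and length fields in this record are consistent with each other, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1282421, 1284106)
print(length)                  # 1686
print(protein_length(length))  # 561
```

Both values match the Gene Length and Protein Length fields above.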


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000000225993
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGCTTCA CGCAGCATGA GCGCGTATCC ATTGACCGTG TACCCATCCG CCGGGCCCTG 
ATCTCGGTTT ACGACAAGAC CGGCTTGGAG GAGCTCGCCA CAGGCCTGCA CGCAGCGGGT
GTGAAGCTGG TCTCCACCGG TTCCACCGCG AAGAAGATCG CTGCAGCGGG CATCCCGGTG
CAGGAGGTTG AGGAAGTCAC CGGTTCCCCG GAGATGCTGG ACGGCCGCGT CAAGACGCTG
CACCCCCGCG TCCACGGCGG CATCCTGGCG GACCGCCGCG TCCCCGCCCA CATGGAGACG
CTCGAAAGCA TGGACATCGA GGCCTTCGAC CTGGTGGTGG TGAACCTCTA CCCGTTCGTG
GAGACCGTCA AGTCCGGTGC CGCGCAGGAC GACGTCGTTG AGCAGATCGA CATCGGAGGC
CCCGCCATGG TGCGCTCGGC CGCGAAGAAC CATGCCGCCG TCGCCATCGT GGTGGACCCG
TCCTTCTACG GTCAGGTAGT CACCGCAGCC GCTGAAGGCG GTTTCGACCT GAAGACCCGC
CGCCGCCTCG CGGCCAAGGC GTTCGCGCAC ACGGCCTCCT ACGACAACGC CGTGGCCACT
TGGACTGCCA GCCAGTTCCT GGACGAAGAC GGCGACGGCA TCATCGACTG GCCTGCCTAC
GCCGGCATGT CCCTGGAACG CTCTGAAGTG CTGCGTTACG GCGAGAACCC GCATCAGCAG
GCGGCGCTCT ACGTGGACAA GGCCGCCCCG GTGGGCATCG CCCAGGCAGA CCAGCTGCAC
GGCAAGGCCA TGAGCTACAA CAATTTCGTT GATGCCGACG CCGCCCTCCG CGCCGCCTTC
GATTTCAGTG AGCCGGCCGT GGCCATCATC AAGCACGCCA ACCCGTGCGG CGTTGCCGTC
GGCTCGGCAG GCGCCGCGGA CCCCATCGCC GACGCCCACG CCAAGGCCCA CGCCTGCGAC
CCCGTGTCCG CGTTCGGCGG AGTTATCGCA GCCAACCGTC CGGTCACGGC GGCCATGGCC
AACACCGTCA AGGACATCTT CACCGAGGTT GTCATCGCGC CCGGTTTCGA GCCCGAGGCC
GTGGAGATCC TCTCCAAGAA GAAGAACATC CGGCTGCTTT CCCTGCCTGA GGGCTACGGC
CGCTACCCCA CGGAGTTCCG CCAGGTTTCC GGCGGCATGC TGGTCCAGGT CAGCGACAAG
GTGGACGCCG ACGGCGACAA CCCCGCCAAC TGGACGCTGG CCGCGGGCGA AGCCGCCGAT
GAGAAGACCC TCGCGGACCT CGCCTTCGCC TGGACCGCCT GCCGTGCAGC AAAGTCCAAC
GCCATCCTCC TCGCGGACAA CGGTGCAGCT GTGGGCATCG GCATGGGCCA GGTCAACCGC
CTCGATTCCT GCAAGCTGGC CGTGGAACGC GCCAACACGC TGGGCCTCCA GGTGGAGTCC
GACGTCGATG GCGCCGGCGG TGCCACGAAC GCCAGCGGTG CCGGCGCACC GCAGCGTGCC
CAGGGCGCCG TGGCAGCATC GGATGCCTTC TTCCCGTTCG CTGACGGACT GCAGATCCTG
ATCGACGCCG GCGTCCGCGC CGTCGTGCAG CCCGGCGGAT CCGTCCGGGA CGAGGAAGTC
ATCGCTGCGG CCAACGCCGC CGGCATCTCC ATGTACTTCA CCGGAGCGCG CCACTTCTTC
CACTGA
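
The GC content field reported for this gene (68%) can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the 10-base blocked layout used above; `gc_content` is a hypothetical helper name, and the short input shown is just the first two blocks of the sequence, not the whole gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence (illustration only).
fragment = "GTGAGCTTCA CGCAGCATGA"
print(f"{gc_content(fragment):.2f}")  # 0.55
```

Applying the same function to all 1686 bases would give the value to compare against the reported percentage.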
 
Protein sequence
MSFTQHERVS IDRVPIRRAL ISVYDKTGLE ELATGLHAAG VKLVSTGSTA KKIAAAGIPV 
QEVEEVTGSP EMLDGRVKTL HPRVHGGILA DRRVPAHMET LESMDIEAFD LVVVNLYPFV
ETVKSGAAQD DVVEQIDIGG PAMVRSAAKN HAAVAIVVDP SFYGQVVTAA AEGGFDLKTR
RRLAAKAFAH TASYDNAVAT WTASQFLDED GDGIIDWPAY AGMSLERSEV LRYGENPHQQ
AALYVDKAAP VGIAQADQLH GKAMSYNNFV DADAALRAAF DFSEPAVAII KHANPCGVAV
GSAGAADPIA DAHAKAHACD PVSAFGGVIA ANRPVTAAMA NTVKDIFTEV VIAPGFEPEA
VEILSKKKNI RLLSLPEGYG RYPTEFRQVS GGMLVQVSDK VDADGDNPAN WTLAAGEAAD
EKTLADLAFA WTACRAAKSN AILLADNGAA VGIGMGQVNR LDSCKLAVER ANTLGLQVES
DVDGAGGATN ASGAGAPQRA QGAVAASDAF FPFADGLQIL IDAGVRAVVQ PGGSVRDEEV
IAAANAAGIS MYFTGARHFF H
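
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted start codon and is read as methionine when it initiates translation. The sketch below illustrates this with a hand-built codon table; it assumes the input is the coding strand in frame, and `translate_table11` is an illustrative name rather than a library function.

```python
from itertools import product

# Codon assignments for table 11 are the same as the standard code;
# bases are enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_table11(seq: str) -> str:
    """Translate an in-frame CDS; a recognised first codon becomes M."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_table11("GTGAGCTTCA CGCAGCATGA GCGCGTATCC"))  # MSFTQHERVS
```

The output matches the first ten residues of the protein sequence listed above.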