Gene Anae109_1399 details

Gene Information

Locus tag: Anae109_1399
Symbol: purH
ID: 5374120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 1580889
End bp: 1582463
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 72%
IMG OID: 640842909
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001378590
Protein GI: 153004265
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
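
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1580889
end_bp = 1582463

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1575, matches "Gene Length"
print(protein_length)  # 524, matches "Protein Length"
```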


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.136347
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCCGCC GCGCTCTCGT CTCCGTCTCC GACAAGACGG GCCTCGTCCC GTTCGCGAAG 
CGGCTCGCCG CGCTCGGCGT CGAGATTCTC TCCACCGGCG GCACGCAGCG CGCGCTCGCC
GACGCCGGCG TGCCCGTCGT CTCGGTGGGT GACTACACGC AGGCTCCGGA GATCCTGGCC
GGGCGCGTGA AGACCCTCCA CCCGCGCGTG CACGGCGGCA TCCTCTACCG GCGCGGCCTC
GCGTCCGACG AGGCCGACGT GAAGGCCCGG GACATCCCCC CCATCGACCT CGTGGTGGTG
AACCTCTACC CGTTCCGCGA GGCGGTCGCG GCCGGGAAGC CGTTCTGGGA CTGCGTCGAG
GAGATCGACA TCGGCGGGCC GACCATGGTG CGCAGCGCGG CGAAGAACGC GGCGCACGTG
GGCGTGGTGG TCGACCCCGC GGACTACGAG CGCGTCGCGG CCGAGCTCGA GGCCTCGCGC
GCGCTGTCGG ATCAGACGCG CTTCGAGCTC ATGAAGAAGG CCTTCGCCCA CACGGCCGCC
TACGACGCCG CCATCTCCGA GTTCCTCACG GCGCGCGAGA GCACGGACGC GCAGGCGAAG
CGCTTCCCCG CCACGCTCGC CGCCGTCTAC TCGAAGGCGG GGGACCTCCG CTACGGCGAG
AACCCCCACC AGGCGGGCGC CTTCTACCGC GCCGGCCGCG AGCCGGACGA GCCGACGGTC
GCCTTCGCGA AGGTGCTGCA GGGCAAGGAG CTCAGCTACA ACAACCTCCT CGACCTCGAG
GCGGCCCTCG CCGCCGTCAA GGAGCACGAC GAGGTCGCCT GCGTCGTCAT CAAGCACAAC
ACCCCCTGCG GCGTGTCGCT CGGGAAGACG CCCGCGGAGG CGTTCGCGCG CGCCCGCGCG
TGCGACCCGG TCTCCGCGTT CGGCGGCATC GTCGCGCTCA ACCGCCCCGT CGACGCCGCG
GCCGCGAAGG AGCTGACCGA TCTCTTCCTC GAGTGCGTGA TCGCGCCCGG CTACGACGAG
GCCGCGCGCG CCGCCCTCGG CGCGAAGAAG AACCTGCGGC TGCTCGAGGC GCCGCGGCTC
GCCGAGCCGC GCACGAGCTG GACGCGCCGG CCCGAGGAGC TCCGCGAGCT CCGCTCGATC
CCCGGCGGCC TGCTCGTCAT GGACCGCGAT CTCGGCGCCA TCCGCCGCGA CGACTGCAAG
GTGATGACGA AGCGCGCGCC GACCGACGCC GAGTGGGAGG ATCTCCTCTT CGCGTGGAAG
GTCGTGAAGC ACGTGAAGTC GAACGCGATC GTCTTCGCGA AGGAGAAGCG CACGGTCGGC
ATCGGGGGCG GGCAGACGAG CCGGGTCGAG TCGGTGAAGA CGGCCGTCAT GAAGGCCCAG
CTCGAGCTCG TCGGGTCGAC GGTCGGCTCG GACGCCTTCT TCCCGTTCAA GGACGGCGTC
GAGGAGATCA TCAAGGCCGG CGCGACCGCC ATCATCCAGC CCGGCGGCTC GGTGCGCGAC
CCCGAGGTGA TCGAGGCCGC GGACGCGGCG AACGTGGCGA TGGTGGCCAC CGGGATGCGC
CACTTCCGGC ACTGA
 
Protein sequence
MVRRALVSVS DKTGLVPFAK RLAALGVEIL STGGTQRALA DAGVPVVSVG DYTQAPEILA 
GRVKTLHPRV HGGILYRRGL ASDEADVKAR DIPPIDLVVV NLYPFREAVA AGKPFWDCVE
EIDIGGPTMV RSAAKNAAHV GVVVDPADYE RVAAELEASR ALSDQTRFEL MKKAFAHTAA
YDAAISEFLT ARESTDAQAK RFPATLAAVY SKAGDLRYGE NPHQAGAFYR AGREPDEPTV
AFAKVLQGKE LSYNNLLDLE AALAAVKEHD EVACVVIKHN TPCGVSLGKT PAEAFARARA
CDPVSAFGGI VALNRPVDAA AAKELTDLFL ECVIAPGYDE AARAALGAKK NLRLLEAPRL
AEPRTSWTRR PEELRELRSI PGGLLVMDRD LGAIRRDDCK VMTKRAPTDA EWEDLLFAWK
VVKHVKSNAI VFAKEKRTVG IGGGQTSRVE SVKTAVMKAQ LELVGSTVGS DAFFPFKDGV
EEIIKAGATA IIQPGGSVRD PEVIEAADAA NVAMVATGMR HFRH
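
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own tooling: the helper names `translate` and `gc_fraction` are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs only in the start codons it permits). Whitespace in the 10-base blocks is ignored.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    clean = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(clean) - 2, 3):
        aa = CODON_TABLE[clean[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    clean = "".join(dna.split()).upper()
    return (clean.count("G") + clean.count("C")) / len(clean)


# First and last lines of the gene sequence listed above.
first_line = "ATGGTCCGCC GCGCTCTCGT CTCCGTCTCC GACAAGACGG GCCTCGTCCC GTTCGCGAAG"
last_line = "CACTTCCGGC ACTGA"

print(translate(first_line))  # MVRRALVSVSDKTGLVPFAK
print(translate(last_line))   # HFRH
```

Both outputs agree with the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value summarized as "GC content" in the Gene Information section.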