Gene AnaeK_1391 details


Gene Information

Locus tag: AnaeK_1391
Symbol: purH
ID: 6785935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 1578931
End bp: 1580505
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 72%
IMG OID: 642762848
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002133751
Protein GI: 197121800
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
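
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1578931, 1580505)
print(length)                     # 1575
print(protein_length_aa(length))  # 524
```

Both values agree with the Gene Length and Protein Length fields of this record.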


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCGCC GCGCGCTCGT CTCGGTCTCC GACAAGACCG GTCTCGTTCC TTTCGCCAGG 
CGGCTCGCCG CCCTCGGCGT GGAGCTGCTC TCCACCGGCG GCACGCAGAA GACGCTCGCC
GAGGCCGGCG TCCCGGTGGT CGGCGTGGGC GACTACACGC AGGCCCCGGA GATCCTGGGT
GGCCGCGTGA AGACGCTCCA CCCGCGCGTG CACGGCGGCA TCCTCTACCG CCGCGGCCTC
GCCTCCGACG AGGCCGACGT GAAGGCGCGG GACATCCCGC CCATCGACCT CGTGGTGGTG
AACCTGTACC CGTTCCGCGA GGCGGTGGCG GCCGGCAAGC CGTTCGAGAC CTGCGTCGAG
GAGATCGACA TCGGCGGGCC GACCATGGTC CGGAGCGCCG CCAAGAACTC GGCGCACGTG
GGCGTGGTGG TGGACCCGGC CGACTACGAG AAGGTGGCGG CGGAGCTGGA GGCGACGCGC
ACGCTCTCCG CCGCGACGCG CTTCTACCTC ATGAAGAAGG CGTTCGCGCA CACCGCCGCG
TACGACGCCG CCATCTCCGA GTACCTCACG GCGCGCGAGG CCCCCGAGGC CGCGCCCGCG
CACTTCCCCG CCACGCTCGC GGCGGTCTAC ACCAAGGCGT ACGACCTCCG GTACGGCGAG
AACCCGCACC AGGCCGGCGC GTTCTACCGC GCGGCCCGCG AGCCGGAGGA GCCCTCGGTC
GCGTTCGCCG ACGTGCTGCA GGGCAAGGAG CTCAGCTACA ACAACCTGCT CGACCTGCAG
GCCGCGCTCG CCGGCGTGAT GGAGTTCGAC GAGACCGCCT GCGTGATCAT CAAGCACAAC
ACGCCCTGCG GCGTCTCCAC CGGCCGCACC GCGGGCGAGG CGTTCGCGCG CGCCCGCGAG
TGCGATCCGG TCTCGGCGTT CGGCGGCATC GTGGCGCTGA ACCGCCCCGT GGACGAGGCC
ACCGCCTCGG AGCTCACCAG CCTGTTCCTC GAGTGCGTGA TCGCGCCCGG CTACGACGCC
GCCGCCCGCG CCGCGCTCGC GGTGAAGAAG AACCTCCGCC TGCTCGAGGC GCCGCGGCTC
GGCGCCGCGC GCGCCACCTG GCGGCGGCGC CCCGAGGAGG GGCGCGAGCT CCGCTCCATC
CCCGGCGGCC TGCTGGTGAT GGACCGCGAC CTCGGCTCGG TCCGCCGCGA CGACTGCAAG
GTGATGACGA AGCGCGCGCC CACCGAGCAG GAGTGGAAGG ACCTGCTGTT CGCGTGGAAG
GTCGTGAAGC ACGTGAAGTC GAACGCCATC GTGTTCGCGA AGGACGACCG GACCGTGGCG
ATCGGCGGCG GTCAGACCAG CCGGGTGGAG TCGGTGAAGA CCGCGGTGAT GAAGGCGGCG
CTCGACGTCC GCGGCTCCTC GGTGGGCTCC GACGCGTTCT TCCCGTTCGC CGACGGCGTC
GAGGAGATCA TCAAGGCCGG CGCCACCGCC ATCATCCAGC CCGGCGGCTC GATGCGCGAC
GCCGAGGTGA TCGCCGCGGC CGACAAGGCC GGCATCGCCA TGGTCGCGAC CGGCATGCGG
CACTTCCGGC ACTGA
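
The GC content field of this record can be checked against the gene sequence. The helper below is a minimal sketch that ignores the spaces and line breaks used in the display format and returns the fraction of G and C bases. How the database rounds the value it reports is not stated here, so any rounding applied to the result is an assumption.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters are skipped, so the sequence can be
    pasted in the 10-base blocks shown in this record.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return sum(b in "GC" for b in bases) / len(bases)


# First display line of the gene sequence; paste the full sequence to check the whole gene.
first_line = "ATGACTCGCC GCGCGCTCGT CTCGGTCTCC GACAAGACCG GTCTCGTTCC TTTCGCCAGG"
print(f"{gc_content(first_line):.0%}")
```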
 
Protein sequence
MTRRALVSVS DKTGLVPFAR RLAALGVELL STGGTQKTLA EAGVPVVGVG DYTQAPEILG 
GRVKTLHPRV HGGILYRRGL ASDEADVKAR DIPPIDLVVV NLYPFREAVA AGKPFETCVE
EIDIGGPTMV RSAAKNSAHV GVVVDPADYE KVAAELEATR TLSAATRFYL MKKAFAHTAA
YDAAISEYLT AREAPEAAPA HFPATLAAVY TKAYDLRYGE NPHQAGAFYR AAREPEEPSV
AFADVLQGKE LSYNNLLDLQ AALAGVMEFD ETACVIIKHN TPCGVSTGRT AGEAFARARE
CDPVSAFGGI VALNRPVDEA TASELTSLFL ECVIAPGYDA AARAALAVKK NLRLLEAPRL
GAARATWRRR PEEGRELRSI PGGLLVMDRD LGSVRRDDCK VMTKRAPTEQ EWKDLLFAWK
VVKHVKSNAI VFAKDDRTVA IGGGQTSRVE SVKTAVMKAA LDVRGSSVGS DAFFPFADGV
EEIIKAGATA IIQPGGSMRD AEVIAAADKA GIAMVATGMR HFRH
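
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds a codon table from the standard amino-acid assignments, which table 11 shares with the standard code (the two differ in which start codons are permitted). This gene starts with ATG, so no special start-codon handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first; a codon containing anything other than
    A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACTCGCC GCGCGCTCGT CTCGGTCTCC"))  # MTRRALVSVS
```

The output matches the first ten residues of the protein sequence above.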