Gene Daci_1564 details


Gene Information

Locus tag: Daci_1564
Symbol: purH
ID: 5747121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Delftia acidovorans SPH-1
Kingdom: Bacteria
Replicon accession: NC_010002
Strand:
Start bp: 1755476
End bp: 1757074
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 67%
IMG OID: 641296645
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001562593
Protein GI: 160897011
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
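
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions the record does not state: that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1755476
end_bp = 1757074
gene_length_bp = 1599
protein_length_aa = 532

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so the protein
# has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
# prints: 1599 533 0 532
```

Under those assumptions the span matches the listed gene length, the length is a whole number of codons, and 533 codons minus one stop codon gives the listed 532 aa.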


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0949908
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.593049
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCC TGCTCTCCGT CTCTGACAAG ACCGGTATCG TCGAATTCGC CCAAGGCCTG 
CACGCCCTGG GCATCAAGCT GCTGTCCACC GGCGGCACCG CCAAGCTGCT GGCCGAGGCC
GGCCTGCCCG TGACCGAGGT GGCCGAAGTC ACCCAGTTCC CCGAAATGCT GGACGGCCGC
GTCAAGACCC TGCACCCCAA GGTCCACGGC GGCCTGCTGG CGCGCCGCGA GCTGCCCGCC
CACATGGCGG CCCTCAAGGA GCACGGCATC GACACCATCG ACCTGCTGGT GGTGAACCTG
TACCCCTTCG AGGCCACCGT GGCCAAGGCC GGCTGCACGC TGGCCGACGC CATCGAGAAC
ATCGACATCG GCGGCCCGGC CATGGTGCGC TCTGCCGCCA AGAACTGGGC CGACGTGGGC
GTGATCACCG ACGCCGGCCA GTACGAGGCC GTGCTGGGTG AACTCAAGGC CAGCGGAAAG
CTGTCCGACA AGCTGCGCTT TGCGCTGTCC GTGGCCGCCT TCAACCGCAT CGCCCAGTAC
GACGGCGCCA TCAGCGACTA CCTGTCCAGC GTGAAGTTCG AGGACGAGAA GCTCTCCGAA
GCCTACGTGC CCGAGCGCAG CCCCTTCCCC GGCCAGAGCA ACGGCCACTT CACCAAGGTG
CAGGACCTGC GCTACGGCGA AAACAGCCAC CAGCAGGCCG CCCTGTACCG CGACCTGTAT
CCGGCTCCCG GCTCCCTGGT CACGGGCGAG CAGCTGCAGG GCAAGGAGCT GTCGTACAAC
AACATCGCCG ACGCCGACGC TGCCTGGGAA TGCGTCAAGA GCTTCGAGCA GCCGGCCTGC
GTGATCGTCA AGCACGCCAA CCCCTGCGGC GTGGCCGTGG GCAAGGACGC GCACGAAGCC
TATGCCAAGG CCTTCCAGAC CGACCCCACC AGCGCCTTCG GCGGCATCAT CGCCTTCAAC
CGCACGGTGG ACAAGGCAGC GGCCGAGGCC GTGGTCAAGC AGTTCGTCGA GGTGCTGATG
GCCCCCGACT TCACGTCCGA GGCGCTGGAA ATCTTCAAGC CCAAGGTCAA TGTGCGCCTG
ATGAAGATCG CCCTGCCTGC CGGCGGCGAG CGCGCCTGGG ACCAGGGCCG CAACGCCATG
GACGCCAAGC GCGTCGGCTC GGGCCTGCTG CTGCAGACCG CTGACAACCA TGAACTGGCC
CTGGCCGACC TCAAGGTCGT CACCGTCAAG CAGCCTACCC CCGAAGAACT GCAGGACCTG
CTGTTCGCCT GGAAGGTCGC CAAGTACGTC AAGAGCAATG CCATCGTGTT CTGCAAGAAC
GGCATGACCA TGGGCGTGGG CGCAGGCCAG ATGAGCCGCC TGGACTCGGC ACGCATCGCC
TCCATCAAGG CCGAGGCCGC CAAGCTGAGC CTGCAGGGCA CCGTCGTGGC CAGCGATGCC
TTCTTCCCCT TCCGCGACGG CCTGGACGTG GTCGTTGACG CAGGCGCCAC CTGCGTGGCC
CAGCCCGGCG GCTCCATGCG CGACCAGGAA GTCATTGACG CCGCCAACGA GCGCGGCGTC
GCCATGGTCT TCACCGGCGT GCGCCACTTC CGCCACTGA
 
Protein sequence
MNALLSVSDK TGIVEFAQGL HALGIKLLST GGTAKLLAEA GLPVTEVAEV TQFPEMLDGR 
VKTLHPKVHG GLLARRELPA HMAALKEHGI DTIDLLVVNL YPFEATVAKA GCTLADAIEN
IDIGGPAMVR SAAKNWADVG VITDAGQYEA VLGELKASGK LSDKLRFALS VAAFNRIAQY
DGAISDYLSS VKFEDEKLSE AYVPERSPFP GQSNGHFTKV QDLRYGENSH QQAALYRDLY
PAPGSLVTGE QLQGKELSYN NIADADAAWE CVKSFEQPAC VIVKHANPCG VAVGKDAHEA
YAKAFQTDPT SAFGGIIAFN RTVDKAAAEA VVKQFVEVLM APDFTSEALE IFKPKVNVRL
MKIALPAGGE RAWDQGRNAM DAKRVGSGLL LQTADNHELA LADLKVVTVK QPTPEELQDL
LFAWKVAKYV KSNAIVFCKN GMTMGVGAGQ MSRLDSARIA SIKAEAAKLS LQGTVVASDA
FFPFRDGLDV VVDAGATCVA QPGGSMRDQE VIDAANERGV AMVFTGVRHF RH
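
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It assumes the sense-codon assignments of translation table 11, which match the standard genetic code, and it does not model table 11's alternative start codons (not needed here, since this gene starts with ATG). The function names `clean`, `translate` and `gc_fraction` are hypothetical helpers written for this example.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 sense codons equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGAACGCCC TGCTCTCCGT CTCTGACAAG ACCGGTATCG TCGAATTCGC CCAAGGCCTG"
last_row = "GCCATGGTCT TCACCGGCGT GCGCCACTTC CGCCACTGA"

print(translate(first_row))  # MNALLSVSDKTGIVEFAQGL
print(translate(last_row))   # AMVFTGVRHFRH
```

Both rows are in frame (each full row holds 60 bases, i.e. 20 codons), so they translate to the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.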