Gene Ajs_3584 details

Gene Information

Locus tag: Ajs_3584
Symbol: purH
ID: 4672846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax sp. JS42
Kingdom: Bacteria
Replicon accession: NC_008782
Strand:
Start bp: 3781361
End bp: 3782962
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 68%
IMG OID: 639840616
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_987772
Protein GI: 121595876
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
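
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3781361
end_bp = 3782962

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1602 533
```

Both results agree with the Gene Length (1602 bp) and Protein Length (533 aa) fields.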


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.390461
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCAC TCCTTTCCGT CTCCGACAAG ACCGGCATCG TCGAATTTGC CCAGGCGCTG 
CATGCGCTGG GCATCCGCCT GCTGTCCACC GGCGGCACCG CCAAGCTGCT GGCCGAGAGC
GGCCTGCCCG TCACCGAGGT GGCCGAGGTC ACGCAGTTCC CCGAGATGCT GGACGGCCGC
GTGAAGACGC TGCACCCCAA GGTGCATGGC GGCCTGCTGG CGCGCCGCGA GCTGCCTGCG
CACATGGCGG CGCTGAAGGA GCACGGCATC GACACCATCG ACCTGCTGGT GGTCAACCTG
TACCCGTTCG AGGCCACGGT GGCCAACGCC GGCTGCACGC TGGCCGACGC CATCGAGAAC
ATCGACATCG GCGGCCCCGC CATGGTGCGC AGCGCCGCCA AGAACTGGAA GGACGTGGGC
GTGGTCACCT CGGCCGACCA GTACGACGCG GTGCTGGGCG AGTTGAAGGC CGCGGGCAAG
CTGTCCGACA AGCTGCGCTT CGCGCTGTCG GTGGCGGCGT TCAACCGCAT CGCGCAGTAC
GACGGCGCCA TCAGCGACTA CCTGTCGTCC ATCCAGTTCG ACGAGGCCAA GCTGTCCGAG
AGCTACGTGC CCGAACGCGC GCTGTTTCCC GGCCAGAGCA ACGGCATCTT CACCAAGATC
CAGGACCTGC GCTACGGCGA GAACAGCCAC CAGCAGGCCG CGCTGTACCG CGACCTGCAC
CCCGCGCCCG GCTCCATCGT CACCGGCGTG CAGCTGCAGG GCAAGGAACT CTCATACAAC
AACATCGCCG ACGCCGACGC GGCCTGGGAA TGCGTCAAGA GCTTCAAGCT GCCGGCCTGC
GTGATCGTCA AGCATGCCAA CCCCTGCGGC GTGGCCGTGG GCACGAGCGC GCTGGAGGCC
TACAGCAAGG CCTTCCAGAC CGACCCGACG AGCGCCTTCG GCGGCATCAT CGCGCTGAAC
CGCCCCGTGG ACGGCGCGGC CGCGCAGCAG ATCGCCAAGC AGTTCGTCGA AGTGCTGATG
GCGCCCGACT TCACGCCCGA GGCGCTGGAG GTGTTCAAGG CCAAGGCCAA CGTGCGCCTG
ATGAAGATCG CGTTGCCTGC CTCCGGCGGT GCCACGGCGT GGGAGCAGGG CCGCAACCTG
ATGGACGCCA AGCGCGTGGG CTCGGGCCTG CTGCTGCAGA CGGCCGACAA CCATGAGCTG
CAACTGCCCG ATGTGAAGGT GGTGACCCTC AAGCAGCCCA CGCAGGAAGA GATGCAGGAC
CTGATGTTCG CCTGGAAGGT GGCCAAGTAC GTCAAGAGCA ACGCCATCGT GTTCGTGAAG
GGCGGCATGA CCATGGGCGT GGGTGCGGGC CAGATGAGCC GGCTGGATTC GGCGCGCATC
GCCAGCATCA AGGCGCAGGC CGCGGGCCTG TCCCTGCAGA ACACCGTGGT GGCCAGCGAC
GCCTTCTTCC CGTTCCGCGA TGGGCTGGAC GTGGTGGTCG ACGCGGGCGC GACCTGCGTG
GCCCAGCCCG GTGGTTCCAT GCGCGACCAG GAGGTCATCG ACGCGGCCAA CGAGCGCGGC
GTGGCCATGG TCTTCACGGG CGTGCGCCAC TTCCGTCACT GA
 
Protein sequence
MNALLSVSDK TGIVEFAQAL HALGIRLLST GGTAKLLAES GLPVTEVAEV TQFPEMLDGR 
VKTLHPKVHG GLLARRELPA HMAALKEHGI DTIDLLVVNL YPFEATVANA GCTLADAIEN
IDIGGPAMVR SAAKNWKDVG VVTSADQYDA VLGELKAAGK LSDKLRFALS VAAFNRIAQY
DGAISDYLSS IQFDEAKLSE SYVPERALFP GQSNGIFTKI QDLRYGENSH QQAALYRDLH
PAPGSIVTGV QLQGKELSYN NIADADAAWE CVKSFKLPAC VIVKHANPCG VAVGTSALEA
YSKAFQTDPT SAFGGIIALN RPVDGAAAQQ IAKQFVEVLM APDFTPEALE VFKAKANVRL
MKIALPASGG ATAWEQGRNL MDAKRVGSGL LLQTADNHEL QLPDVKVVTL KQPTQEEMQD
LMFAWKVAKY VKSNAIVFVK GGMTMGVGAG QMSRLDSARI ASIKAQAAGL SLQNTVVASD
AFFPFRDGLD VVVDAGATCV AQPGGSMRDQ EVIDAANERG VAMVFTGVRH FRH
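
The sequences above are laid out in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, translated, and checked for GC content. The function names (`clean`, `translate`, `gc_fraction`) are hypothetical helpers, not part of any database tool. The codon table uses the standard amino acid assignments, which translation table 11 shares; table 11 differs only in its permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ...
# (first base varies slowest). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
first_blocks = clean("ATGAATGCAC TCCTTTCCGT CTCCGACAAG")
print(translate(first_blocks))  # prints: MNALLSVSDK
```

The printed residues match the first block of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.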