Gene Dtpsy_2907 details


Gene Information

Locus tag: Dtpsy_2907
Symbol: purH
ID: 7384092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax ebreus TPSY
Kingdom: Bacteria
Replicon accession: NC_011992
Strand:
Start bp: 3093251
End bp: 3094852
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 68%
IMG OID: 643656217
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002554341
Protein GI: 222112077
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
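
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length, assuming the CDS ends with one uncounted stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3093251, 3094852)
print(length)                     # 1602
print(protein_length_aa(length))  # 533
```

Both results match the Gene Length and Protein Length fields listed in the record.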


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCAC TCCTTTCCGT CTCCGACAAG ACCGGCATCG TCGAATTTGC CCAGGCACTG 
CATGCGCTGG GCATCCGCCT GCTGTCCACC GGCGGCACCG CCAAGCTGCT GGCCGAGAGC
GGCCTGCCCG TCACCGAGGT GGCCGAGGTC ACGCAGTTCC CCGAGATGCT GGACGGCCGC
GTGAAGACGC TGCACCCCAA GGTGCATGGC GGCCTGCTGG CGCGCCGCGA GCTGCCTGCG
CACATGGCGG CGCTGAAGGA GCACGGCATC GACACCATCG ATCTGCTGGT GGTCAACCTG
TACCCGTTCG AGGCCACGGT GGCCAAGGCC GGCTGCACGC TGGCCGACGC CATCGAGAAC
ATCGACATCG GCGGCCCCGC CATGGTGCGC AGCGCCGCCA AGAACTGGAA GGACGTGGGC
GTGGTCACCT CGGCTGACCA GTACGAGGCG GTGCTGGGCG AGTTGAAGGC CGCGGGCAAG
CTGTCCGACA AGCTGCGCTT CACGCTGTCG GTGGCGGCGT TCAACCGCAT CGCGCAGTAC
GACGGCGCCA TCAGCGACTA CCTGTCGTCC ATCCAGTTCG ACGAGGCCAA GCTGTCCGAG
AGCTACGTGC CCGAACGCGC GCTGTTCCCC GGCCAGAGCA ACGGCATCTT CACCAAGATC
CAGGACCTGC GCTACGGCGA GAACAGCCAC CAGCAGGCCG CGCTGTACCG CGACCTGCAC
CCTGCGCCCG GCTCCATCGT CACCGGCGTG CAGCTGCAGG GCAAGGAACT CTCGTACAAC
AACATCGCCG ACGCCGACGC CGCCTGGGAA TGCGTCAAGA GCTTCAAGCT GCCGGCCTGC
GTGATCGTCA AGCACGCCAA CCCCTGCGGC GTGGCCGTGG GCACGAGCGC GCTGGAGGCC
TACAGCAAGG CCTTCCAGAC CGACCCGACG AGCGCCTTCG GCGGCATCAT CGCGCTGAAC
CGCCCCGTGG ACGGCGCGGC CGCGCAGCAG ATCGCCAAGC AGTTCGTCGA AGTGCTGATG
GCGCCCGACT TCACGCCCGA GGCGCTGGAG GTGTTCAAGG CCAAGGCCAA CGTGCGCCTG
ATGAAGATCG CGTTGCCTGC CTCCGGCGGT GCCACGGCGT GGGAGCAGGG GCGCAACCTG
ATGGACGCCA AGCGCGTGGG CTCGGGCCTG CTGCTGCAGA CGGCCGACAA CCATGAGCTG
CAACTGCCCG ATGTGAAGGT GGTGACCCTC AAGCAGCCCA CGCAGGAAGA GATGCAGGAC
CTGCTGTTCG CCTGGAAGGT GGCCAAGTAC GTCAAGAGCA ACGCCATCGT GTTCGTGAAG
GGCGGCATGA CCATGGGCGT GGGTGCTGGC CAGATGAGCC GGCTGGATTC GGCGCGCATC
GCCAGCATCA AGGCGCAGGC CGCGGGCCTG TCCCTGCAGA ACACCGTGGT GGCCAGCGAC
GCCTTCTTCC CGTTCCGCGA TGGGCTGGAC GTGGTGGTCG ACGCGGGCGC GACCTGCGTG
GCTCAGCCCG GCGGCTCCAT GCGCGACCAG GAGGTCATCG ACGCGGCCAA CGAGCGCGGC
GTGGCCATGG TCTTCACGGG CGTGCGCCAC TTCCGTCACT GA
 
Protein sequence
MNALLSVSDK TGIVEFAQAL HALGIRLLST GGTAKLLAES GLPVTEVAEV TQFPEMLDGR 
VKTLHPKVHG GLLARRELPA HMAALKEHGI DTIDLLVVNL YPFEATVAKA GCTLADAIEN
IDIGGPAMVR SAAKNWKDVG VVTSADQYEA VLGELKAAGK LSDKLRFTLS VAAFNRIAQY
DGAISDYLSS IQFDEAKLSE SYVPERALFP GQSNGIFTKI QDLRYGENSH QQAALYRDLH
PAPGSIVTGV QLQGKELSYN NIADADAAWE CVKSFKLPAC VIVKHANPCG VAVGTSALEA
YSKAFQTDPT SAFGGIIALN RPVDGAAAQQ IAKQFVEVLM APDFTPEALE VFKAKANVRL
MKIALPASGG ATAWEQGRNL MDAKRVGSGL LLQTADNHEL QLPDVKVVTL KQPTQEEMQD
LLFAWKVAKY VKSNAIVFVK GGMTMGVGAG QMSRLDSARI ASIKAQAAGL SLQNTVVASD
AFFPFRDGLD VVVDAGATCV AQPGGSMRDQ EVIDAANERG VAMVFTGVRH FRH
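
The sequences above are printed in blocks of 10 characters, 60 per line. A minimal sketch of how such a listing could be rejoined into a contiguous string and its GC percentage computed is shown below. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so blocked sequence lines become one string."""
    return "".join(text.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, as printed in the record.
first_line = (
    "ATGAACGCAC TCCTTTCCGT CTCCGACAAG ACCGGCATCG TCGAATTTGC CCAGGCACTG"
)
sequence = join_blocks(first_line)
print(len(sequence))  # 60
print(round(gc_percent(sequence), 1))
```

Applying the same two helpers to the full gene listing should give a length equal to the Gene Length field and a GC percentage comparable to the reported GC content.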