Gene Vapar_4630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_4630 
SymbolpurH 
ID7972840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4920509 
End bp4922116 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content67% 
IMG OID644795214 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002946501 
Protein GI239817591 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.564581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCAGA CCGCACTCAT CTCCGTCTCC GACAAAACCG GCATCCTCGA ATTCGCGCAA 
GCGCTGCATG CGCTGGGCAT CAAGCTGCTG TCCACCGGCG GCACCGCCAA GCTGCTGGCC
GATGCCGGCC TGCCCGTGAC CGAAGTGGCC GACCACACCG GCTTTCCCGA AATGCTCGAC
GGCCGCGTGA AGACGCTGCA CCCCAAGATC CATGGCGGCC TGCTCGCGCG GCGCGACCTG
CCCGCGCACG TGGCGGCCAT CCAGGAACAC GGCATCGACA CCATCGACCT GCTGGTGGTC
AATCTCTATC CGTTCGAAGC CACGGTGGCC AAGGCCGGCT GCACGCTCGA AGACGCAATC
GAGAACATCG ACATCGGCGG ACCGGCCATG GTGCGCAGCG CGGCCAAGAA CTGGAAGGAC
GTGGGCGTGC TGACCGACGC CTCGCAGTAC GCCGTGGCGC TGGCCGAACT CCAGGCCGGC
GGCAAGCTCA GCGACAAGAC CAAGTTCGCG TTCTCGGTGG CCGCGTTCAA CCGCATCGCC
GACTACGACG GTGCCATCAG CGACTATCTC TCGGCCATCG ACTTCGACGC CAGCATCGGC
CAGGCTTCGC CCACGCGCTC GATGTTCCCG GCGCAAAGCA ACGGCCGCTT CGTGAAGGTG
CAGGACCTGC GCTACGGCGA GAACCCGCAC CAGCAGGCCG CGTTCTACCG CGACCTGCAT
CCGGCGCCCG GCTCGCTGGT GTCGGCGAAG CAACTGCAGG GCAAGGAGCT CAGCTACAAC
AACATCGCCG ATGCCGACGC CGCATGGGAA TGCGTGAAGA GCTTCGACGT GCCCGCGTGC
GTGATCGTCA AGCACGCCAA CCCCTGCGGC GTGGCCGTGG GCAAGGACGC GGCCGAAGCC
TACGGCAAGG CCTTCAAGAC CGACCCGACC TCGGCCTTCG GCGGCATCAT CGCCTTCAAC
CGCCCGGTCG ATGGCGAGAC CGCGCAGGCC ATTGCCAAGC AGTTCGTCGA AGTGCTGATG
GCGCCGGGCT ACACGCCCGA GGCGCTCGCC GTGTTCCAGG CCACCAAGGT CAAGCAGAAC
GTGCGCGTGC TCGAGATCGC ACTGCCGCCG GGCGGCACCA CCGACTGGGA CAACGGCCGC
AACCTCATGG ACGTCAAGCG CGTCGGTTCG GGCCTGTTGA TGCAGACCGC CGACAACCAC
GAGCTCGCGG CGAGCGACCT CAAGGTGGTC ACGAAGAAGC AGCCCACGCC CGAGCAACTG
CAGGACCTGC TGTTCGCATG GAAGGTCGCC AAGTACGTGA AGAGCAACGC CATCGTGTTC
TGCGCCGGCG GCATGACCAT GGGCGTGGGC GCGGGCCAGA TGAGCCGCCT CGACTCCGCG
CGCATCGCGA GCATCAAGGC CGAGCATGCG GGCCTCTCGC TGAAGGGCAC GGCGGTGGCG
AGCGACGCCT TCTTCCCGTT CCGCGACGGG CTCGACGTGG TGGTCGATGC CGGCGCGAGC
TGCGTGATCC AGCCGGGCGG CTCGATGCGC GACCAGGAAG TGATTGATGC CGCCGACGAG
CGCGGCGTGG TCATGGTGCT CTCGGGCGTG CGCCACTTCC GGCACTGA
 
Protein sequence
MAQTALISVS DKTGILEFAQ ALHALGIKLL STGGTAKLLA DAGLPVTEVA DHTGFPEMLD 
GRVKTLHPKI HGGLLARRDL PAHVAAIQEH GIDTIDLLVV NLYPFEATVA KAGCTLEDAI
ENIDIGGPAM VRSAAKNWKD VGVLTDASQY AVALAELQAG GKLSDKTKFA FSVAAFNRIA
DYDGAISDYL SAIDFDASIG QASPTRSMFP AQSNGRFVKV QDLRYGENPH QQAAFYRDLH
PAPGSLVSAK QLQGKELSYN NIADADAAWE CVKSFDVPAC VIVKHANPCG VAVGKDAAEA
YGKAFKTDPT SAFGGIIAFN RPVDGETAQA IAKQFVEVLM APGYTPEALA VFQATKVKQN
VRVLEIALPP GGTTDWDNGR NLMDVKRVGS GLLMQTADNH ELAASDLKVV TKKQPTPEQL
QDLLFAWKVA KYVKSNAIVF CAGGMTMGVG AGQMSRLDSA RIASIKAEHA GLSLKGTAVA
SDAFFPFRDG LDVVVDAGAS CVIQPGGSMR DQEVIDAADE RGVVMVLSGV RHFRH