Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_4630 |
Symbol | purH |
ID | 7972840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 4920509 |
End bp | 4922116 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644795214 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002946501 |
Protein GI | 239817591 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.564581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCAGA CCGCACTCAT CTCCGTCTCC GACAAAACCG GCATCCTCGA ATTCGCGCAA GCGCTGCATG CGCTGGGCAT CAAGCTGCTG TCCACCGGCG GCACCGCCAA GCTGCTGGCC GATGCCGGCC TGCCCGTGAC CGAAGTGGCC GACCACACCG GCTTTCCCGA AATGCTCGAC GGCCGCGTGA AGACGCTGCA CCCCAAGATC CATGGCGGCC TGCTCGCGCG GCGCGACCTG CCCGCGCACG TGGCGGCCAT CCAGGAACAC GGCATCGACA CCATCGACCT GCTGGTGGTC AATCTCTATC CGTTCGAAGC CACGGTGGCC AAGGCCGGCT GCACGCTCGA AGACGCAATC GAGAACATCG ACATCGGCGG ACCGGCCATG GTGCGCAGCG CGGCCAAGAA CTGGAAGGAC GTGGGCGTGC TGACCGACGC CTCGCAGTAC GCCGTGGCGC TGGCCGAACT CCAGGCCGGC GGCAAGCTCA GCGACAAGAC CAAGTTCGCG TTCTCGGTGG CCGCGTTCAA CCGCATCGCC GACTACGACG GTGCCATCAG CGACTATCTC TCGGCCATCG ACTTCGACGC CAGCATCGGC CAGGCTTCGC CCACGCGCTC GATGTTCCCG GCGCAAAGCA ACGGCCGCTT CGTGAAGGTG CAGGACCTGC GCTACGGCGA GAACCCGCAC CAGCAGGCCG CGTTCTACCG CGACCTGCAT CCGGCGCCCG GCTCGCTGGT GTCGGCGAAG CAACTGCAGG GCAAGGAGCT CAGCTACAAC AACATCGCCG ATGCCGACGC CGCATGGGAA TGCGTGAAGA GCTTCGACGT GCCCGCGTGC GTGATCGTCA AGCACGCCAA CCCCTGCGGC GTGGCCGTGG GCAAGGACGC GGCCGAAGCC TACGGCAAGG CCTTCAAGAC CGACCCGACC TCGGCCTTCG GCGGCATCAT CGCCTTCAAC CGCCCGGTCG ATGGCGAGAC CGCGCAGGCC ATTGCCAAGC AGTTCGTCGA AGTGCTGATG GCGCCGGGCT ACACGCCCGA GGCGCTCGCC GTGTTCCAGG CCACCAAGGT CAAGCAGAAC GTGCGCGTGC TCGAGATCGC ACTGCCGCCG GGCGGCACCA CCGACTGGGA CAACGGCCGC AACCTCATGG ACGTCAAGCG CGTCGGTTCG GGCCTGTTGA TGCAGACCGC CGACAACCAC GAGCTCGCGG CGAGCGACCT CAAGGTGGTC ACGAAGAAGC AGCCCACGCC CGAGCAACTG CAGGACCTGC TGTTCGCATG GAAGGTCGCC AAGTACGTGA AGAGCAACGC CATCGTGTTC TGCGCCGGCG GCATGACCAT GGGCGTGGGC GCGGGCCAGA TGAGCCGCCT CGACTCCGCG CGCATCGCGA GCATCAAGGC CGAGCATGCG GGCCTCTCGC TGAAGGGCAC GGCGGTGGCG AGCGACGCCT TCTTCCCGTT CCGCGACGGG CTCGACGTGG TGGTCGATGC CGGCGCGAGC TGCGTGATCC AGCCGGGCGG CTCGATGCGC GACCAGGAAG TGATTGATGC CGCCGACGAG CGCGGCGTGG TCATGGTGCT CTCGGGCGTG CGCCACTTCC GGCACTGA
|
Protein sequence | MAQTALISVS DKTGILEFAQ ALHALGIKLL STGGTAKLLA DAGLPVTEVA DHTGFPEMLD GRVKTLHPKI HGGLLARRDL PAHVAAIQEH GIDTIDLLVV NLYPFEATVA KAGCTLEDAI ENIDIGGPAM VRSAAKNWKD VGVLTDASQY AVALAELQAG GKLSDKTKFA FSVAAFNRIA DYDGAISDYL SAIDFDASIG QASPTRSMFP AQSNGRFVKV QDLRYGENPH QQAAFYRDLH PAPGSLVSAK QLQGKELSYN NIADADAAWE CVKSFDVPAC VIVKHANPCG VAVGKDAAEA YGKAFKTDPT SAFGGIIAFN RPVDGETAQA IAKQFVEVLM APGYTPEALA VFQATKVKQN VRVLEIALPP GGTTDWDNGR NLMDVKRVGS GLLMQTADNH ELAASDLKVV TKKQPTPEQL QDLLFAWKVA KYVKSNAIVF CAGGMTMGVG AGQMSRLDSA RIASIKAEHA GLSLKGTAVA SDAFFPFRDG LDVVVDAGAS CVIQPGGSMR DQEVIDAADE RGVVMVLSGV RHFRH
|
| |