Gene Information |
Locus tag | Smed_3189 |
Symbol | purH |
ID | 5324068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3358668 |
End bp | 3360278 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640792137 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001328848 |
Protein GI | 150398381 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
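The coordinate and length fields above agree with one another, and a little arithmetic confirms it. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, though its numbers fit both. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Smed_3189 (purH).
length_bp = gene_length(3358668, 3360278)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1611 536
```

Both results match the Gene Length (1611 bp) and Protein Length (536 aa) fields listed in the record.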
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTCG CCTCCAAGAA AATTCCCGCC CCGGACGAAG TCCGGATCAA AACCGCCCTC CTTTCGGTCT CCGACAAATC GGGCATCGTC GAACTCGCCC GCCACCTCAA TGACAGGGGC GTGCGGCTGG TATCGACCGG CGGGACGCAC AAGGCGCTTG CCGATGCGGG CCTTCCCGTC AGCGACGTTT CGGAGCTGAC GGGGTTTCCG GAGATCATGG ACGGTCGCGT GAAAACCCTT CATCCCGGCG TTCACGGCGG TCTGCTGGCT ATTCGCGACG ATGCGGAGCA TGCCGGCGCG ATGAGCGCAC ACGGTATTAC AGCGATCGAT CTCGCCGTCA TCAATCTCTA CCCCTTCGAG GAGGTTCGCG CCAAGGGCGG CGACTATCCG ACCACGGTCG AGAATATCGA CATTGGCGGT CCCGCAATGA TCCGGGCGTC GGCCAAGAAC CACGCCTATG TGACCGTTGT GACCGACCCG GCGGATTATC CGCTGCTGCT GGAGGAGATC GCGGGCGGCA CGACGCGCTA TGCCTTCCGC CAGAAAATGG CCGCCAAGGC CTATGCCCGC ACAGCGGCCT ATGATGCGGC AATCTCCAAC TGGTTCGCCG AGGTACTCGA CACGCCCATG CCGCGCCACC GCGTCATCGG CGGGGTGCTC AAGGAAGAGA TGCGCTACGG CGAAAACCCG CATCAGAAGG CCGGTTTCTA TGTGACCGGT GACAAGCGGC CGGGGGTCGC CACTGCAGCG CTTCTCCAAG GCAAGCAGCT CTCCTACAAC AACATCAACG ACACCGACGC CGCCTTCGAG CTGGTGGCGG AATTCCTGCC GGAAAAGGCG CCTGCCTGCG CCATTATCAA GCATGCCAAC CCCTGCGGCG TTGCGACCGC GCCATCGCTC GCGGAAGCAT ATCGCCGGGC GCTTGCCTGT GATTCGACCT CCGCTTTCGG CGGCATTATC GCGCTTAATC AGGAACTCGA CGCGGCGACC GCCGAAGAGA TCGTGAAGCT CTTCACCGAA GTGATCATCG CCCCGTCGGT CAGCGACGAG GCGAAAGCGA TCATCGCCCG GAAGCCCAAT CTGCGGCTGC TTGCGACCGG CGGCCTGCCG GATCCGCGCA CGCCCGGTCT GACGGCAAAG ACGGTGGCCG GGGGCCTTCT TGTCCAGACG CGCGACGACG GCATGATCGA AGACATCGAA CTGAAGGTGG TCACGAAGCG CACGCCGACG GCGCAGGAGC TCGAAGACAT GAAATTTGCC TTCAAGGTGG CCAAGCACGT CAAGTCGAAT GCCGTCGTCT ACGCGAAAGG CGGTCAGACG GCGGGTATCG GCGCCGGACA GATGAGCCGG GTCGATTCCG CGAGAATTGC TGCCATCAAG GCGGAAGAGG CGGCGAAGGC GCTCGGTCTC GCCGAGCCTC TGACACGCGG CTCCGCGGTT GCCTCGGAAG CCTTCCTGCC GTTCGCTGAC GGCCTTCTGT CCGCGATCGC TGCGGGGGCC ACCGCAGTGA TCCAGCCGGG CGGCTCCATG CGCGACGAGG AGGTGATCGC AGCGGCCGAC GAGCACAATG TCGCGATGGT CTTCACCGGG ATGCGGCATT TCCGGCACTG A
|
Protein sequence | MAVASKKIPA PDEVRIKTAL LSVSDKSGIV ELARHLNDRG VRLVSTGGTH KALADAGLPV SDVSELTGFP EIMDGRVKTL HPGVHGGLLA IRDDAEHAGA MSAHGITAID LAVINLYPFE EVRAKGGDYP TTVENIDIGG PAMIRASAKN HAYVTVVTDP ADYPLLLEEI AGGTTRYAFR QKMAAKAYAR TAAYDAAISN WFAEVLDTPM PRHRVIGGVL KEEMRYGENP HQKAGFYVTG DKRPGVATAA LLQGKQLSYN NINDTDAAFE LVAEFLPEKA PACAIIKHAN PCGVATAPSL AEAYRRALAC DSTSAFGGII ALNQELDAAT AEEIVKLFTE VIIAPSVSDE AKAIIARKPN LRLLATGGLP DPRTPGLTAK TVAGGLLVQT RDDGMIEDIE LKVVTKRTPT AQELEDMKFA FKVAKHVKSN AVVYAKGGQT AGIGAGQMSR VDSARIAAIK AEEAAKALGL AEPLTRGSAV ASEAFLPFAD GLLSAIAAGA TAVIQPGGSM RDEEVIAAAD EHNVAMVFTG MRHFRH
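The sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be stripped before any analysis. The sketch below uses only the Python standard library and hypothetical helper names. It removes the block spacing, computes a GC fraction, and makes a basic check for a complete coding sequence. The stop codons it tests for (TAA, TAG, TGA) are those of translation table 11, the table named in the record. The sample string is only the first three blocks of the gene sequence, not the whole gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the displayed sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First three blocks of the gene sequence shown in the record.
sample = clean_sequence("ATGGCTGTCG CCTCCAAGAA AATTCCCGCC")
print(len(sample), round(gc_fraction(sample), 3))
```

Passing the full Gene sequence field through `clean_sequence` gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields of the record.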