Gene Information |
Locus tag | Rleg_4239 |
Symbol | purH |
ID | 8015022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4339072 |
End bp | 4340688 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826809 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002978018 |
Protein GI | 241206922 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
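The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. The record does not state either convention, though both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4339072
end_bp = 4340688
gene_length_bp = 1617
protein_length_aa = 538


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Because the gene is reported as unspliced, no intron bookkeeping is needed; the span length maps directly onto codons.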
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.156466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCA TTTCCAAGAA GATCCCCGCC CCCGACAAGG TCGAAATCAA GACCGCGCTC ATCTCCGTCT TCGACAAGAC CGGGATCGTC GACCTCGCCC ACGCCTTGTC TGCCAGAGGT GTGCGCCTGC TTTCGACCGG CGGCACCTAT AAAGCGATCA CTGCTGCCGG TCTTGCCGTC ACCGATGTTT CCGAAGTCAC CGGTTTTCCG GAGATCATGG ATGGGCGTGT GAAGACGCTG CATCCGACGG TGCATGGCGG CCTGCTGGCG ATCCGTGACG ACAGCGAACA CCAGGAAGCG ATGAAAACGC ATGGCATCGA GGGCATCGAC CTCGCAGTCA TCAACCTCTA TCCCTTCGAG CAGGTGCGCG CAGCCGGCGG CGATTATCCG ACGACGGTCG AGAATATCGA CATTGGCGGC CCGGCGATGA TCCGCGCATC GGCCAAGAAC CATGCCTATG TGACAACCTT GACCGATCCG GCCGATTATG CCGAGCTGCT GGAGCAGCTT TCCGCAGATG ACGGCAAGAC CGCCTATGCC TTCCGCCAGC GTATGGCTGC CAAAGCCTAT GCCCGCACCG CCGCCTATGA TGCAATGATC TCCAATTGGT TTGCTGAGGC GCTGTCGATC GACACGCCGC GCCACCGGGT CATCGGCGGC GCGCTGAAGG AAGAGATGCG CTACGGCGAA AACCCGCACC AGAAGGCCGC CTTCTACGTA ACCGGCGAGA AGCGTCCGGG TGTTTCGACG GCCGCTCTTC TCCAGGGCAA GCAGCTCTCC TACAACAATA TCAACGATAC GGATGCGGCC TACGAGCTGG TCGCCGAGTT CCTGCCTGAG AGGGCGCCGG CCTGCGCGAT CATCAAGCAT GCCAATCCCT GCGGCGTCGC CACCGGATCG AGCCTGGTCG AGGCCTATCG GCGGGCGCTC GCCTGCGATT CCGTTTCCGC CTTCGGCGGC ATCATCGCGC TGAACCAAAC GCTGGATGCC GAAACGGCCG AAGAGATCGT CAAGCTGTTC ACCGAAGTGA TCATCGCGCC GGATGTCACG GAGGAGGCGA AGGCGATCGT CGCCCGCAAA CCGAACCTGC GACTATTGTC TGCCGGTGGC CTGCCCGATC CGCGTGCCGC GGGCCTGACG GCAAAGACCG TTTCCGGGGG CCTGCTCGTC CAGAGCCGCG ACAACGGCAT GGTCGAGGAT CTGGAACTCA AGGTCGTCAC CAGGCGTGCG CCGACGGCGC AGGAACTTGA TGACATGAAG TTCGCCTTCA AGGTCGGCAA ACATGTGAAG TCGAACGCCG TGGTCTATGC CAAGGACGGC CAGACCGCTG GCATCGGCGC CGGCCAGATG AGCCGGGTCG ATTCCGCCCG CATTGCCGCG CTGAAGGCCG AAGAGGCTGC CAAGGCGCTC GGCCTCGCAG TGCCGATGAC GCATGGCTCG GCGGTCGCCT CCGAAGCCTT CCTGCCTTTT GCCGACGGTC TTCTGTCGAT GATCGCCGCG GGGGCGACGG CGGTTATCCA GCCGGGCGGT TCGATGCGCG ACCAGGAGGT CATCGATGCC GCTAACGAAC ACGGCGTCGC AATGGTCTTT ACCGGCATGC GCCATTTCCG GCACTGA
|
Protein sequence | MAVISKKIPA PDKVEIKTAL ISVFDKTGIV DLAHALSARG VRLLSTGGTY KAITAAGLAV TDVSEVTGFP EIMDGRVKTL HPTVHGGLLA IRDDSEHQEA MKTHGIEGID LAVINLYPFE QVRAAGGDYP TTVENIDIGG PAMIRASAKN HAYVTTLTDP ADYAELLEQL SADDGKTAYA FRQRMAAKAY ARTAAYDAMI SNWFAEALSI DTPRHRVIGG ALKEEMRYGE NPHQKAAFYV TGEKRPGVST AALLQGKQLS YNNINDTDAA YELVAEFLPE RAPACAIIKH ANPCGVATGS SLVEAYRRAL ACDSVSAFGG IIALNQTLDA ETAEEIVKLF TEVIIAPDVT EEAKAIVARK PNLRLLSAGG LPDPRAAGLT AKTVSGGLLV QSRDNGMVED LELKVVTRRA PTAQELDDMK FAFKVGKHVK SNAVVYAKDG QTAGIGAGQM SRVDSARIAA LKAEEAAKAL GLAVPMTHGS AVASEAFLPF ADGLLSMIAA GATAVIQPGG SMRDQEVIDA ANEHGVAMVF TGMRHFRH
|
| |
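The relationship between the two sequence rows can be spot-checked by translating the ends of the gene sequence. This is a minimal sketch, assuming the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand). It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1; the two tables differ in which codons may act as starts. The helper name `translate` is hypothetical and not part of any library.

```python
# Build the standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna):
    """Translate a coding-strand DNA string; '*' marks a stop codon.

    Spaces (as used to group the record's sequence in blocks of ten)
    are ignored. Trailing bases that do not fill a codon are dropped.
    """
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record.
print(translate("ATGGCCGTCA TTTCCAAGAA GATCCCCGCC"))  # MAVISKKIPA

# Last 27 bases of the gene sequence, which end in the TGA stop codon.
print(translate("ACCGGCATGCGCCATTTCCGGCACTGA"))       # TGMRHFRH*
```

Both outputs agree with the first and last blocks of the protein sequence row. Translating the full 1617 bp sequence the same way and removing the trailing `*` should yield a string comparable to the protein row.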