Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3915 |
Symbol | purH |
ID | 6982679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4061273 |
End bp | 4062889 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643398638 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002283403 |
Protein GI | 209551486 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.230829 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTTA TTTCCAAGAA GATCCCCGCC CCCGACAAGG TCGAAATCAA GACCGCCCTC CTCTCCGTCT TCGACAAGAC CGGGATCGTC GAACTCGCCC AGGCACTGTC GGAACAGGGC GTGCGGCTGC TGTCGACCGG CGGCACCTAC AAGGCGATCG CCGCCGCCGG CCTTGCCGTT ACCGATGTTT CCGAAATTAC CGGCTTTCCC GAGATCATGG ACGGGCGGGT CAAGACGCTG CATCCGACGG TGCATGGCGG CCTGCTGGCG ATCCGCGACG ATAGCGAACA CCAGGAGGCG ATGAAAACCC ACGGCATCGA GGCCATCGAC CTCGCCGTCA TCAACCTCTA TCCCTTCGAA GACGTGCGCG CCGCCGGCGG CGATTATCCG ACGACCGTCG AAAATATCGA CATCGGCGGC CCGGCGATGA TCCGCGCTTC GGCCAAGAAC CATGCCTATG TGACGATCCT GACCGATCCG AACGACTATG CCGAATTCAC CGAGCAGCTT TCCGCGGATG GTGGCAAGAC CGCCTACGCC TTCCGGCAGC GCATGGCCGC CAAGGCCTAT GCCCGCACCG CGGCCTATGA CGCTGTGATT TCCAACTGGT TCGCAGAAGC GCTGTCGATC GACACGCCGC GCCACCGCGT TATCGGCGGT GCGCTGAAGG AAGAGATGCG TTACGGCGAA AATCCGCACC AGAAGGCCGC CTTTTACGTC ACCGGCGAGA AGCGCCCGGG CGTTTCGACG GCTGCCCTCC TCCAGGGCAA GCAGCTCTCC TACAACAATA TCAACGATAC CGATGCCGCT TACGAGCTGG TCGCCGAGTT CCTGCCGGAA AAGGAGCCGG CCTGCGCCAT CATCAAACAT GCCAATCCCT GTGGCGTCGC CACCGGGTCG AGCCTGGTCG AGGCCTATCG GCGGGCGTTG GCCTGCGACA GCGTTTCCGC CTTCGGCGGC ATCATTGCAC TCAATCGGAC GCTGGATGCC GAAACGGCTG AGGAGATCGT CAAGCTCTTC ACCGAAGTGA TCATCGCGCC CGATGTGACC GAGGAGGCGA AGGCGATCAT CGCCCGCAAG CCGAACCTGC GGCTGCTGTC GGCCGGCGGC CTGCCCGATC CGCGTGCCGC GGGCCTGACG GCGAAGACCG TTTCCGGCGG CCTGCTGGTC CAGAGCCGCG ACAACGGCAT GGTCGAGGAT CTGGAGCTCA AGGTCGTCAC CAAGCGCGCG CCGACGGCTC AGGAGCTTGA TGATATGAAG TTCGCCTTCA AGATCGGCAA ACACGTGAAA TCGAACGCCG TGGTCTATGC CAAGGACGGC CAGACCGCCG GCATCGGCGC CGGCCAGATG AGCCGGGTCG ATTCTGCCCG TATCGCCGCG CTGAAGGCGG AAGAAGCCGC CAAGGCGCTC GGCCTTGCCG TGCCGATGAC GCATGGCTCG GCAGTCGCCT CCGAAGCCTT CCTGCCGTTT GCCGACGGTC TCTTGTCGAT GATCGCAGCG GGGGCGACGG CGGTGATCCA GCCTGGCGGT TCGATGCGCG ACCAGGAAGT CATCGATGCC GCCGACGAAC ACGGCATTGC GATGGTCTTT ACCGGCATGC GCCATTTCCG GCACTGA
|
Protein sequence | MAVISKKIPA PDKVEIKTAL LSVFDKTGIV ELAQALSEQG VRLLSTGGTY KAIAAAGLAV TDVSEITGFP EIMDGRVKTL HPTVHGGLLA IRDDSEHQEA MKTHGIEAID LAVINLYPFE DVRAAGGDYP TTVENIDIGG PAMIRASAKN HAYVTILTDP NDYAEFTEQL SADGGKTAYA FRQRMAAKAY ARTAAYDAVI SNWFAEALSI DTPRHRVIGG ALKEEMRYGE NPHQKAAFYV TGEKRPGVST AALLQGKQLS YNNINDTDAA YELVAEFLPE KEPACAIIKH ANPCGVATGS SLVEAYRRAL ACDSVSAFGG IIALNRTLDA ETAEEIVKLF TEVIIAPDVT EEAKAIIARK PNLRLLSAGG LPDPRAAGLT AKTVSGGLLV QSRDNGMVED LELKVVTKRA PTAQELDDMK FAFKIGKHVK SNAVVYAKDG QTAGIGAGQM SRVDSARIAA LKAEEAAKAL GLAVPMTHGS AVASEAFLPF ADGLLSMIAA GATAVIQPGG SMRDQEVIDA ADEHGIAMVF TGMRHFRH
|
| |