Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0657 |
Symbol | purH |
ID | 3902991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 751457 |
End bp | 753103 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637877990 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_479770 |
Protein GI | 86739370 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0944398 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCAT CGGGCGAAGG TGTGGCCGGC GAGGGCGGGG CTGTCCGCGG ATCCGAACCC GGCGAAGTGG TCCCCACCGG GCGGCGGCCG CTGCGACGGG CGCTTGTCAG CGTCTACGAC AAGAGCGGGC TCGACGTCCT CGCGGAGGCG TTCCTTGCGG CCGACGTCGA GGTGGTCTCG ACCGGTTCGA CCGCCGACGT CCTGGCCCGT CACGGGCTGG CGGTCACACC GGTGAGCACC GTGACCGGAT TTCCGGAGGT GCTGGGCGGT CGCGTCAAGA CGCTGCACCC CCACGTGCAC GCCGGTCTGT TGGCCGATCT GCGTAACGCC GAGCACGCCG CGGTGTTGGC CGAACTCGAC ATCGCTCCGT TCGACCTGGT CGTCGTCAAT CTGTACCCGT TCGCCGCGAC AGTCGCCGCC GGTGCGAGCG AGGACGAGGC GATCGAGCAG ATCGACATCG GCGGTCCGGC CATGATCCGC GCGGCGGCGA AGAACCATGC GTCGGTCGCC GTTGTCGTCG CACCCGGCGA CTACGCCGAG CTGGCCGCCG CAGTCCGCGG ATCCGGATAT GATCTTCCCG CTCGCCGCCG GCTCGCCGCG AAGGCGTTCG CCCACACCGC GGCGTACGAC ATCGCCGTGT CCTCGTGGTT CGCCGGCGTC GTCGCGCCGG ACGAGGTGGC GCGGGAGAGC GGATGGCCCG ACGTGCTGTC CGCGCAGTGG CACCGTACGG AGGTCCTGCG TTACGGCGAG AACCCCCATC AGCGCGCCGC GCTCTACGTG GAGAGCGACG CCGAGGGTCG GCCCGGCCTC GCCTCGGCCC GTCAGCTGCA CGGCAAGCAG ATGTCCTACA ACAACTACAC CGACACCGAC GCCGCCCGCC GAGCGGTGTT CGACTTCACC GAGCCCGCCG TGGCCGTGAT CAAGCACGCC AACCCCTGCG GCATCGCGAT CGGCGCCACC ATCGCCGAGG CCCACCGCAA GGCGCATGCC TGCGACCCGG TCTCCGCCTT CGGCGGGGTG ATCGCGACCA ACCGTCCGGT CTCGGGCGAG CTCGCCGAAC AGATCGCGGA GATCTTCACC GAGGTCGTCG TCGCACCGGC CTACGAACCC GCCGCGGTGG AGATCCTCTC TCGTAAGCCG TCGATCCGGC TGCTGGAGTG CCCACCGCCG CCGCACCAGC GCGGGATCGA ACTGCGCCAG ATCAGCGGAG GCCTGCTCCT GCAGTCGCGG GACGCCGTCG ATGCGCCGGG CGACGAGCCG TCCGGATGGA CGCTGGAGGC GGGATCGCCT GCGGACGAGG CCCTGCTGGC CGAGTTGCGG TTCGCCTGGC GGGCGGTGCG CTCCGTGAAG TCGAACGCCA TCCTCCTCGC GTCCGGCGGT GCCACCGTCG GAGTCGGGAT GGGCCAGGTG AACCGGGTGG ACGCTGCCCG GCTCGCGGTG ACCCGGGCAG GGGACCGGGC GAAGGGGGCT GTCGCCGCGA GCGACGCCTA TTTCCCGTTC CCCGACGGTT TCGAGGTGCT CGCCGAGGCG GGGGTGCGGG CCGTGGTCGA ACCGGGCGGG TCGGTGCGCG ACGAGCTCGT CATCACGGCT GCCCGGGAGG CCGGCGTCAC GCTCTACTTC AGCGGTGTCC GCCACTTCGC GCACTGA
|
Protein sequence | MTSSGEGVAG EGGAVRGSEP GEVVPTGRRP LRRALVSVYD KSGLDVLAEA FLAADVEVVS TGSTADVLAR HGLAVTPVST VTGFPEVLGG RVKTLHPHVH AGLLADLRNA EHAAVLAELD IAPFDLVVVN LYPFAATVAA GASEDEAIEQ IDIGGPAMIR AAAKNHASVA VVVAPGDYAE LAAAVRGSGY DLPARRRLAA KAFAHTAAYD IAVSSWFAGV VAPDEVARES GWPDVLSAQW HRTEVLRYGE NPHQRAALYV ESDAEGRPGL ASARQLHGKQ MSYNNYTDTD AARRAVFDFT EPAVAVIKHA NPCGIAIGAT IAEAHRKAHA CDPVSAFGGV IATNRPVSGE LAEQIAEIFT EVVVAPAYEP AAVEILSRKP SIRLLECPPP PHQRGIELRQ ISGGLLLQSR DAVDAPGDEP SGWTLEAGSP ADEALLAELR FAWRAVRSVK SNAILLASGG ATVGVGMGQV NRVDAARLAV TRAGDRAKGA VAASDAYFPF PDGFEVLAEA GVRAVVEPGG SVRDELVITA AREAGVTLYF SGVRHFAH
|
| |