Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0206 |
Symbol | purH |
ID | 5207141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 250856 |
End bp | 252373 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640593836 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001274592 |
Protein GI | 148654387 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.951642 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTGCAC TGTTGAGCGT TTCTGATAAG CGCGGCATCG AGGCATTCGC TGCCGGTCTG GTTGAACTCG GGTACGAGAT TGTTTCGACC GGCAACACGG CGCGAACGCT TGCGGCCGCC GGTATTCCGG TTCGACCGGT CAGTGATGTC ACCGGTTTTC CTGAGATTCT CGGCGGGCGG GTGAAAACGC TCCACCCTGC CATTCATGCT GGCATCCTGG CGCGTCGTGA CGATCCTGGA CACATGGCGG CGCTGGATGT CCACGGTATT GCGCCGATCG ATATTGTCGC TGTTAATCTC TACCCCTTCA GCGAGACGAT TGCCCGTCCC GATGTTTCCT TTGCCGAAGC AATCGAGCAG ATCGATATCG GCGGTCCTGC CCTGGTACGC GCCGCTGCCA AGAACCACGA CTCGGTGCTG GTGGTGGTCA GTCCTGATGA TTATGATCCG GTGCTGACGG CGCTGCGCTC CGAAGCGGTG ACGCCTAACC TGCGCCGGCG TCTGGCAGCG CGCGCGTTTG CTCATACAGC AGCGTATGAT GCTGCGATTG CTGCGTATCT GTCCGATGAA CCCTTCCCGG AGACGCTGCC GCTGGCGTTC CGCAAGGCGC AGGATTTACG CTACGGTGAG AATCCACATC AGCGCGCTGC GCTTTATGGC GAGTTTCACA CCTTCTTCGA GCAACTGCAC GGTCGTGAGT TGTCGTATAT CAACATCCTT GATATTGCTG CGGTACAGGG GTTGATCGAG GAGTTCGATC CACAGGAAGG CGCCGCACTG GCAATCGTCA AGCATACGAA CCCGTGCGGC GTCGGCATCG GTGCAACGCC GCTCGAAGCC TGGGAAAAGG CATTTGCGAC CGACCGCGAG GCGCCGTTCG GCGGCATCAT TGCGGTGAAC CAGACGCTTG ATCTGCCGCT GGCGCAGGCG ATTGACGAGA TTTTCTCCGA GATTGTCATT GCGCCAGCGT TCGCCGATGA TGCGCTGGCG CTGCTGCGGA AGAAGAAGAA CCGCCGCCTG ATGCGTGCGC TGCGCCCCGT TCGCCTTGCC CGCGGGCTGG CATACCACAG CGTGCCCGGC GGTATCCTGG CGCAGGAGCC AGACCTTGCG CCGCTCGATG AGGAGCCGTT CCAGGTTGTG ACACAGCGCG CTCCGACTGA GACGGAACGG GCTGCGCTGC GCTTTGCCTG GCGCGTGGTG AAGCACGTCA AATCGAACGC GATAGTGTTT GCTGCTGCCG ACCGGACGTT AGGCATCGGC GCCGGGCAGA TGAGTCGCGT CGATAGTACG CGGGTGGCGG TGTGGAAAGC GCAGAACGCT GGTCTCTCGC TCGCCGGTTC GGTCATCGCC AGCGATGCGC TGTTCCCGTT CCCCGATAGT GTCGAGATTG CAGCGGCGGC GGGAGCAACA GCGGTTATTC AGCCCGGCGG ATCGGTGCGC GATGATGAGG TGATCGCCGC CGCCAACCGG CTCGGCATGG CGATGGTGTT CACCGGCAGA CGACATTTTC TGCACTGA
|
Protein sequence | MRALLSVSDK RGIEAFAAGL VELGYEIVST GNTARTLAAA GIPVRPVSDV TGFPEILGGR VKTLHPAIHA GILARRDDPG HMAALDVHGI APIDIVAVNL YPFSETIARP DVSFAEAIEQ IDIGGPALVR AAAKNHDSVL VVVSPDDYDP VLTALRSEAV TPNLRRRLAA RAFAHTAAYD AAIAAYLSDE PFPETLPLAF RKAQDLRYGE NPHQRAALYG EFHTFFEQLH GRELSYINIL DIAAVQGLIE EFDPQEGAAL AIVKHTNPCG VGIGATPLEA WEKAFATDRE APFGGIIAVN QTLDLPLAQA IDEIFSEIVI APAFADDALA LLRKKKNRRL MRALRPVRLA RGLAYHSVPG GILAQEPDLA PLDEEPFQVV TQRAPTETER AALRFAWRVV KHVKSNAIVF AAADRTLGIG AGQMSRVDST RVAVWKAQNA GLSLAGSVIA SDALFPFPDS VEIAAAAGAT AVIQPGGSVR DDEVIAAANR LGMAMVFTGR RHFLH
|
| |