Gene Information |
Locus tag | Rcas_4336 |
Symbol | purH |
ID | 5541849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5588663 |
End bp | 5590180 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640896442 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001434378 |
Protein GI | 156744249 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
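The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5588663
end_bp = 5590180
gene_length_bp = 1518
protein_length_aa = 505

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)
```

Under these assumptions the script prints `1518 0 505`, matching the Gene Length and Protein Length fields.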
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.166921 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGCGC TTGTAAGCGT TTCCGATAAG CGTGGCATCG AAGCATTTGC CGCCGGTCTC GTCGAATTCG GCTTCGAGAT TATCTCGACC GGTCATACGG CGCGAACGCT TGCCACGGCT GGTGTTCCAG TTCGACCGGT CAGTGACGTG ACGGGTTTTC CCGAAATCCT TGGCGGGCGG GTGAAAACGC TCCATCCCGC CATTCACGCT GGCATTCTGG CGCGCCGCGA TGATCCAGAC CATATGGCGG CGCTCGATGT TCATGGTATT GCGCCGATCG ATCTCGTTGT CGTCAATCTC TATCCCTTTA GTGAGACGAT CACCCGTCCC GATGTCACTC TTGCAGAAGC GATCGAACAG ATTGACATCG GCGGTCCTGC TATGGTTCGC GCCGCCGCCA AGAACCACCC ATCGGTGCTG GTGGTGGTCA GCCCCGACGA CTACGACGCG GTGCTGACGG CGTTGCGCAC CGAAACGGTG ACGCCAGAGT TGCGGCGGCG CCTGGCAGCG CGTGCCTTTG CTCACACTGC TGCGTATGAT GCCGCCATCG CGGGGTATTT GTCTGGGGAA CTCTTCCCGG AGACACTGCC ACTGGCGTTT CGTAAGGCGC AGGATCTGCG TTACGGCGAA AATCCGCATC AGCGCGCGGC GCTCTACGGT GATTTCCATG CCTTCTTCGA GCAACTGCAC GGACGTGAGT TGTCGTATAT CAATATTCTC GATATTGCTG CGGCTCAGTC GCTGATCGAG GAGTTCGACC CGGCAGCGGG CGCGGCGCTG GCGATTGTCA AGCATACGAA TCCGTGTGGC GCGGGAGTGG GCGCAACGCC GCTCGAAGCC TGGGAGAAAG CGTTCGCCAC CGACCGGGAA GCGCCATTTG GCGGTATTAT CGCGGTCAAC CAGATGCTCG ATCTTCCGTT GGCGCAGGCG ATTGACGAGA TTTTCTCCGA GATCGTCATT GCGCCCGCCT TCGCCGATGA TGCACTGGCA TTGCTGCGCA AGAAGAAAAA CCGGCGTTTG ATGCGTGCTC TGCGCCCGGT CGGGCAGTCG CGCGGGCTGG TATACCATAG CGTGCCGGGT GGCATCCTGG CGCAGGAGCC GGACCTTGCG CCGCTTGATG AGGAACCGTT CGAGGTGGTG ACACAGCGCA CGCCGACCGA TGCCGAACGC GCTGCGCTAC AGTTTGCCTG GCGGATCGTG AAGCATGTCA AGTCGAATGC GATCGTCTTT GCCGCTGCCG ATCGCACCCT GGGGATCGGC GCCGGGCAGA TGAGCCGGGT CGATAGTACG CGGGTGGCGG TGTGGAAGGC GCAGAATGCC GGTCTCTCGC TCGCCGGGTC GGTCATTGCC AGTGATGCGC TCTTCCCGTT CCCCGATAGC GTCGAGATCG CGGCGGAGGC TGGAGCAACG GCAGTGATTC AGCCCGGCGG GTCGGTGCGC GACGACGAGG TGATTGCTGC CGCCAACCGG CTTGGTATGG CGATGGTGTT CACCGGAAGA CGCCACTTCT TGCACTAG
Protein sequence | MRALVSVSDK RGIEAFAAGL VEFGFEIIST GHTARTLATA GVPVRPVSDV TGFPEILGGR VKTLHPAIHA GILARRDDPD HMAALDVHGI APIDLVVVNL YPFSETITRP DVTLAEAIEQ IDIGGPAMVR AAAKNHPSVL VVVSPDDYDA VLTALRTETV TPELRRRLAA RAFAHTAAYD AAIAGYLSGE LFPETLPLAF RKAQDLRYGE NPHQRAALYG DFHAFFEQLH GRELSYINIL DIAAAQSLIE EFDPAAGAAL AIVKHTNPCG AGVGATPLEA WEKAFATDRE APFGGIIAVN QMLDLPLAQA IDEIFSEIVI APAFADDALA LLRKKKNRRL MRALRPVGQS RGLVYHSVPG GILAQEPDLA PLDEEPFEVV TQRTPTDAER AALQFAWRIV KHVKSNAIVF AAADRTLGIG AGQMSRVDST RVAVWKAQNA GLSLAGSVIA SDALFPFPDS VEIAAEAGAT AVIQPGGSVR DDEVIAAANR LGMAMVFTGR RHFLH
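The sequences above are displayed in blocks of 10 characters separated by spaces, so the spacing has to be removed before any downstream use. The following is a minimal sketch that does this for the first three blocks of the gene sequence only and computes the GC fraction of that excerpt. The variable names are illustrative, and the excerpt value is not expected to equal the 62% reported for the full gene.

```python
# First three 10-base blocks of the gene sequence, as displayed in the record.
displayed = "GTGCGCGCGC TTGTAAGCGT TTCCGATAAG"

# Remove the display spacing to recover the contiguous sequence.
sequence = "".join(displayed.split())

# GC fraction = (number of G + number of C) / total number of bases.
gc_count = sequence.count("G") + sequence.count("C")
gc_fraction = gc_count / len(sequence)

print(len(sequence), gc_count, round(gc_fraction, 2))
```

The same two steps (strip whitespace, then count G and C) apply unchanged to the full gene sequence if it is pasted into `displayed`.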