Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_3123 |
Symbol | purH |
ID | 4082709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | - |
Start bp | 3274335 |
End bp | 3275993 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638011508 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_618159 |
Protein GI | 103488598 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTCTTGG GGAGTGACCC GACTAGGGGA GCCGCGTCTT TCCCGTCTCG CAAAAGGCCG CCCTCCATGA CCGACCTGAT TCCCGTCCGC CGCGCGCTCT TGTCCGTCAG CGACAAGGCG GGGCTTGCCG ATCTGGCCGC GGCGCTCGTC CGCCACGGGG TCGAACTGGT GTCGACCGGG GGGACTGCGA AGGCATTGCG CGAGGCGGGT CATAGCGTGC TCGATGTCGC CGATTTGACC GGCTTTCCCG AGATGATGGA CGGCCGCGTC AAGACGCTGC ACCCGGCGGT GCATGGCGGC ATATTGGCGG TGCGCGACGA CGAGCGCCAC GTCGCCGCGA TGGACGCGCA CGGGATCGGC GCGATCGATC TGGTCGTCGT CAATCTCTAC CCCTTCGCCG CGACCGTCGC GAAGGGCGCG GCGCGCGACG AGATCATCGA AAATATAGAC ATCGGCGGCC CCGCGATGGT GCGCTCGGCA GCGAAGAACC ATGCGTTCGT CGGCATCGTC ACCGAGCCCG AGGATTATGC CGCGGTGATC GCGGAGATGG ACGCCAACGG CGGCGCGATG ACGCTGGACC TGCGCAAGCG GCTCGCCGCG ACCGCCTTTG CCCACACCGC CACCTATGAC GGGACGATCG CGAGCTGGTT CGCCTTTGCC GACCAGGGCA AGCTGTTTCC CGACACGCTG CCGCTGACCG CCAAGCTGTC GGCCGAACTG CGCTATGGCG AAAATCCGCA CCAAAAGGCC GCGCTTTACC TGCCCGCCGG TCCCGCCGGG CGCGGGATAG CGCAAGCCGA ACAGGTGCAG GGCAAGGAAC TCAGCTACAA CAATATCAAC GACGCCGATG CCGCGCTCGA ACTCGTCGCG GAGTTTCGCG AGGCCGATCC GACCTGCGTG ATCGTCAAGC ACGCCAATCC GTGCGGCGTC GCGACCGCCG CGAGTTTGAG CCAGGCCTAT GACGCGGCGC TGAAATGCGA CGATGTGTCG GCGTTCGGCG GGATCATCGC GGTCAACCGA CCACTCGACG GGCCGACGGC GGAGGCGATC AGCGGCATTT TCACCGAGGT CGTCTGCGCC CCCGACGCCG ATGCCGATGC CCGTGCGGTG TTCGCGAAGA AGAAGAACCT CCGCCTGCTG CTCACCGGCG ACTTGCCCGA TCCGGCGCGC GGCGGGTTGA TGCTGAAGAC GATCGCCGGC GGCTGGCTCG CGCAGAGCCG CGACAACGGC CGCATCACCC GCGCCGACCT GAAGGTCGTG ACCGACCGCG CGCCGACCGA GGAAGAACTG GCCGACGCGC TATTCGCGTG GACGGTTGCC AAGCATGTGA AGTCGAACGC GATCGTCTAT GCCAAGGGCG GCGCAACCGC GGGCATCGGC GCGGGGCAGA TGAACCGCCG CGACAGCGCG CGCATTGCCG CGGCGAAAGC GCGCGAAGCG GCCGAATCCC ATGGCTGGGC AAGCCCGCGC ACCATTGGCA GCGCGGTCGC CAGCGACGCC TTCTTCCCCT TTGCCGACGG GTTGCTCGCG GCGGTCGAGG CGGGCGCGAC CTGCGTGATC CAGCCCGGCG GATCGATCCG CGACGATGAG GTGATCGCAG CCGCGAACAA AGCCGGGCTG GCGATGGTCT TCACCGGAAT GCGGCATTTC CGGCATTGA
|
Protein sequence | MLLGSDPTRG AASFPSRKRP PSMTDLIPVR RALLSVSDKA GLADLAAALV RHGVELVSTG GTAKALREAG HSVLDVADLT GFPEMMDGRV KTLHPAVHGG ILAVRDDERH VAAMDAHGIG AIDLVVVNLY PFAATVAKGA ARDEIIENID IGGPAMVRSA AKNHAFVGIV TEPEDYAAVI AEMDANGGAM TLDLRKRLAA TAFAHTATYD GTIASWFAFA DQGKLFPDTL PLTAKLSAEL RYGENPHQKA ALYLPAGPAG RGIAQAEQVQ GKELSYNNIN DADAALELVA EFREADPTCV IVKHANPCGV ATAASLSQAY DAALKCDDVS AFGGIIAVNR PLDGPTAEAI SGIFTEVVCA PDADADARAV FAKKKNLRLL LTGDLPDPAR GGLMLKTIAG GWLAQSRDNG RITRADLKVV TDRAPTEEEL ADALFAWTVA KHVKSNAIVY AKGGATAGIG AGQMNRRDSA RIAAAKAREA AESHGWASPR TIGSAVASDA FFPFADGLLA AVEAGATCVI QPGGSIRDDE VIAAANKAGL AMVFTGMRHF RH
|
| |