Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0358 |
Symbol | purH |
ID | 5711267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 340610 |
End bp | 342199 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641266256 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001531708 |
Protein GI | 159042914 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.278248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC TCGTGCCCCT GCGCCGCGCG CTGCTTTCCG TATCCGACAA GACCGGGCTT GTGCCCCTGG GTCAGGCGCT GGCCGCGCGG GGGGTGGAGT TGCTGTCCAC CGGGGGGACG GCGAAGGCCT TGCGCGAGGC CGGGCTGGAC GTGGTGGACG TGTCCGATGT AACGGGCTTT CCCGAGATGA TGGACGGGCG GGTCAAGACC CTGCATCCCA AGGTCCATGG CGGGCTTCTG GCGCTGCGCG ACAACGCGGC CCATGTCTCG GCGATGGAGC GGCACGGCAT CGGGGCGATC GACCTTTTGG TGGTGAACCT CTACCCGTTC GAGGCCACCG TGGCGGCGGG CGCGGATTAC GCGGCCTGCA TCGAGAATAT CGACATCGGC GGGCCGGCGA TGATCCGGGC GGCGGCGAAG AATCACAGCT TCGTCACGGT GCTGACGGAT GTGGAGGATT ACGAGGCCCT GCTGGGGGAG CTGGAGGCGC GGGAGGGGGC CACCGGCTAT CCGTTCCGGC AGAAGATGGC GCTCAATGCC TATGCGCGCA CGGCGGCCTA CGACGCGGCG GTGTCGGGCT GGATGACCGA TGCGCTGGCC GAGGTGGCGC CGCGGCGGCG GGCGGTGGCG GGCACGCTGG CGCAGACCCT GCGTTATGGC GAGAACCCGC ACCAGGGGGC GGCGTTCTAC GTGGACGGCT CGGACCGGCC CGGGGTGGCG ACGGCGGTGC AGCACCAGGG CAAGGAGCTG AGCTACAACA ACATCAACGA CACGGACGCG GCCTTCGAGC TGGTGGCGGA GTTCGCGCCC GAGGACGGGC CGGCATGCGC GATCATCAAG CACGCCAATC CCTGCGGCGT GGCGCGGGGT GCCACCCTGG CGGAGGCCTA TACCAAGGCG TTCCAATGCG ACCAGACCTC GGCCTTCGGT GGAATCATCG CGTTGAACCG GCCACTGGAC GGCCCCACGG CCGAGGCGAT TTCCGGCATC TTCACCGAGG TGGTGATCGC CCCGGGGGCC GATGAGACGG CGCGCGCGGT CTTCGCCGCC AAGAAGAACC TGCGCCTGCT GACGACCGAG GGCCTGCCGG ACCCCAAGGC CCCGGCGCTG ACCGTGCGGC AGGTGTCGGG CGGCTACCTG GTGCAGGACA AGGACAACGG CAATATCGGC TGGGACGACC TGAAGGTGGT CACGAAGCGC GCGCCGAGCG AGGCGGAGAT CGCGGACCTT CTGTTCGCGT GGAAGGTCGC CAAGCATGTG AAATCCAACG CCATCGTCTA TGTCAAGGAC GGCGCCACCG TGGGTGTGGG CGCGGGCCAG ATGAGCCGGG TGGACAGCGC CCGGATCGCC GCGCGCAAGT CCGCCGACAT GGCCGAGGCG CTGGGCCTCG AGACGCCGCT GATCCAGGGC TCGGTCGTAG CGTCGGATGC GTTCTTTCCC TTCCCTGACG GGCTCTTGAC GGCGGCCGAG GCCGGGGCCA CGGCGGTGAT CCAGCCGGGC GGGTCGATGC GCGATGTCGA GGTGATCGCG GCGGCCGACG CGGCCGGGCT GGCCATGGTC TTCACCGGCA TGCGCCATTT CCGGCACTGA
|
Protein sequence | MTDLVPLRRA LLSVSDKTGL VPLGQALAAR GVELLSTGGT AKALREAGLD VVDVSDVTGF PEMMDGRVKT LHPKVHGGLL ALRDNAAHVS AMERHGIGAI DLLVVNLYPF EATVAAGADY AACIENIDIG GPAMIRAAAK NHSFVTVLTD VEDYEALLGE LEAREGATGY PFRQKMALNA YARTAAYDAA VSGWMTDALA EVAPRRRAVA GTLAQTLRYG ENPHQGAAFY VDGSDRPGVA TAVQHQGKEL SYNNINDTDA AFELVAEFAP EDGPACAIIK HANPCGVARG ATLAEAYTKA FQCDQTSAFG GIIALNRPLD GPTAEAISGI FTEVVIAPGA DETARAVFAA KKNLRLLTTE GLPDPKAPAL TVRQVSGGYL VQDKDNGNIG WDDLKVVTKR APSEAEIADL LFAWKVAKHV KSNAIVYVKD GATVGVGAGQ MSRVDSARIA ARKSADMAEA LGLETPLIQG SVVASDAFFP FPDGLLTAAE AGATAVIQPG GSMRDVEVIA AADAAGLAMV FTGMRHFRH
|
| |