Gene Dshi_0358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0358 
SymbolpurH 
ID5711267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp340610 
End bp342199 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content70% 
IMG OID641266256 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001531708 
Protein GI159042914 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.278248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC TCGTGCCCCT GCGCCGCGCG CTGCTTTCCG TATCCGACAA GACCGGGCTT 
GTGCCCCTGG GTCAGGCGCT GGCCGCGCGG GGGGTGGAGT TGCTGTCCAC CGGGGGGACG
GCGAAGGCCT TGCGCGAGGC CGGGCTGGAC GTGGTGGACG TGTCCGATGT AACGGGCTTT
CCCGAGATGA TGGACGGGCG GGTCAAGACC CTGCATCCCA AGGTCCATGG CGGGCTTCTG
GCGCTGCGCG ACAACGCGGC CCATGTCTCG GCGATGGAGC GGCACGGCAT CGGGGCGATC
GACCTTTTGG TGGTGAACCT CTACCCGTTC GAGGCCACCG TGGCGGCGGG CGCGGATTAC
GCGGCCTGCA TCGAGAATAT CGACATCGGC GGGCCGGCGA TGATCCGGGC GGCGGCGAAG
AATCACAGCT TCGTCACGGT GCTGACGGAT GTGGAGGATT ACGAGGCCCT GCTGGGGGAG
CTGGAGGCGC GGGAGGGGGC CACCGGCTAT CCGTTCCGGC AGAAGATGGC GCTCAATGCC
TATGCGCGCA CGGCGGCCTA CGACGCGGCG GTGTCGGGCT GGATGACCGA TGCGCTGGCC
GAGGTGGCGC CGCGGCGGCG GGCGGTGGCG GGCACGCTGG CGCAGACCCT GCGTTATGGC
GAGAACCCGC ACCAGGGGGC GGCGTTCTAC GTGGACGGCT CGGACCGGCC CGGGGTGGCG
ACGGCGGTGC AGCACCAGGG CAAGGAGCTG AGCTACAACA ACATCAACGA CACGGACGCG
GCCTTCGAGC TGGTGGCGGA GTTCGCGCCC GAGGACGGGC CGGCATGCGC GATCATCAAG
CACGCCAATC CCTGCGGCGT GGCGCGGGGT GCCACCCTGG CGGAGGCCTA TACCAAGGCG
TTCCAATGCG ACCAGACCTC GGCCTTCGGT GGAATCATCG CGTTGAACCG GCCACTGGAC
GGCCCCACGG CCGAGGCGAT TTCCGGCATC TTCACCGAGG TGGTGATCGC CCCGGGGGCC
GATGAGACGG CGCGCGCGGT CTTCGCCGCC AAGAAGAACC TGCGCCTGCT GACGACCGAG
GGCCTGCCGG ACCCCAAGGC CCCGGCGCTG ACCGTGCGGC AGGTGTCGGG CGGCTACCTG
GTGCAGGACA AGGACAACGG CAATATCGGC TGGGACGACC TGAAGGTGGT CACGAAGCGC
GCGCCGAGCG AGGCGGAGAT CGCGGACCTT CTGTTCGCGT GGAAGGTCGC CAAGCATGTG
AAATCCAACG CCATCGTCTA TGTCAAGGAC GGCGCCACCG TGGGTGTGGG CGCGGGCCAG
ATGAGCCGGG TGGACAGCGC CCGGATCGCC GCGCGCAAGT CCGCCGACAT GGCCGAGGCG
CTGGGCCTCG AGACGCCGCT GATCCAGGGC TCGGTCGTAG CGTCGGATGC GTTCTTTCCC
TTCCCTGACG GGCTCTTGAC GGCGGCCGAG GCCGGGGCCA CGGCGGTGAT CCAGCCGGGC
GGGTCGATGC GCGATGTCGA GGTGATCGCG GCGGCCGACG CGGCCGGGCT GGCCATGGTC
TTCACCGGCA TGCGCCATTT CCGGCACTGA
 
Protein sequence
MTDLVPLRRA LLSVSDKTGL VPLGQALAAR GVELLSTGGT AKALREAGLD VVDVSDVTGF 
PEMMDGRVKT LHPKVHGGLL ALRDNAAHVS AMERHGIGAI DLLVVNLYPF EATVAAGADY
AACIENIDIG GPAMIRAAAK NHSFVTVLTD VEDYEALLGE LEAREGATGY PFRQKMALNA
YARTAAYDAA VSGWMTDALA EVAPRRRAVA GTLAQTLRYG ENPHQGAAFY VDGSDRPGVA
TAVQHQGKEL SYNNINDTDA AFELVAEFAP EDGPACAIIK HANPCGVARG ATLAEAYTKA
FQCDQTSAFG GIIALNRPLD GPTAEAISGI FTEVVIAPGA DETARAVFAA KKNLRLLTTE
GLPDPKAPAL TVRQVSGGYL VQDKDNGNIG WDDLKVVTKR APSEAEIADL LFAWKVAKHV
KSNAIVYVKD GATVGVGAGQ MSRVDSARIA ARKSADMAEA LGLETPLIQG SVVASDAFFP
FPDGLLTAAE AGATAVIQPG GSMRDVEVIA AADAAGLAMV FTGMRHFRH