Gene Rsph17025_2928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2928 
SymbolpurH 
ID5084528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2985824 
End bp2987413 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content70% 
IMG OID640484499 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001169119 
Protein GI146278960 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0919623 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0651041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATC TTGTTCCCGT TGGCCGCGCC CTTCTGTCGG TTTCCGACAA GTCGGGGCTC 
CTCGACCTCG CACGCGCCCT GGCCGAGCTG GAGGTGGAGC TGATCTCGAC CGGCGGCACG
GCCGCCACGC TGCGGGCCGC GGGGCTCAAG GTGCGCGACG TGGCCGAGGT CACGGGCTTC
CCCGAGATGA TGGACGGCCG GGTCAAGACG CTGCATCCGA TGGTGCATGG CGGGCTTCTG
GCGCTGCGCG ACGATGACGA GCATCTGGTG GCGATGGCCG CGCACGGGAT CGAGCCGATC
GACCTCCTGG TGGTGAACCT CTATCCGTTC GAAGCGGCGG TGGCGCGCGG CGCCTCCTAC
GATGACTGCA TCGAGAACAT CGACATCGGC GGTCCGGCCA TGATCCGGGC GGCGGCCAAG
AACCACCGCT TCGTGAACGT CGTGACCGAC ACGGCCGACT ACAAGGCGCT GCTCGACGAG
CTGCGCGCGC ACGACGGGGC CACGACGCTC GCCTTCCGGC AGAAGCTGGC GCTGACGGCC
TATTCGCGCA CCGCCGCCTA TGATGCGGCC GTGTCGGCCT GGATGGCCGG GGCGCTGAAG
TCCGAGGCGC CGCGCCGCCG CACCTTTGCC GGCACACTGG CCCAGACCAT GCGCTACGGC
GAGAATCCGC ACCAGAAGGC GGCCTTCTAC ACCGACGGCT CGCACCGGCC GGGCGTCGCC
ACCGCGAAAC AGTGGCAGGG CAAGGAGCTC TCCTACAACA ACATCAACGA CACCGATGCG
GCCTTCGAGC TGGTGGCCGA GTTCGATCCC TCCGAGGGTC CGGCCTGCGT GATCGTCAAG
CACGCCAACC CCTGCGGCGT GGCGCGGGGC GCGACGCTGG CCGAGGCCTA CGGGCGCGCC
TTCGACTGCG ACCGCGTCTC GGCCTTTGGC GGCATCATCG CGCTGAACCA GCCGCTCGAC
GCGGCGACGG CCGAAAAGAT CACCGAGATC TTCACCGAGG TGGTGATCGC CCCCGGCGCC
GACGAGGAGG CCCGCGCGAT CTTCGCCGCC AAGAAGAACC TGCGGCTGCT GACGACCGAG
GCCCTGCCCG ATCCGCTCGC GCCGGGGCTG GCGTTCAAGC AGGTGGCGGG CGGCTTCCTC
GTGCAGGACC GCGACGCGGG CCATGTCGAT GCGCTCGACC TGAAGGTGGT GACGAAGCGC
GCGCCTTCGG ACGCGGAACT GGCCGACCTG CTCTTCGCCT GGACCGTGGC CAAGCATGTC
AAATCCAACG CCATCGTCTA TGTGAAGGAC GGCGCCACCG TGGGCGTGGG TGCGGGCCAG
ATGAGCCGGG TCGATTCCAC CCGCATCGCC GCGCGCAAGT CGCAGGACAT GGCGCAGGCG
CTCGGCCTCG CGCAGCCGCT GACTCAAGGC TCGGTCGTGG CCTCGGACGC CTTCTTCCCC
TTCGCCGACG GCCTTCTCGC CGCCGCCGAG GCGGGGGCGA CGGCCATCAT CCAGCCCGGC
GGCTCGATGC GCGACGACGA GGTGATCGCG GCGGCCGACG AGGCGGGCCT TGCGATGGTC
TTCACCGGCC AGCGGCACTT CCGGCACTGA
 
Protein sequence
MTNLVPVGRA LLSVSDKSGL LDLARALAEL EVELISTGGT AATLRAAGLK VRDVAEVTGF 
PEMMDGRVKT LHPMVHGGLL ALRDDDEHLV AMAAHGIEPI DLLVVNLYPF EAAVARGASY
DDCIENIDIG GPAMIRAAAK NHRFVNVVTD TADYKALLDE LRAHDGATTL AFRQKLALTA
YSRTAAYDAA VSAWMAGALK SEAPRRRTFA GTLAQTMRYG ENPHQKAAFY TDGSHRPGVA
TAKQWQGKEL SYNNINDTDA AFELVAEFDP SEGPACVIVK HANPCGVARG ATLAEAYGRA
FDCDRVSAFG GIIALNQPLD AATAEKITEI FTEVVIAPGA DEEARAIFAA KKNLRLLTTE
ALPDPLAPGL AFKQVAGGFL VQDRDAGHVD ALDLKVVTKR APSDAELADL LFAWTVAKHV
KSNAIVYVKD GATVGVGAGQ MSRVDSTRIA ARKSQDMAQA LGLAQPLTQG SVVASDAFFP
FADGLLAAAE AGATAIIQPG GSMRDDEVIA AADEAGLAMV FTGQRHFRH