Gene Rsph17029_2763 details


Gene Information

Locus tag: Rsph17029_2763
Symbol: purH
ID: 4897104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2904960
End bp: 2906549
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 70%
IMG OID: 640113365
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001044637
Protein GI: 126463523
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
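
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (neither convention is stated in the record); the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2904960, 2906549)
print(length)                  # 1590
print(protein_length(length))  # 529
```

Under these assumptions both values agree with the Gene Length and Protein Length fields in the record.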


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACC TCGTTCCCGT TGGCCGCGCC CTTCTGTCGG TTTCCGACAA GTCGGGTCTC 
CTCGACCTCG CCCGTGCCCT GGCAGATCTG GAGGTGGAGC TGATCTCGAC CGGCGGCACG
GCGGCCGCGC TGCGGGCGGC GGGGCTGAAG GTGCGGGATG TGGCCGAGGT CACGGGCTTT
CCCGAGATGA TGGACGGCCG GGTCAAGACC CTGCATCCGA TGGTCCATGG CGGGCTTCTG
GCGCTGCGCG ACGATGACGA GCATCTGGTG GCGATGGCCG CCCACGGGAT CGAGCCGATC
GACCTCCTGG TGGTGAACCT CTATCCGTTC GAAGCGGCGG TCGCGCGGGG TGCCTCCTAT
GACGACTGCA TCGAGAACAT CGACATCGGC GGGCCGGCCA TGATCCGGGC CGCGGCCAAG
AACCACCGCT TCGTGAACGT CGTGACCGAC ACGGCCGACT ACAAGGCGCT GCTCGACGAA
CTGCGCGCCC ATGACGGCGC CACGAGGCTC TCCTTCCGCC AGAAGCTCGC GCTGACCGCC
TATGCGCGCA CCGCGGCCTA CGACACGGCC GTCTCGACCT GGATGGCGGG CGCGCTGAAG
GCCGAGGCGC CGCGCCGCCG CTCCTTCGCG GGCACGCTCG CCCAGACGAT GCGCTACGGC
GAGAATCCGC ACCAGAAGGC CGCCTTCTAC ACCGACGGCT CGGCCCGTCC GGGCGTCGCC
ACCGCGAAAC AGTGGCAGGG CAAGGAGCTT TCCTACAACA ACATCAACGA CACCGACGCG
GCCTTCGAGC TGGTGGCCGA GTTCGACCCG GCCGAGGGCC CGGCCTGCGT CATCGTCAAG
CACGCCAACC CCTGCGGCGT GGCCCGGGGC GCGACACTGG CCGAAGCCTA TGCCCGCGCC
TTCGACTGCG ACCGCGTCTC GGCCTTCGGC GGCATCATCG CGCTGAACCA GCCGCTCGAT
GCGGCCACGG CCGAAAAGAT CACCGAGATC TTCACCGAGG TGGTGATCGC CCCCGGCGCC
GACGAGGAAG CCCGCGCGAT CTTCGCCGCC AAGAAGAACC TCCGGCTGCT GACGACCGAG
GCGCTGCCCG ATCCGCTGGC GCCGGGGCTC GCCTTCAAGC AGGTGGCGGG GGGCTTCCTC
GTGCAGGACC GCGACGCGGG CCATGTCGAT GCGCTCGACC TGAAGGTGGT GACGAAGCGC
GCGCCGTCGG ACGCCGAACT CGCCGACCTC CTCTTTGCCT GGACCGTGGC GAAGCATGTG
AAATCGAACG CCATCGTCTA TGTGAAGGAC GGGGCCACAG TGGGCGTGGG GGCGGGGCAG
ATGAGCCGCG TCGACTCGAC CCGGATCGCC GCGCGCAAGT CGCAGGACAT GGCGCAGGCG
CTGGGCCTGG CCCAGCCGCT GACGCAAGGG TCCGTCGTGG CCTCCGACGC CTTCTTCCCC
TTCGCCGACG GCCTGCTCGC CGCGGCCGAG GCGGGCGCCA CGGCGATCAT CCAGCCCGGC
GGCTCGATGC GCGACGACGA GGTGATCGCG GCGGCCGACG AGGCGGGGCT CGCCATGGTC
TTCACCGGCC AGCGTCACTT CCGGCACTGA
 
Protein sequence
MTNLVPVGRA LLSVSDKSGL LDLARALADL EVELISTGGT AAALRAAGLK VRDVAEVTGF 
PEMMDGRVKT LHPMVHGGLL ALRDDDEHLV AMAAHGIEPI DLLVVNLYPF EAAVARGASY
DDCIENIDIG GPAMIRAAAK NHRFVNVVTD TADYKALLDE LRAHDGATRL SFRQKLALTA
YARTAAYDTA VSTWMAGALK AEAPRRRSFA GTLAQTMRYG ENPHQKAAFY TDGSARPGVA
TAKQWQGKEL SYNNINDTDA AFELVAEFDP AEGPACVIVK HANPCGVARG ATLAEAYARA
FDCDRVSAFG GIIALNQPLD AATAEKITEI FTEVVIAPGA DEEARAIFAA KKNLRLLTTE
ALPDPLAPGL AFKQVAGGFL VQDRDAGHVD ALDLKVVTKR APSDAELADL LFAWTVAKHV
KSNAIVYVKD GATVGVGAGQ MSRVDSTRIA ARKSQDMAQA LGLAQPLTQG SVVASDAFFP
FADGLLAAAE AGATAIIQPG GSMRDDEVIA AADEAGLAMV FTGQRHFRH
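
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. The sketch below shows one way to strip the block formatting, compute a GC fraction, and check that a coding sequence ends in a stop codon of translation table 11 (TAA, TAG or TGA). The helper names are hypothetical, and the two demo strings are short excerpts copied from the start and end of the gene sequence, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return clean(seq)[-3:] in STOP_CODONS


first_blocks = "ATGACGAACC TCGTTCCCGT"             # excerpt: start of the gene
last_blocks = "TTCACCGGCC AGCGTCACTT CCGGCACTGA"   # excerpt: end of the gene

print(clean(first_blocks)[:3])      # first codon of the gene
print(ends_with_stop(last_blocks))  # stop-codon check on the final codon
print(round(gc_fraction(first_blocks), 2))
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content field reported in the Gene Information section.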