Gene RSP_1100 details


Gene Information

Locus tag: RSP_1100
Symbol: purH
ID: 3720859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 2857807
End bp: 2859396
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 70%
IMG OID: 640072332
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_354187
Protein GI: 77464683
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
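
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record for RSP_1100.
gene_length = span_length(2857807, 2859396)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1590 529
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the record.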


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.759839
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAACC TCGTTCCCGT TGGCCGCGCC CTTCTGTCGG TTTCCGACAA GTCGGGTCTC 
CTCGACCTCG CCCGTGCCCT GGCAGATCTG GAGGTGGAGC TGATCTCGAC CGGCGGCACG
GCGGCCGCGC TGCGGGCGGC GGGGCTGAAG GTGCGGGACG TCGCCGAGGT CACGGGCTTT
CCCGAGATGA TGGACGGCCG GGTCAAGACC CTGCATCCGA TGGTCCACGG CGGGCTTCTG
GCGCTGCGCG ACGACGACGA GCATCTGGTG GCGATGGCCG CCCACGGGAT CGAGCCGATC
GACCTCCTGG TGGTGAACCT CTATCCGTTC GAAGCGGCCG TCGCGCGGGG CGCCTCCTAC
GACGACTGCA TCGAGAACAT CGACATCGGC GGGCCGGCCA TGATCCGGGC CGCGGCCAAG
AACCACCGTT TCGTGAACGT CGTGACCGAC ACGGCCGACT ACAAGGCGCT GCTCGACGAG
CTGCGCGCCC ATGACGGCGC CACGAGGCTC TCCTTCCGCC AGAAACTCGC GCTGACCGCC
TATGCGCGCA CCGCGGCCTA CGACACGGCC GTCTCGACCT GGATGGCGGG CGCGCTGAAG
GCCGAGGCGC CGCGCCGCCG CTCCTTCGCG GGCACGCTCG CCCAGACCAT GCGCTACGGC
GAGAACCCGC ACCAGAAGGC CGCCTTCTAC ACCGACGGCT CGGCCCGTCC GGGCGTCGCC
ACCGCGAAAC AGTGGCAGGG CAAGGAGCTT TCCTACAACA ACATCAACGA CACCGACGCG
GCCTTCGAGC TGGTGGCCGA GTTCGACCCG GCCGAGGGCC CGGCCTGCGT CATCGTCAAG
CACGCCAACC CCTGCGGCGT GGCCCGGGGC GCGACACTGG CCGAAGCCTA TGCCCGCGCC
TTCGACTGCG ACCGCGTCTC GGCCTTCGGC GGCATCATCG CGCTGAACCA GCCGCTCGAT
GCGGCCACGG CCGAAAAGAT CACCGAGATC TTCACCGAGG TGGTGATCGC CCCCGGCGCC
GACGAGGAAG CCCGCGCGAT CTTCGCCGCC AAGAAGAACC TCCGGCTGCT GACGACCGAG
GCGCTGCCCG ATCCGCTGGC GCCGGGGCTC GCCTTCAAGC AGGTGGCGGG CGGCTTCCTC
GTGCAGGACC GCGACGCGGG CCATGTCGAT GCGCTCGACC TGAAGGTGGT GACGAAGCGC
GCGCCCTCGG ACGCGGAACT CGCCGACCTC CTCTTTGCCT GGACCGTGGC CAAGCATGTG
AAATCGAACG CCATCGTCTA TGTGAAGGAC GGGGCCACCG TGGGCGTGGG GGCGGGGCAG
ATGAGCCGCG TCGACTCGAC CCGGATCGCT GCGCGCAAGT CGCAGGACAT GGCGCAGGCG
CTGGGTCTGG CCCAGCCGCT GACGCAAGGG TCCGTCGTGG CCTCCGACGC CTTCTTCCCC
TTCGCCGACG GCCTGCTCGC CGCGGCCGAG GCGGGCGCCA CCGCGATCAT CCAGCCCGGC
GGCTCGATGC GCGACGACGA GGTGATCGCG GCGGCCGACG AGGCGGGGCT CGCCATGGTC
TTCACCGGCC AGCGTCACTT CCGGCACTGA
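
The gene sequence is listed in blocks of ten bases, so it has to be joined before fields such as GC content can be recomputed from it. The following is a rough sketch with hypothetical helper names; it assumes the sequence contains only A, C, G and T, and it checks only for the three stop codons TAA, TAG and TGA, which are stop codons in translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks of the blocked listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Only the first and last lines of the listing are used here, for illustration;
# paste the full listing to recompute the GC content of the whole gene.
first_line = clean_sequence(
    "ATGACGAACC TCGTTCCCGT TGGCCGCGCC CTTCTGTCGG TTTCCGACAA GTCGGGTCTC"
)
last_line = clean_sequence("TTCACCGGCC AGCGTCACTT CCGGCACTGA")
print(len(first_line), round(gc_fraction(first_line), 2))
print(ends_with_stop(last_line))
```

Run on the complete sequence, `gc_fraction` can be compared with the GC content field of the record, and `len` of the cleaned sequence with the Gene Length field.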
 
Protein sequence
MTNLVPVGRA LLSVSDKSGL LDLARALADL EVELISTGGT AAALRAAGLK VRDVAEVTGF 
PEMMDGRVKT LHPMVHGGLL ALRDDDEHLV AMAAHGIEPI DLLVVNLYPF EAAVARGASY
DDCIENIDIG GPAMIRAAAK NHRFVNVVTD TADYKALLDE LRAHDGATRL SFRQKLALTA
YARTAAYDTA VSTWMAGALK AEAPRRRSFA GTLAQTMRYG ENPHQKAAFY TDGSARPGVA
TAKQWQGKEL SYNNINDTDA AFELVAEFDP AEGPACVIVK HANPCGVARG ATLAEAYARA
FDCDRVSAFG GIIALNQPLD AATAEKITEI FTEVVIAPGA DEEARAIFAA KKNLRLLTTE
ALPDPLAPGL AFKQVAGGFL VQDRDAGHVD ALDLKVVTKR APSDAELADL LFAWTVAKHV
KSNAIVYVKD GATVGVGAGQ MSRVDSTRIA ARKSQDMAQA LGLAQPLTQG SVVASDAFFP
FADGLLAAAE AGATAIIQPG GSMRDDEVIA AADEAGLAMV FTGQRHFRH