Gene RPB_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0084 
SymbolpurH 
ID3908726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp88475 
End bp90067 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content67% 
IMG OID637881965 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_483707 
Protein GI86747211 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.309239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.165447 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC TTCCGCGCCG CGTGACCCGC GCTCTGTTGT CCGTTTCCGA CAAGACCGGG 
CTGGTCGATT TCGCCCGCGC GCTGGCCGGC CACGGCGTCG AACTGGTCTC GACCGGCGGC
ACCGCCAAGG CGATCGCGGC GGCCGGGCTG CCGGTCAAGG ACGTCTCCGA GATCACCGGC
TTTCCCGAGA TGATGGACGG CCGGGTCAAG ACGCTGCATC CCAAGGTGCA TGGCGGCCTG
CTGGCGGTCC GCGACAATGA CGAGCACAAG CAGGCGATGG CGGCGCACGG CATCGCCCAG
ATCGACCTCC TCGTGGTCAA TCTGTATCCG TTCGAGGCCA CCGTCGACAA AGGCGGCTCC
TACGAGGACT GCATCGAGAA CATCGACATC GGCGGCCCGG CGATGATCCG CGCCGCAGCG
AAGAATCACG ACGACGTCGC GGTGATCGTC GAATCGTCCG ACTATCAGGC GGTGCTCGAC
GAACTCGCGG CCAATGCCGG CGCCACCTCG CACGGCTTGC GCAAGCGCCT CGCCGCCAAG
GCCTATGCCC GCACCGCGGC CTACGACGCC GCGATCTCCA ATTGGTTCGC GCAGCAATTG
AAGACCGATG CGCCGGATTT CCGCGCGATC GGCGGCCGGC TGATCCAGAG CCTGCGCTAC
GGCGAGAACC CGCATCAGAC CGCGGCGTTC TACGCCACCC CGGAGAAGCG TCCGGGCGTC
GCCACCGCGC GGCAGGTGCA GGGCAAGGAA CTGTCCTACA ACAACATCAA CGATACCGAC
GCGGCCTATG AATGCGTCGG CGAGTTCGAC GCCAAGCGCA CTGCAGCCTG CGTCATCGTC
AAGCACGCCA ATCCCTGCGG CGTCGCCGAA GGATCGAGCC TGCTCGATGC CTATCGCAAG
GCGCTGGCGT GCGATTCGAC CTCGGCGTTC GGCGGCATCG TCGCGCTCAA CCGCACGCTC
GACGCCGAAG CCGCACGCGC GATCGTCGAG ATCTTCACCG AAATGATCAT CGCGCCCGAG
GCGAGCGAGG AAGCGATCGC GATCGTGGCG GCGAAGAAAA ACTTGCGGCT GCTGCTGGCC
GGCAGCCTGC CCAACCCGCG CGCCGCCGGC CTGACCTACA AGAGCGTGTC CGGAGGGCTG
CTGGTGCAGT CGCGCGACAA TGCGGTGGTC GACGACATGG CGCTCAAGGT CGTCACCAAG
CGGCAGCCGA GCGAGGCCGA ACTGCGCGAC CTGAAATTCG CCTTCCGGGT CGCCAAGCAC
GTCAAGTCCA ACACCATCAT CTACGCCAAG GATCTGGCCA CCGTCGGCAT CGGCGCCGGC
CAGATGAGCC GGGTCGATTC CGCCCGCATT GCCGCGCGAA AAGCGCAGGA TGCCGCCGCC
GAGCTGAAAC TCGCGGCGCC GATGACCAAG GGCTCGGTGG TGGCATCGGA CGCGTTCTTC
CCGTTCGCCG ACGGCATGCT CGCCTGCATC GAAGCCGGCG CCACCGCGGT GATCCAGCCC
GGCGGCTCGG TGCGCGACGA CGAAGTCATC AAGGCCGCGG ACGACGCCGG CATCGCCATG
GTGTTCACCG GGACCAGGCA TTTCCGGCAT TGA
 
Protein sequence
MTDLPRRVTR ALLSVSDKTG LVDFARALAG HGVELVSTGG TAKAIAAAGL PVKDVSEITG 
FPEMMDGRVK TLHPKVHGGL LAVRDNDEHK QAMAAHGIAQ IDLLVVNLYP FEATVDKGGS
YEDCIENIDI GGPAMIRAAA KNHDDVAVIV ESSDYQAVLD ELAANAGATS HGLRKRLAAK
AYARTAAYDA AISNWFAQQL KTDAPDFRAI GGRLIQSLRY GENPHQTAAF YATPEKRPGV
ATARQVQGKE LSYNNINDTD AAYECVGEFD AKRTAACVIV KHANPCGVAE GSSLLDAYRK
ALACDSTSAF GGIVALNRTL DAEAARAIVE IFTEMIIAPE ASEEAIAIVA AKKNLRLLLA
GSLPNPRAAG LTYKSVSGGL LVQSRDNAVV DDMALKVVTK RQPSEAELRD LKFAFRVAKH
VKSNTIIYAK DLATVGIGAG QMSRVDSARI AARKAQDAAA ELKLAAPMTK GSVVASDAFF
PFADGMLACI EAGATAVIQP GGSVRDDEVI KAADDAGIAM VFTGTRHFRH