Gene RPD_0719 details

Gene Information

Locus tag: RPD_0719
Symbol: purH
ID: 4021192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 808392
End bp: 809984
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 67%
IMG OID: 637960908
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_567858
Protein GI: 91975199
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
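
The Start bp, End bp, Gene Length and Protein Length fields above agree with each other, and the relationship can be checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS span that includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count of a CDS whose span includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(808392, 809984)
print(length)                     # 1593
print(protein_length_aa(length))  # 530
```

Both results match the Gene Length and Protein Length values reported in the record.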


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCATC ATCCGCGCCG CGTGACCCGC GCTCTGTTGT CCGTTTCCGA TAAGTCCGGG 
CTGATCGACT TTGCCCGCGC GCTGTCCGGC CACGGCGTCG AACTGGTCTC GACCGGCGGC
ACCGCCAAGG CGATCGCGGC GGCGGGGCTT GCGGTCAAGG ACGTCTCCGA GCTGACCGGC
TTTCCCGAGA TGATGGACGG TCGGGTCAAG ACGCTGCATC CGAAGGTGCA TGGCGGCCTG
CTGGCGATTC GCGACAATGC CGAGCACAAG CAGGCGATGG CCGCGCACGG CATCGCGCAG
ATCGATCTGC TGGTGGTCAA TCTCTATCCA TTCGAGGCGA CCGTCGACAA AGGCGCGTCC
TACGAGGATT GCATCGAGAA CATCGATATC GGCGGTCCGG CGATGATCCG CGCGGCGGCG
AAAAATCACG ACGACGTCGC GGTGGTGGTC GAGGCGTCCG ATTATTCTGC GGTGCTCGAC
GAACTCGCCG CCAATGCCGG CGCGACCTCG CTCGATCTGC GCAAGCGCCT CGCCGCCAAG
GCCTATGCCC GCACCGCGGC CTATGACGCG GCGATCTCGA ACTGGTTCGC GCTGCAGCTC
GAGACCGACG CGCCGGATTT CCGCGCGATC GGCGGCCGGC TGATCCAGAG CCTGCGCTAC
GGCGAGAACC CGCACCAGAG CGCCGCGTTC TACGCCACGC CGGAGAAGCG TCCGGGTGTC
GCCACCGCGC GCCAGGTGCA GGGCAAGGAG CTGTCCTACA ACAACATCAA CGACACCGAC
GCCGCCTATG AATGCGTCGG CGAATTCGAC GCCGGGCGCA CCGCCGCCTG CGTCATCGTC
AAGCACGCCA ACCCCTGCGG CGTCGCCGAA GGGGCGAGCC TGTTTGAGGC CTATCGCAAG
GCTCTGGCCT GCGATTCGAC CTCCGCCTTC GGCGGCATCG TCGCGCTCAA CCGCACGCTC
GATGCCGAAG CGGCGCGCGC GATTACCGAG ATCTTCACCG AAGTGATCAT CGCGCCGGAC
GCCAGCGAAG AGGCGATTGC GATCGTCGCT GCGAAGAAGA ATTTGCGGCT GCTGCTGGCG
GGCGCGCTGC CCGATCCGCG CGCCATCGGC CTCACCTACA AGACCGTCGC CGGCGGCCTC
TTGGTGCAGT CGCGCGACAA CGCCGTGGTC GACGACATGG CGCTGAAGGT CGTTACCAAG
CGGCAGCCGA CCGAGGCGGA GCTGCGCGAT CTGAAATTCG CCTTCCGCGT CGCCAAGCAC
GTCAAGTCGA ACACCATCAT CTATGCCAAG GACCTCGCCA CCGTCGGCAT CGGCGCCGGC
CAGATGAGCC GCGTCGATTC CGCCCGCATC GCCGCCCGCA AGGCGCAGGA TGCCGCTACC
GAGTTGAAGC TCGCCGCGCC GATGACCAAG GGCTCGGTGG TGGCCTCCGA CGCGTTCTTC
CCGTTCGCCG ACGGGATGCT CGCCTGTATC GAGGCCGGCG CCACCGCGGT GATCCAGCCC
GGCGGCTCGG TCCGCGACGA CGAAGTGATC AAGGCCGCGG ACGATGCCGG CATCGCGATG
GTGTTCACCG GCACCCGGCA TTTCCGGCAC TAG
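
The gene sequence above is laid out in blocks of 10 bases, 60 per row, so it must be stripped of whitespace before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of the record. The helper names are hypothetical, and the sketch assumes the input contains only the letters A, C, G and T plus whitespace.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, pasted exactly as displayed.
first_row = "ATGACCCATC ATCCGCGCCG CGTGACCCGC GCTCTGTTGT CCGTTTCCGA TAAGTCCGGG"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_content(first_row), 1))
```

Passing all rows of the gene sequence to `gc_content` gives the value for the whole gene.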
 
Protein sequence
MTHHPRRVTR ALLSVSDKSG LIDFARALSG HGVELVSTGG TAKAIAAAGL AVKDVSELTG 
FPEMMDGRVK TLHPKVHGGL LAIRDNAEHK QAMAAHGIAQ IDLLVVNLYP FEATVDKGAS
YEDCIENIDI GGPAMIRAAA KNHDDVAVVV EASDYSAVLD ELAANAGATS LDLRKRLAAK
AYARTAAYDA AISNWFALQL ETDAPDFRAI GGRLIQSLRY GENPHQSAAF YATPEKRPGV
ATARQVQGKE LSYNNINDTD AAYECVGEFD AGRTAACVIV KHANPCGVAE GASLFEAYRK
ALACDSTSAF GGIVALNRTL DAEAARAITE IFTEVIIAPD ASEEAIAIVA AKKNLRLLLA
GALPDPRAIG LTYKTVAGGL LVQSRDNAVV DDMALKVVTK RQPTEAELRD LKFAFRVAKH
VKSNTIIYAK DLATVGIGAG QMSRVDSARI AARKAQDAAT ELKLAAPMTK GSVVASDAFF
PFADGMLACI EAGATAVIQP GGSVRDDEVI KAADDAGIAM VFTGTRHFRH
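
The protein sequence is the translation of the gene sequence under translation table 11. Table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the set of permitted start codons, so a standard codon table is enough for a sketch. The code below is a minimal illustration with hypothetical names. It does not handle alternative start codons or ambiguous bases, which this gene (starting with ATG) does not need.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGACCCATC ATCCGCGCCG CGTGACCCGC"))  # MTHHPRRVTR
```

The output matches the first ten residues of the protein sequence. The last row of the gene sequence begins in frame (1560 bases precede it) and translates to VFTGTRHFRH before the TAG stop codon, which matches the end of the protein.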