Gene Rleg_4239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4239 
SymbolpurH 
ID8015022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4339072 
End bp4340688 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content63% 
IMG OID644826809 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002978018 
Protein GI241206922 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.156466 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTCA TTTCCAAGAA GATCCCCGCC CCCGACAAGG TCGAAATCAA GACCGCGCTC 
ATCTCCGTCT TCGACAAGAC CGGGATCGTC GACCTCGCCC ACGCCTTGTC TGCCAGAGGT
GTGCGCCTGC TTTCGACCGG CGGCACCTAT AAAGCGATCA CTGCTGCCGG TCTTGCCGTC
ACCGATGTTT CCGAAGTCAC CGGTTTTCCG GAGATCATGG ATGGGCGTGT GAAGACGCTG
CATCCGACGG TGCATGGCGG CCTGCTGGCG ATCCGTGACG ACAGCGAACA CCAGGAAGCG
ATGAAAACGC ATGGCATCGA GGGCATCGAC CTCGCAGTCA TCAACCTCTA TCCCTTCGAG
CAGGTGCGCG CAGCCGGCGG CGATTATCCG ACGACGGTCG AGAATATCGA CATTGGCGGC
CCGGCGATGA TCCGCGCATC GGCCAAGAAC CATGCCTATG TGACAACCTT GACCGATCCG
GCCGATTATG CCGAGCTGCT GGAGCAGCTT TCCGCAGATG ACGGCAAGAC CGCCTATGCC
TTCCGCCAGC GTATGGCTGC CAAAGCCTAT GCCCGCACCG CCGCCTATGA TGCAATGATC
TCCAATTGGT TTGCTGAGGC GCTGTCGATC GACACGCCGC GCCACCGGGT CATCGGCGGC
GCGCTGAAGG AAGAGATGCG CTACGGCGAA AACCCGCACC AGAAGGCCGC CTTCTACGTA
ACCGGCGAGA AGCGTCCGGG TGTTTCGACG GCCGCTCTTC TCCAGGGCAA GCAGCTCTCC
TACAACAATA TCAACGATAC GGATGCGGCC TACGAGCTGG TCGCCGAGTT CCTGCCTGAG
AGGGCGCCGG CCTGCGCGAT CATCAAGCAT GCCAATCCCT GCGGCGTCGC CACCGGATCG
AGCCTGGTCG AGGCCTATCG GCGGGCGCTC GCCTGCGATT CCGTTTCCGC CTTCGGCGGC
ATCATCGCGC TGAACCAAAC GCTGGATGCC GAAACGGCCG AAGAGATCGT CAAGCTGTTC
ACCGAAGTGA TCATCGCGCC GGATGTCACG GAGGAGGCGA AGGCGATCGT CGCCCGCAAA
CCGAACCTGC GACTATTGTC TGCCGGTGGC CTGCCCGATC CGCGTGCCGC GGGCCTGACG
GCAAAGACCG TTTCCGGGGG CCTGCTCGTC CAGAGCCGCG ACAACGGCAT GGTCGAGGAT
CTGGAACTCA AGGTCGTCAC CAGGCGTGCG CCGACGGCGC AGGAACTTGA TGACATGAAG
TTCGCCTTCA AGGTCGGCAA ACATGTGAAG TCGAACGCCG TGGTCTATGC CAAGGACGGC
CAGACCGCTG GCATCGGCGC CGGCCAGATG AGCCGGGTCG ATTCCGCCCG CATTGCCGCG
CTGAAGGCCG AAGAGGCTGC CAAGGCGCTC GGCCTCGCAG TGCCGATGAC GCATGGCTCG
GCGGTCGCCT CCGAAGCCTT CCTGCCTTTT GCCGACGGTC TTCTGTCGAT GATCGCCGCG
GGGGCGACGG CGGTTATCCA GCCGGGCGGT TCGATGCGCG ACCAGGAGGT CATCGATGCC
GCTAACGAAC ACGGCGTCGC AATGGTCTTT ACCGGCATGC GCCATTTCCG GCACTGA
 
Protein sequence
MAVISKKIPA PDKVEIKTAL ISVFDKTGIV DLAHALSARG VRLLSTGGTY KAITAAGLAV 
TDVSEVTGFP EIMDGRVKTL HPTVHGGLLA IRDDSEHQEA MKTHGIEGID LAVINLYPFE
QVRAAGGDYP TTVENIDIGG PAMIRASAKN HAYVTTLTDP ADYAELLEQL SADDGKTAYA
FRQRMAAKAY ARTAAYDAMI SNWFAEALSI DTPRHRVIGG ALKEEMRYGE NPHQKAAFYV
TGEKRPGVST AALLQGKQLS YNNINDTDAA YELVAEFLPE RAPACAIIKH ANPCGVATGS
SLVEAYRRAL ACDSVSAFGG IIALNQTLDA ETAEEIVKLF TEVIIAPDVT EEAKAIVARK
PNLRLLSAGG LPDPRAAGLT AKTVSGGLLV QSRDNGMVED LELKVVTRRA PTAQELDDMK
FAFKVGKHVK SNAVVYAKDG QTAGIGAGQM SRVDSARIAA LKAEEAAKAL GLAVPMTHGS
AVASEAFLPF ADGLLSMIAA GATAVIQPGG SMRDQEVIDA ANEHGVAMVF TGMRHFRH