Gene Rleg2_3915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3915 
SymbolpurH 
ID6982679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4061273 
End bp4062889 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content64% 
IMG OID643398638 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002283403 
Protein GI209551486 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.230829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTTA TTTCCAAGAA GATCCCCGCC CCCGACAAGG TCGAAATCAA GACCGCCCTC 
CTCTCCGTCT TCGACAAGAC CGGGATCGTC GAACTCGCCC AGGCACTGTC GGAACAGGGC
GTGCGGCTGC TGTCGACCGG CGGCACCTAC AAGGCGATCG CCGCCGCCGG CCTTGCCGTT
ACCGATGTTT CCGAAATTAC CGGCTTTCCC GAGATCATGG ACGGGCGGGT CAAGACGCTG
CATCCGACGG TGCATGGCGG CCTGCTGGCG ATCCGCGACG ATAGCGAACA CCAGGAGGCG
ATGAAAACCC ACGGCATCGA GGCCATCGAC CTCGCCGTCA TCAACCTCTA TCCCTTCGAA
GACGTGCGCG CCGCCGGCGG CGATTATCCG ACGACCGTCG AAAATATCGA CATCGGCGGC
CCGGCGATGA TCCGCGCTTC GGCCAAGAAC CATGCCTATG TGACGATCCT GACCGATCCG
AACGACTATG CCGAATTCAC CGAGCAGCTT TCCGCGGATG GTGGCAAGAC CGCCTACGCC
TTCCGGCAGC GCATGGCCGC CAAGGCCTAT GCCCGCACCG CGGCCTATGA CGCTGTGATT
TCCAACTGGT TCGCAGAAGC GCTGTCGATC GACACGCCGC GCCACCGCGT TATCGGCGGT
GCGCTGAAGG AAGAGATGCG TTACGGCGAA AATCCGCACC AGAAGGCCGC CTTTTACGTC
ACCGGCGAGA AGCGCCCGGG CGTTTCGACG GCTGCCCTCC TCCAGGGCAA GCAGCTCTCC
TACAACAATA TCAACGATAC CGATGCCGCT TACGAGCTGG TCGCCGAGTT CCTGCCGGAA
AAGGAGCCGG CCTGCGCCAT CATCAAACAT GCCAATCCCT GTGGCGTCGC CACCGGGTCG
AGCCTGGTCG AGGCCTATCG GCGGGCGTTG GCCTGCGACA GCGTTTCCGC CTTCGGCGGC
ATCATTGCAC TCAATCGGAC GCTGGATGCC GAAACGGCTG AGGAGATCGT CAAGCTCTTC
ACCGAAGTGA TCATCGCGCC CGATGTGACC GAGGAGGCGA AGGCGATCAT CGCCCGCAAG
CCGAACCTGC GGCTGCTGTC GGCCGGCGGC CTGCCCGATC CGCGTGCCGC GGGCCTGACG
GCGAAGACCG TTTCCGGCGG CCTGCTGGTC CAGAGCCGCG ACAACGGCAT GGTCGAGGAT
CTGGAGCTCA AGGTCGTCAC CAAGCGCGCG CCGACGGCTC AGGAGCTTGA TGATATGAAG
TTCGCCTTCA AGATCGGCAA ACACGTGAAA TCGAACGCCG TGGTCTATGC CAAGGACGGC
CAGACCGCCG GCATCGGCGC CGGCCAGATG AGCCGGGTCG ATTCTGCCCG TATCGCCGCG
CTGAAGGCGG AAGAAGCCGC CAAGGCGCTC GGCCTTGCCG TGCCGATGAC GCATGGCTCG
GCAGTCGCCT CCGAAGCCTT CCTGCCGTTT GCCGACGGTC TCTTGTCGAT GATCGCAGCG
GGGGCGACGG CGGTGATCCA GCCTGGCGGT TCGATGCGCG ACCAGGAAGT CATCGATGCC
GCCGACGAAC ACGGCATTGC GATGGTCTTT ACCGGCATGC GCCATTTCCG GCACTGA
 
Protein sequence
MAVISKKIPA PDKVEIKTAL LSVFDKTGIV ELAQALSEQG VRLLSTGGTY KAIAAAGLAV 
TDVSEITGFP EIMDGRVKTL HPTVHGGLLA IRDDSEHQEA MKTHGIEAID LAVINLYPFE
DVRAAGGDYP TTVENIDIGG PAMIRASAKN HAYVTILTDP NDYAEFTEQL SADGGKTAYA
FRQRMAAKAY ARTAAYDAVI SNWFAEALSI DTPRHRVIGG ALKEEMRYGE NPHQKAAFYV
TGEKRPGVST AALLQGKQLS YNNINDTDAA YELVAEFLPE KEPACAIIKH ANPCGVATGS
SLVEAYRRAL ACDSVSAFGG IIALNRTLDA ETAEEIVKLF TEVIIAPDVT EEAKAIIARK
PNLRLLSAGG LPDPRAAGLT AKTVSGGLLV QSRDNGMVED LELKVVTKRA PTAQELDDMK
FAFKIGKHVK SNAVVYAKDG QTAGIGAGQM SRVDSARIAA LKAEEAAKAL GLAVPMTHGS
AVASEAFLPF ADGLLSMIAA GATAVIQPGG SMRDQEVIDA ADEHGIAMVF TGMRHFRH