Gene Smed_3189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3189 
SymbolpurH 
ID5324068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3358668 
End bp3360278 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content64% 
IMG OID640792137 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001328848 
Protein GI150398381 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTCG CCTCCAAGAA AATTCCCGCC CCGGACGAAG TCCGGATCAA AACCGCCCTC 
CTTTCGGTCT CCGACAAATC GGGCATCGTC GAACTCGCCC GCCACCTCAA TGACAGGGGC
GTGCGGCTGG TATCGACCGG CGGGACGCAC AAGGCGCTTG CCGATGCGGG CCTTCCCGTC
AGCGACGTTT CGGAGCTGAC GGGGTTTCCG GAGATCATGG ACGGTCGCGT GAAAACCCTT
CATCCCGGCG TTCACGGCGG TCTGCTGGCT ATTCGCGACG ATGCGGAGCA TGCCGGCGCG
ATGAGCGCAC ACGGTATTAC AGCGATCGAT CTCGCCGTCA TCAATCTCTA CCCCTTCGAG
GAGGTTCGCG CCAAGGGCGG CGACTATCCG ACCACGGTCG AGAATATCGA CATTGGCGGT
CCCGCAATGA TCCGGGCGTC GGCCAAGAAC CACGCCTATG TGACCGTTGT GACCGACCCG
GCGGATTATC CGCTGCTGCT GGAGGAGATC GCGGGCGGCA CGACGCGCTA TGCCTTCCGC
CAGAAAATGG CCGCCAAGGC CTATGCCCGC ACAGCGGCCT ATGATGCGGC AATCTCCAAC
TGGTTCGCCG AGGTACTCGA CACGCCCATG CCGCGCCACC GCGTCATCGG CGGGGTGCTC
AAGGAAGAGA TGCGCTACGG CGAAAACCCG CATCAGAAGG CCGGTTTCTA TGTGACCGGT
GACAAGCGGC CGGGGGTCGC CACTGCAGCG CTTCTCCAAG GCAAGCAGCT CTCCTACAAC
AACATCAACG ACACCGACGC CGCCTTCGAG CTGGTGGCGG AATTCCTGCC GGAAAAGGCG
CCTGCCTGCG CCATTATCAA GCATGCCAAC CCCTGCGGCG TTGCGACCGC GCCATCGCTC
GCGGAAGCAT ATCGCCGGGC GCTTGCCTGT GATTCGACCT CCGCTTTCGG CGGCATTATC
GCGCTTAATC AGGAACTCGA CGCGGCGACC GCCGAAGAGA TCGTGAAGCT CTTCACCGAA
GTGATCATCG CCCCGTCGGT CAGCGACGAG GCGAAAGCGA TCATCGCCCG GAAGCCCAAT
CTGCGGCTGC TTGCGACCGG CGGCCTGCCG GATCCGCGCA CGCCCGGTCT GACGGCAAAG
ACGGTGGCCG GGGGCCTTCT TGTCCAGACG CGCGACGACG GCATGATCGA AGACATCGAA
CTGAAGGTGG TCACGAAGCG CACGCCGACG GCGCAGGAGC TCGAAGACAT GAAATTTGCC
TTCAAGGTGG CCAAGCACGT CAAGTCGAAT GCCGTCGTCT ACGCGAAAGG CGGTCAGACG
GCGGGTATCG GCGCCGGACA GATGAGCCGG GTCGATTCCG CGAGAATTGC TGCCATCAAG
GCGGAAGAGG CGGCGAAGGC GCTCGGTCTC GCCGAGCCTC TGACACGCGG CTCCGCGGTT
GCCTCGGAAG CCTTCCTGCC GTTCGCTGAC GGCCTTCTGT CCGCGATCGC TGCGGGGGCC
ACCGCAGTGA TCCAGCCGGG CGGCTCCATG CGCGACGAGG AGGTGATCGC AGCGGCCGAC
GAGCACAATG TCGCGATGGT CTTCACCGGG ATGCGGCATT TCCGGCACTG A
 
Protein sequence
MAVASKKIPA PDEVRIKTAL LSVSDKSGIV ELARHLNDRG VRLVSTGGTH KALADAGLPV 
SDVSELTGFP EIMDGRVKTL HPGVHGGLLA IRDDAEHAGA MSAHGITAID LAVINLYPFE
EVRAKGGDYP TTVENIDIGG PAMIRASAKN HAYVTVVTDP ADYPLLLEEI AGGTTRYAFR
QKMAAKAYAR TAAYDAAISN WFAEVLDTPM PRHRVIGGVL KEEMRYGENP HQKAGFYVTG
DKRPGVATAA LLQGKQLSYN NINDTDAAFE LVAEFLPEKA PACAIIKHAN PCGVATAPSL
AEAYRRALAC DSTSAFGGII ALNQELDAAT AEEIVKLFTE VIIAPSVSDE AKAIIARKPN
LRLLATGGLP DPRTPGLTAK TVAGGLLVQT RDDGMIEDIE LKVVTKRTPT AQELEDMKFA
FKVAKHVKSN AVVYAKGGQT AGIGAGQMSR VDSARIAAIK AEEAAKALGL AEPLTRGSAV
ASEAFLPFAD GLLSAIAAGA TAVIQPGGSM RDEEVIAAAD EHNVAMVFTG MRHFRH