Gene Gura_3847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3847 
SymbolpurH 
ID5166435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4494327 
End bp4495892 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content60% 
IMG OID640551329 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001232570 
Protein GI148265864 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0964081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGA TTACCCGTGC GCTGATCAGC GTATCTGACA AGACCGGCAT CGTCGAGTTC 
TCCAGGGAAC TGGCCGGCTA TGGCGTTGAA ATTCTCTCTA CCGGCGGTAC TGCCAAGCTG
CTGCGGGAGG CGGGTCTAAA GGTCAAGGAC GTTTCGGAGT TTACCGGTTT TCCGGAAATG
CTAGACGGAC GCGTCAAGAC CCTCCATCCA AAGGTTCACG GCGGCCTCCT CGGCATGCGG
GGGAATCCTG AACACGTGGC AACCATGAAG GCGCACGGCA TCGAGCCGAT CGACATGGTG
GTCGTCAACC TTTACCCCTT CGAGGCGACC GTTGCCAAGC CGGACTGCAC GCTTGAGGAC
GCCATCGAGA ACATCGATAT CGGCGGTCCG ACCATGCTCC GTTCGGCAGC CAAGAACAAC
GCAGATGTGA CTGTTGTCGT CGATCACAAC GACTATCGGC TGGTGCTGGA TGAAATGAAG
GCTGCGGGAG GCGGCGTGTC GAAGGAGACC AACTTCCGCC TGGCGGTAAA GGTCTACCAG
CACACCGCAG CCTATGACGG TGCGATCTCC AACTGGTTGG GAGCCCGGAC CGGCGAAGGG
GTTGCCGCAT ACCCCGACAC CCTGACTCTC CAGTTCAAAA AGGCCCAGGG GATGCGTTAC
GGCGAGAATC CCCACCAGTC GGCAGCTTTC TATGTGGAGC GGGACGTCAA GGAGGCGTCG
GTCGCTACTG CACGGCAGCT CCAGGGGAAA GAGCTTTCCT ACAACAACAT CGGCGACACC
GATGCCGCCC TGGAGTGCGT GAAACAGTTC GCTGAAGGGC CGGGCTGCGT CATCGTCAAG
CATGCAAACC CATGCGGGGT CGCGATCGGC GATACGCTTC TGGACGCCTA CGATCGTGCC
TACAAGACCG ATCCCGAGTC CGCTTTCGGC GGAATCATTG CCTTCAACGG CGAACTGGAC
GAGGCGACGG CCAAAGCCAT TGTCGAGCGG CAGTTCGTTG AGGTTATCAT CGCCCCCAAG
GTTTCCGCAA AGGCGAGCGA GGTGGTTGCT GCCAAGAAGA ACGTGCGTCT TCTGGAGTGC
GGCACGTGGC AGAAGGAACC GATGCCGCGA CTGGATTTCA AGCGTGTCAA CGGAGGTCTC
CTGGTCCAGT ATACGGATCT CGCCCTCCAC GGGGAGCTGA AGGTCGTGAC CAAGCGGGCT
CCGACGGAAA AGGAGATGAT CGACCTTCTT TTCACCTGGC GGGTAGCCAA GTTCGTCAAA
TCCAACGCCA TTGTCTACGG CAAGGACGGC ATGACCATCG GCGTCGGCGC GGGGCAGATG
AGCCGGGTCA ACTCGGCCCG CATTGCGGCC ATCAAGGCGG AACATGCCGG CCTTCCTGTA
GCCGGTTCGG TGATGGCTTC CGACGCCTTC TTCCCGTTCC GCGACGGGTT GGATAATGCA
GCTGCGGTTG GCATCACCGC CGTCATCCAG CCGGGTGGGA GCATGCGCGA CGAGGAAGTG
ATCGCAGCAG CCGATGAACA CGGCATGGCG ATGGTGTTTA CTTCCATGAG GCATTTCAGG
CATTGA
 
Protein sequence
MAKITRALIS VSDKTGIVEF SRELAGYGVE ILSTGGTAKL LREAGLKVKD VSEFTGFPEM 
LDGRVKTLHP KVHGGLLGMR GNPEHVATMK AHGIEPIDMV VVNLYPFEAT VAKPDCTLED
AIENIDIGGP TMLRSAAKNN ADVTVVVDHN DYRLVLDEMK AAGGGVSKET NFRLAVKVYQ
HTAAYDGAIS NWLGARTGEG VAAYPDTLTL QFKKAQGMRY GENPHQSAAF YVERDVKEAS
VATARQLQGK ELSYNNIGDT DAALECVKQF AEGPGCVIVK HANPCGVAIG DTLLDAYDRA
YKTDPESAFG GIIAFNGELD EATAKAIVER QFVEVIIAPK VSAKASEVVA AKKNVRLLEC
GTWQKEPMPR LDFKRVNGGL LVQYTDLALH GELKVVTKRA PTEKEMIDLL FTWRVAKFVK
SNAIVYGKDG MTIGVGAGQM SRVNSARIAA IKAEHAGLPV AGSVMASDAF FPFRDGLDNA
AAVGITAVIQ PGGSMRDEEV IAAADEHGMA MVFTSMRHFR H