Gene GSU0609 details

Gene Information

Locus tag: GSU0609
Symbol: purH
ID: 2685298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 644573
End bp: 646138
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 65%
IMG OID: 637125276
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: NP_951667
Protein GI: 39995716
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
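
The start, end, and length fields in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(644573, 646138)
print(length, protein_length_aa(length))  # 1566 521
```

Under these assumptions the result agrees with the listed Gene Length (1566 bp) and Protein Length (521 aa).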


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.444523
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAGA TAACACGTGC GCTCATCAGC GTCTCGGACA AGACCGGCAT CCTTGATTTC 
GCCCGGGAAC TGGCCGGCTA CGGCGTGGAG ATCCTCTCCA CCGGCGGTAC CGCAAAGCTT
CTCCGCGACG CGGGACTCGC GGTCAAGGAC GTCTCCGATT TTACCGGCTT TCCGGAGATG
CTCGACGGCC GGGTCAAGAC GCTTCACCCC AAGGTCCACG GGGGCCTTCT GGGAATGCGC
TCCAACCCAG ACCATGTGGC AACCATGAAG GCGCACGGCA TCGAGCCCAT CGACTTGGTG
GTGGTGAACC TCTACCCCTT CGAGGCCACC GTGGCCAAGC CCGAATGCAC CCTGGAAGAT
GCCATCGAGA ACATCGATAT CGGCGGTCCC ACCATGCTCC GCTCCGCGGC CAAGAACAAC
GCCGACGTGA CCGTGCTCGT GGATCCGGCG GACTATCGGC CGGTTCTCGA TGAAATGAAG
GCATCCGGCG GCGCCGTGTC CCGGGAGACC AACTTTTGCC TGGCGGTGAA AGTCTACCAG
CACACTGCAG CCTATGACGG CGCCATTTCC AACTGGCTCG GCGCCCGGAC CGGCGAAGGG
ATCGCCGCCT ATCCCGACAC CGTCACGCTT CAGTTCAGAA AGGCCCAGGA GATGCGCTAC
GGCGAGAACC CCCACCAGGG TGCCGCCTTC TATGTGGAGC GCCAGGTGAA GGAAGCGTCC
GTCGCCACGG CCCGCCAGCT CCAGGGCAAG GAGCTTTCCT ACAACAACAT CGCCGACACC
GACGCGGCCC TGGAGTGCGT GAAGCAGTTT GCCGAAGGCC CCGCCTGCGT CATCGTAAAG
CATGCCAACC CCTGCGGCGT GGCCGTGGGC GGGACGTTGC TGGAGGCCTA CGACCGGGCC
TATGCCACCG ATCCCGAGTC GGCCTTCGGG GGCATCATCG CCTTCAACCG GGAACTGGAC
GCCGACACGG CGCGGGCAAT CTGCGACCGC CAGTTCGTGG AGGTCATCAT CGCTCCCGCC
GTATCGCCGG AGGCCACGGA AGTTGTTGCC GCCAAGAAGA ACGTGCGTCT CCTGGAGTGC
GGCACCTGGC CGGAGAAGCA ACAGCCGCGC CTCGACCTGA AGCGGGTGAA CGGCGGCATC
CTGGTGCAGG ACACCGATCT CGACCTGTAC GCCGAACTGA AGGTCGTGAC CAAGCGGCAG
CCCACGGAGC AGGAGATGAA GGACCTGCTC TTTGCCTGGC GCGTGGCCAA GTTCGTCAAG
TCCAACGCCA TTGTCTACGG CAAGGGCAAT ATGACCATCG GCGTGGGGGC CGGCCAGATG
AGCCGGGTCA ACTCGGCCCG CATCGCCGCC ATCAAGGCCG AGCACGCGGG GCTTGAGGTG
AAGGGGGCCG TTATGGCGTC CGATGCCTTC TTCCCCTTCC GCGACGGCAT CGACAACGCG
GCTGCCGTGG GCATCACCGC GGTTATCCAG CCGGGCGGCA GCATGCGCGA CGCTGAGGTG
ATCGCCGCCG CCGACGAGCA CGGCATGGCG ATGGTATTCA CCGGCATGAG GCATTTCAGA
CACTGA
 
Protein sequence
MAKITRALIS VSDKTGILDF ARELAGYGVE ILSTGGTAKL LRDAGLAVKD VSDFTGFPEM 
LDGRVKTLHP KVHGGLLGMR SNPDHVATMK AHGIEPIDLV VVNLYPFEAT VAKPECTLED
AIENIDIGGP TMLRSAAKNN ADVTVLVDPA DYRPVLDEMK ASGGAVSRET NFCLAVKVYQ
HTAAYDGAIS NWLGARTGEG IAAYPDTVTL QFRKAQEMRY GENPHQGAAF YVERQVKEAS
VATARQLQGK ELSYNNIADT DAALECVKQF AEGPACVIVK HANPCGVAVG GTLLEAYDRA
YATDPESAFG GIIAFNRELD ADTARAICDR QFVEVIIAPA VSPEATEVVA AKKNVRLLEC
GTWPEKQQPR LDLKRVNGGI LVQDTDLDLY AELKVVTKRQ PTEQEMKDLL FAWRVAKFVK
SNAIVYGKGN MTIGVGAGQM SRVNSARIAA IKAEHAGLEV KGAVMASDAF FPFRDGIDNA
AAVGITAVIQ PGGSMRDAEV IAAADEHGMA MVFTGMRHFR H
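
Both sequences in this record are laid out in space-separated blocks of 10 characters. The following is a minimal sketch of how one might strip that layout, compute a GC fraction, and translate the gene using the amino-acid assignments of translation table 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, alternative start codons are not handled, and only the first 30 bases of the gene are used here rather than the full sequence.

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
# These match the standard code; table 11 differs in its set of start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = clean("ATGGCAAAGA TAACACGTGC GCTCATCAGC")
print(translate(prefix))  # MAKITRALIS
```

The translated prefix matches the first block of the protein sequence in this record. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the listed GC content.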