Gene Daud_1629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1629 
SymbolpurH 
ID6026187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1717671 
End bp1719212 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content65% 
IMG OID641594452 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001717763 
Protein GI169831781 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATGC AGCGGGCACT GATCAGTGTT TCCGACAAGC GGGGGTTGCT GGAGCTGGCT 
CAGGGCCTGA CCGAACTGGG GATGGAGATC GTGTCCACCG GGGGCACGGC CCGGGTACTC
CGGGAGATGG GCTTCGGGGT GCTTGGAGTG TCCGAGGTCA CCGAGTTTCC CGAGATCCTC
GGAGGCCGGG TAAAGACGCT GCACCCCCGC ATCCACGGGG GAATTCTAGC CCGGCGCACG
CCCGAGCACA TGGGGCAACT GGCCGAGTTT GGGATCCGCC CGGTGGACCT GGTGGTGGTC
AACCTCTATC CGTTTAAGGA GACCATCGCC CGGCAGGGCG TCACCCTGGA GGAGGCCGTT
GAACAGATCG ACGTCGGCGG GCCGGCGATG CTTCGGGCAG CGGCCAAGAA CCACCGTCAC
GTGCTGGTGG TCGTGAACCC GGACCGGTAC CCGGAAGTGC TGGCCGCCCT CAAGGCGGGA
ACGGTCGACG ACCGGATGCG CCTGACCTTG GCCCGGGAGG CCTTTGCGCA CACCGCCCAC
TACGACGCCG TGATCGCCGC TTACCTGGGC GAGTTCGTGG AGGAACAGGA CCTCTTCCCG
GGGGAAATCG CGCTGCCGTT TGAGAGAAAG CAGCTCTTGC GCTACGGCGA GAACCCCCAC
CAGAAGGCGG CCTTTTACCA GGACCCGCGC CGGCGGGGAG CTTCGGTGAC TTCCGCCGTG
CAGCGGCAGG GCAAGGAGCT TTCGTACAAC AACATCCTCG ACCTGAATGC CGCCCTGGAA
CTGGTCCGGG AATTCAGTAC GCCGGCGGCG GTGATCGTCA AGCACAACAA CCCGTGTGGA
ACGGCCTGCC GCCCGTCTCC GGCCGAGGCG TACCGCCGGG CCTTTGCGGC CGACGAGGTT
TCCGCCTTCG GCGGAATTGT CGCTTTTAAC TGCCCGGTGG ACGAAGAGGC GGCGCATGAG
ATGGTCAAGA TTTTCCTGGA GGCGGTCATC GCCCCGCAGT TCACGCCCGA GGCGCTGGCG
GTATTGAGTG ACAAGAAGAA TCTGCGGGTG CTCGAAACTG GAGACCTGAC CCCGCTCACC
CTGGACTGGA TGGACGTCCG GAAAGTGAAC GGGGGCCTTC TGGTGCAGCA GGCTGACCGC
CAGCTCTTTC CCTACACCAA CTTCCGGGTG GTGACCCGGC GCGCACCCAC ACCTGAAGAA
CTTGTCGAGA TGGATTTCGC TTTCAAGATC GTCAAGCACG TCAAGTCCAA CGCCATCGTG
GTGACCCGTG AGCAGACGCT CATCGGCGTG GGGGCCGGGC AGATGAACCG GGTCGGAGCG
GCGCGGATCG CCCTGGAACA GGCCGGGGAC AAGGCTCTGG GCGCCGTGCT GGCATCCGAC
GCCTTTTTCC CGTTTGCGGA CACCGTGGTC GCGGCGGCCG AGGCGGGCAT TACAGCCATC
GTCCAGCCAG GAGGCTCGAT GCGGGACCAG GAGTCGATCG AAGCTGCGGA CGCCCGGGGG
ATCGCGATGG TGTTCACCGG CGTCCGCCAC TTCAAGCACT AA
 
Protein sequence
MAMQRALISV SDKRGLLELA QGLTELGMEI VSTGGTARVL REMGFGVLGV SEVTEFPEIL 
GGRVKTLHPR IHGGILARRT PEHMGQLAEF GIRPVDLVVV NLYPFKETIA RQGVTLEEAV
EQIDVGGPAM LRAAAKNHRH VLVVVNPDRY PEVLAALKAG TVDDRMRLTL AREAFAHTAH
YDAVIAAYLG EFVEEQDLFP GEIALPFERK QLLRYGENPH QKAAFYQDPR RRGASVTSAV
QRQGKELSYN NILDLNAALE LVREFSTPAA VIVKHNNPCG TACRPSPAEA YRRAFAADEV
SAFGGIVAFN CPVDEEAAHE MVKIFLEAVI APQFTPEALA VLSDKKNLRV LETGDLTPLT
LDWMDVRKVN GGLLVQQADR QLFPYTNFRV VTRRAPTPEE LVEMDFAFKI VKHVKSNAIV
VTREQTLIGV GAGQMNRVGA ARIALEQAGD KALGAVLASD AFFPFADTVV AAAEAGITAI
VQPGGSMRDQ ESIEAADARG IAMVFTGVRH FKH