Gene Acid345_4470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4470 
SymbolpurH 
ID4070953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5304131 
End bp5305702 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content60% 
IMG OID637986509 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_593544 
Protein GI94971496 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAA TTCGTCGCGC TATTCTCTCC GTTACTGACA AAATCGGCCT GTCCGACTTC 
GCACGCACGT TGGCGAAGCA TGGCGTTGAA CTGATCTCCA CCGGCGGCAC GGCGAAGATG
CTGCGCGACG CCGGCATCCC CGTCAAAGAC ATCTCCGAAT TGACCGGCTT TCCCGAGATG
CTCGATGGCC GCGTGAAGAC CCTTCACCCC AAGGTGCATG GCGGCATCCT GCACCTGCGC
GCCAACGAAG AGCACGTGGC GACGGTGAAA GAGCACGGCA TCCAGCCAAT CGACATGGTG
GTGGTGAACC TGTACGCGTT CGAAAAGACG GCATCGAAGC CCGGGGCGCA CTTCGAAGAA
ATCATCGAGA ACATCGATAT CGGCGGGCCG AGCATGGTGC GCTCGGCGGG CAAGAATTTC
CAGGACGTGG CGATCGTCAC TTCGCCCGAC CAGTACGCGC AGGTCGCTGA AGAGATGGAC
AAGAGCGGCG GTTCGGTTTC CAAGCAGATG CATTGGAAGC TGGCGCAGCG CGCGTTCGCC
ACGACCGCCG CGTACGACTC AGCAATTGCT TCGGCGCTGG AGCGCGTGAT GGTGGACGAT
GCCGGAAAGT TCGACATATC GAACATCCAC GGCGGCACTG GTTTCCCTGA GATCTTGCGA
CTATTGTTCC GCAAATCCAT GGATCTTCGC TACGGCGAGA ACCCGCACCA GAAGGCGGCG
CTCTACTCCA ACGGCACCGA TCTTGGCGTC GCCAACGGCA AGCAGCTCCA GGGCAAGGAG
CTTTCGTACA ACAATATCGT CGATCTGCAG GCAGCGTGGG ACCTGGCGCA GGAGTTCGAT
GAGCCCGTCT GCGCGATCAT CAAGCACACC AATCCGTGCG GCACGGCAGT CAGTTCGATA
CTTGTCGAAG CGTATAAACG TGCTCTCGAA GCCGATCCGG TTTCGGCGTT CGGGGGTGTG
ATTGGCGTAA ACCGCGAGAT CGACGAAGCA ACAGCGGAAG AAATGGCGAA GCTTTTCCTC
GAAGTGATCG CCGCTCCGAG TTTCAGCGAG GGAGCGAAGG CGCGCTTCGC CGCGAAGAAG
AACTTGCGGC TCGTCGAAGT AAAGGCGCTC GACCAGAAAT ACACGCTGAA GAATGTATCC
GGCGGCGTGC TGGTGCAGGA CAACGACATT CGTCCGCTGA CCGACGCAGA TTTGAAAGTT
GTCAGCGAGC GCAAGCCCAC TGAATCCGAG ATGAAGGACC TGCTCTTCGC GTGGAAGGTC
TGCAAGCATG TGAAATCGAA TGCGATCCTC TACGCGAAAG ACGGCCGCAG CGTGGGCGTG
GGCGCCGGCC AGATGAGCCG CGTGGATTCA GCGCGCATCG GTGCGATGAA AGCCGTGTTG
CCTCTGAAGG GTTGCGTCGC GGCGAGCGAT GCGTTCTTCC CGTTCCCTGA TGGAGTCGAA
GTCATCGCCG AAGCTGGAGC GACGGCGATC ATCCAGCCTG GCGGATCAGT GAAAGACCAG
GAAGTGATTG ACACCGCGAA CCGGTTGGGA CTGGCAATGG TGCTCACGGG TGTGCGGCAC
TTCCGGCACT AA
 
Protein sequence
MAKIRRAILS VTDKIGLSDF ARTLAKHGVE LISTGGTAKM LRDAGIPVKD ISELTGFPEM 
LDGRVKTLHP KVHGGILHLR ANEEHVATVK EHGIQPIDMV VVNLYAFEKT ASKPGAHFEE
IIENIDIGGP SMVRSAGKNF QDVAIVTSPD QYAQVAEEMD KSGGSVSKQM HWKLAQRAFA
TTAAYDSAIA SALERVMVDD AGKFDISNIH GGTGFPEILR LLFRKSMDLR YGENPHQKAA
LYSNGTDLGV ANGKQLQGKE LSYNNIVDLQ AAWDLAQEFD EPVCAIIKHT NPCGTAVSSI
LVEAYKRALE ADPVSAFGGV IGVNREIDEA TAEEMAKLFL EVIAAPSFSE GAKARFAAKK
NLRLVEVKAL DQKYTLKNVS GGVLVQDNDI RPLTDADLKV VSERKPTESE MKDLLFAWKV
CKHVKSNAIL YAKDGRSVGV GAGQMSRVDS ARIGAMKAVL PLKGCVAASD AFFPFPDGVE
VIAEAGATAI IQPGGSVKDQ EVIDTANRLG LAMVLTGVRH FRH