Gene Cag_0258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0258 
SymbolpurH 
ID3748179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp288300 
End bp289874 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content47% 
IMG OID637772784 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_378577 
Protein GI78188239 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATC CTGTTATTAA ACGGGCGCTG GTGTCGGTGT CCGATAAAAC TGGTATTGTT 
GATTTTTGCC GTGAGCTTTC AGAGCTTGGC GTTGAAATTT TTTCAACCGG TGGTACCTTA
AAAACCCTTC AGGATGCAGG CATTGCGGCT GCATCAATCT CCACCATTAC AGGTTTTCCT
GAAATTATGG ATGGACGTGT TAAAACACTG CATCCCAAAA TTCACGGTGG ATTGCTTGCT
GTACGCGAAA ACGCCAACCA CGTCAAGCAG GCAGCAGATA ATGGCATTAG CTTTATTGAT
TTGGTTGTGG TGAACCTTTA TCCTTTTGAG GCAACGGTTG CAAAACCCAA CGTTTCATTT
GAAGATGCTA TTGAAAATAT TGACATCGGT GGACCTTCAA TGTTGCGCAG TGCAGCAAAA
AATAATGAAT CCGTAACGGT GGTAACCGAT AGCGCCGATT ATGCGTTAGT GTTGCAAGAA
ATGCGTGCCA ACAATGGAGC AACAACTCGT GCTACCCGCT TGCATTTAGC ACTAAAAGTG
TTTGAACTAA CCTCCCGCTA CGATCGCGCT ATTGCAACCT ACCTTGCAGG CAAAGTATCA
GCAGCCGAAG CCGCCGCCAG CACCATGAGC GTTCAACTTG CTAAAGAGCT TGATATGCGC
TATGGCGAAA ATCCTCACCA AAACGCAGGC TTGTATCGGC TAACCGATAG CAACGGTACC
CGCTCTTTTG AAGAGTTTTT CGAAAAACTG CACGGCAAAG AGCTTTCCTA CAACAACATG
CTCGATATTG CCGCTGCAAC CTCGCTTATT GAAGAATTCC GTGGCGAAGA GCCAACGGTG
GTGATTATTA AGCACACCAA TCCTTGCGGT GTTGCACAAG CCTCAAGCTT AGTTGATGCG
TGGCACCGTG CATTCTCAAC TGATACGCAA GCACCATTTG GCGGCATTGT AGCTTTTAAC
CGTCCGCTTG ATATGGCTGC GGCTCAAGCT GTTAATGAAA TTTTCACCGA AATTTTGATT
GCTCCCGCCT TTGAGGATGG AGTTCTTGAG TTGTTGATGA AAAAGAAGGA TCGTCGTCTT
GTGGTGCAGA AAAAAGCGTT ACCACAAAGT GGATGGGAAT TTAAATCCAC GCCATTTGGC
ATGTTGGTGC AAGAGCGCGA CAGCAAAATT GTTGCCAAAG AGGATCTCAA AGTAGTAACC
AAACGCCAGC CAACTGAGGC TGAAATTGCC GACTTGATGT TTGCATGGAA AATTTGCAAA
CACATTAAAT CGAACACCAT TCTGTACGTT AAAAACCGCC AAACCTATGG CGTTGGTGCA
GGGCAGATGT CGCGCGTGGA TTCCTCAAAA ATTGCACGTT GGAAGGCTTC CGAAGTAAAC
CTTGATCTGC ACGGCTCCGT TGTAGCATCC GACGCATTCT TCCCCTTTGC TGACGGCTTA
CTTGCAGCGG CTGAAGCGGG CGTAACGGCA GTTATTCAAC CCGGTGGCTC CATTCGCGAT
AACGAGGTAA TTGAAGCGGC TGATGCCAAC AACCTCGCTA TGGTCTTTAC CGGCATGAGG
CATTTCAAGC ACTAA
 
Protein sequence
MSDPVIKRAL VSVSDKTGIV DFCRELSELG VEIFSTGGTL KTLQDAGIAA ASISTITGFP 
EIMDGRVKTL HPKIHGGLLA VRENANHVKQ AADNGISFID LVVVNLYPFE ATVAKPNVSF
EDAIENIDIG GPSMLRSAAK NNESVTVVTD SADYALVLQE MRANNGATTR ATRLHLALKV
FELTSRYDRA IATYLAGKVS AAEAAASTMS VQLAKELDMR YGENPHQNAG LYRLTDSNGT
RSFEEFFEKL HGKELSYNNM LDIAAATSLI EEFRGEEPTV VIIKHTNPCG VAQASSLVDA
WHRAFSTDTQ APFGGIVAFN RPLDMAAAQA VNEIFTEILI APAFEDGVLE LLMKKKDRRL
VVQKKALPQS GWEFKSTPFG MLVQERDSKI VAKEDLKVVT KRQPTEAEIA DLMFAWKICK
HIKSNTILYV KNRQTYGVGA GQMSRVDSSK IARWKASEVN LDLHGSVVAS DAFFPFADGL
LAAAEAGVTA VIQPGGSIRD NEVIEAADAN NLAMVFTGMR HFKH