Gene Cagg_2677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2677 
SymbolpurH 
ID7269584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3280415 
End bp3281932 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content57% 
IMG OID643567503 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002463981 
Protein GI219849548 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000481305 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGGGCAT TAATCAGCGT TTACGATAAG TCTGGGATTG TTGAGTTCGC ACAGGAGTTA 
CATGCACTTG ACGTTGAGAT TATCTCAACC GGTCAAACCC AGCGAGTCTT ACGTGAGGCC
GGCATCCCGG CAGTAGCAGT AAGCGACATC ACCCATTTTC CTGAAATTCT TGACGGTCGA
GTCAAAACAC TGCATCCGGC AATCCATGCC GGTCTCCTGG CTCGCCGCGA TGTACCAACC
CACCTGGCCG AACTTGCTGC TCATGGCCTC AAACCCATCG ATTTGGTCGT TGTGAACCTG
TACCCCTTTG CCGCCACGAT CGGTCGTCCC GGCGTAACGA TGGCCGAGGC CCAAGAGCAG
ATTGATATTG GTGGCGTTGC TCTGCTACGC GCCGCCGCCA AGAACTTCCC GGCTGTACTC
GTGCTGGTTG ACCCGGCTGA CTACGCAGGG GTATTGGCCG GGTTACGCGC CGGCGAGGTG
CCGTTGGCCG AACGGCAGCG GCTAGCAGCA AAGGCCTTTG CCCATACCGC CGAATACGAT
GCAGCGATCG CAGCCTATTT GCGTACCGAT CCCTTCCCTG ATGTGTTACC GATGGCATGG
CGCAAATACC AATCCTTGCG CTACGGCGAA AATCCCCACC AAGCCGCTGC ACTCTACGGC
AATTTCGGTG CGTTCTTCCA ACAGTTGCAC GGCAAAGAGC TGAGCTATAA CAATATTCTG
GATACAACCG CGGCTCAAGA ACTTATCGAA GAGTTTCCTC CCGCTGAGGG AGCGGCAGTG
GCGATTATCA AGCATACGAA TCCCTGCGGT GTAGCCATCG GCCCCGATCT GCGCAGTGCT
TGGGAAGCAG CCTTCGCCAC CGATCGTGAT GCCCCTTTTG GCGGTATCAT TGCCGTGAAC
CGTCCGGTCG ATCTTGCCTT TGCTGAAGCG GTAAATGAAA TCTTCTCCGA AATTATTATC
GCTCCAGAAT TCCAACCTGA TGCGCTCGAA TTACTGCAAC GGAAGAAAAA TCGCCGCTTA
CTGCGCAATC TGCAACCGGT CACCCGCACC GGTGAATGGC AGATTCGCAG TGTACCCGGC
GGAGTACTCG TCCAGGAAGC CGATCATGCG CCGCTAGCAG CCGAAGAATG GCGGGTAGTT
ACCAAACGCG CTCCTACCGA TGCCGAAGTA GCCGCCCTCC GGTTTGGGTG GCGCGTCGTC
AAACATGTGA AATCCAATGC AATCGTTTAT GCAGCGGCTG ACCGCACCCT CGGCATTGGG
GCCGGTCAAA TGAGCCGTGT TGATAGCTCA CGACTCGCAG TTTGGAAAGC CCAACAAGCG
GGGCTTGATC TACGTGGGAG TATCGTGGCA AGTGATGCCC TGTTCCCTTT CGCCGATGGG
GTCGAAGCAG CCATTGCCGC CGGAGCAACA GCAATCATTC AGCCCGGTGG TTCGGTCCGT
GATGAAGAGG TTATCGCCGC CGCAGATGCC GCCGGAGCCG CGATGGTCTT CACCGGCCAC
CGCCATTTCC GCCACTAG
 
Protein sequence
MRALISVYDK SGIVEFAQEL HALDVEIIST GQTQRVLREA GIPAVAVSDI THFPEILDGR 
VKTLHPAIHA GLLARRDVPT HLAELAAHGL KPIDLVVVNL YPFAATIGRP GVTMAEAQEQ
IDIGGVALLR AAAKNFPAVL VLVDPADYAG VLAGLRAGEV PLAERQRLAA KAFAHTAEYD
AAIAAYLRTD PFPDVLPMAW RKYQSLRYGE NPHQAAALYG NFGAFFQQLH GKELSYNNIL
DTTAAQELIE EFPPAEGAAV AIIKHTNPCG VAIGPDLRSA WEAAFATDRD APFGGIIAVN
RPVDLAFAEA VNEIFSEIII APEFQPDALE LLQRKKNRRL LRNLQPVTRT GEWQIRSVPG
GVLVQEADHA PLAAEEWRVV TKRAPTDAEV AALRFGWRVV KHVKSNAIVY AAADRTLGIG
AGQMSRVDSS RLAVWKAQQA GLDLRGSIVA SDALFPFADG VEAAIAAGAT AIIQPGGSVR
DEEVIAAADA AGAAMVFTGH RHFRH