Gene Cphamn1_1877 details

Gene Information

Locus tag: Cphamn1_1877
Symbol: purH
ID: 6375568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 2037180
End bp: 2038757
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 53%
IMG OID: 642684373
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001960275
Protein GI: 189500805
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
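
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2037180
END_BP = 2038757


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1578
print(protein_length(length_bp))  # 525
```

Under those assumptions both results agree with the listed Gene Length (1578 bp) and Protein Length (525 aa).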


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0128464
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATC CTGTTATCAA GCGTGCATTA GTGTCAGTTT CTGATAAATC CGGCGTTGTT 
GAATTCTGCC GCGAACTCTC ATCCATGGGG GTTGAAATCT TCTCGACCGG AGGAACTCTG
CGGAAACTTC AGGAATCCGG TGTTGCAGCG GCTTCCATTT CAACCATTAC GGGCTTTCCG
GAAATTATGG ATGGTCGGGT GAAAACGCTG CATCCGAAAA TCCATGGCGG ACTGCTTGCT
GTGCGTGATA ATGCCGATCA TATCGCTCAG GCGCGGGATA ACGGTATCGG TTTTATCGAC
ATGGTTGTCG TCAATCTCTA TCCGTTCCAG GAGACAGTCG CGAAACCTGA TGTGACGTTT
GAAGAGGCGA TTGAAAATAT CGATATCGGC GGACCTTCGA TGCTTCGCAG CGCGGCAAAG
AACCATGAGT CGGTGACCGT TATCACTGAA AGCGCCGATT ACCGGACGGT ACTCGATGAA
ATGCGGGAGA ACAACGGCGC GACCACACGT TCGACCCGAC TGAAGCTGGC AGGAAAAGTG
TTTACACTGA CATCCCGTTA CGACCGGGCA ATCGCGGATT ACCTGGCTGC ATCTTCAGAG
GGAGAGGCAT CTTCGGAAGC TGGATCGATC AGTGTCCGGC TGGAAAAAGA GATCGATATG
CGCTATGGTG AGAACCCGCA TCAGAACGCC GGTTTCTATC GTATGGACGA CGGCAGCGGG
TCACGCTCGT TTGAGGAGTA TTTCCGGAAA CTTCACGGTA AGGATCTTTC ATACAACAAC
ATGCTCGATA CTGCCGCGGC GACCGCTCTG ATTGAAGAGT TCAGGGATGA AGCGCCGGCG
GTGGTTATTA TCAAACATAC CAATCCTTGC GGTGTCGCGC AGGCCGATAC GCTTGTCGAG
GCCTATCGCA AGGCGTTCTC AACCGATACA CAGTCTCCTT TCGGCGGGAT CATCGCATGC
AACAGACCGC TCGATATGGA AACCGCGAAG GCCATTGATG AAATCTTCAC CGAAATCCTT
ATTGCTCCGG CCTATGAAGA AGGGGTTCTT GATATGCTGA TGAAGAAGAA GAACCGGCGT
CTTCTTCTCC AGAGAAAACC TCTTCTGCAG GAGGTTACGG AATACAAGTC AACCCGGTTC
GGCATGCTGG TACAGGAAAG AGACAGCCGG ATTGCTTCCC GGGATGACCT GAAAGTCGTC
ACGAAACGTC AGCCTTCAGC GCAGGAGCTC GATGATCTCA TGTTTGCATG GAAGATCTGC
AAGCATGTGA AGTCAAACAC GATCGTCTAT GTGAAGAACC GACAGACAGT CGGGGTTGGA
GCAGGACAGA TGTCCCGTGT CGATTCAGCG AAAATCGCCC GTTCAAAAGC TGCCGAGGCG
GGCCTTGACC TGAACGGATC CGCGGTCGCG TCAGACGCGT TTTTCCCGTT TGCCGACGGA
CTGCTCGCAG CGGCAGAAGC GGGAGCTATG GCGGTTATAC AGCCCGGCGG ATCGGTTCGC
GATGATGAGG TTATCGCCGC CGCCGACGAG CATGACCTCG CGATGGTGTT CACCTCTATG
CGGCACTTCA AGCATTGA
 
Protein sequence
MSDPVIKRAL VSVSDKSGVV EFCRELSSMG VEIFSTGGTL RKLQESGVAA ASISTITGFP 
EIMDGRVKTL HPKIHGGLLA VRDNADHIAQ ARDNGIGFID MVVVNLYPFQ ETVAKPDVTF
EEAIENIDIG GPSMLRSAAK NHESVTVITE SADYRTVLDE MRENNGATTR STRLKLAGKV
FTLTSRYDRA IADYLAASSE GEASSEAGSI SVRLEKEIDM RYGENPHQNA GFYRMDDGSG
SRSFEEYFRK LHGKDLSYNN MLDTAAATAL IEEFRDEAPA VVIIKHTNPC GVAQADTLVE
AYRKAFSTDT QSPFGGIIAC NRPLDMETAK AIDEIFTEIL IAPAYEEGVL DMLMKKKNRR
LLLQRKPLLQ EVTEYKSTRF GMLVQERDSR IASRDDLKVV TKRQPSAQEL DDLMFAWKIC
KHVKSNTIVY VKNRQTVGVG AGQMSRVDSA KIARSKAAEA GLDLNGSAVA SDAFFPFADG
LLAAAEAGAM AVIQPGGSVR DDEVIAAADE HDLAMVFTSM RHFKH
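
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the database's own method: it assumes the gene sequence is given 5' to 3' in the coding orientation, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G, T.
    """
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 18 bases of the gene sequence, as listed.
print(translate("ATGTCTGATC CTGTTATCAA GCGTGCATTA"))  # MSDPVIKRAL
print(translate("CGGCACTTCA AGCATTGA"))               # RHFKH
```

Each listing line holds 60 bases, so every line starts in frame and can be translated on its own, as in the two calls above. Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content.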