Gene Cpha266_0657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0657 
SymbolpurH 
ID4569811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp748982 
End bp750547 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content50% 
IMG OID639765255 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_911136 
Protein GI119356492 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATC CTGTCATCAA ACGGGCGCTG GTCTCTGTTT CCGACAAAAC CGGTATTGTG 
GATTTCTGCC GGGAGCTTTC GCTTCTCGGC GTTGAGGTGT TTTCAACGGG CGGAACCCTG
AAGACTCTTC AGGATGCCGG AATAGCTGCG GCTTCTATTT CGACCATCAC CGGATTTCCG
GAAATTATGG ATGGGCGGGT CAAAACCCTC CATCCTAAAA TACATGGAGG ACTGCTCGCC
GTAAGGGAAA ATCCTGATCA TGTCAACCAG GCGAACGAAA ACGGGATCAG CTTTATTGAT
CTTGTTGTTG TTAACCTTTA TCCATTCGAG GCCACAGTTG CAAAACCGGA CGTGACCTTC
GAGGATGCCA TAGAAAATAT CGATATCGGC GGTCCCTCCA TGCTTCGCAG TGCTGCCAAG
AACAACGAAT CGGTAACAGT GGTCACGGAT AGCGCCGACT ATGCGCTTGT GTTGCAGGAG
ATGCGTAATA ATAACGGTGC GACGAAAAGG GAGACCCGGC TGGCGCTTGC TCTGAAGGTT
TTTGAACTTA CCTCTCGTTA TGATCGCGCA ATCGCCTCTT ATCTTGCAGG AGCTCAGCAT
GAAGCAGATT CTTCCATGAC GGTAAAACTT GAACGTGAGC TCGATATGCG CTATGGCGAA
AATCCTCATC AGAGCGCTGG GCTTTACCGC CTGACTGATG AGAACGGAAC GCGTTCTTTT
AGCGATTATT TCGAGAAACT GCATGGCAAG GAGCTCTCTT ACAACAATAT GCTCGATATT
GCCGCCGCAG TCTCCCTTAT TGAGGAGTTC CGTGGTGAAG AGCCGACAGT AGTCATTATC
AAACATACAA ACCCCTGCGG TGTTGCGCAG GCCCCGACAC TTGCCGAAGC ATACCGGAGA
GCATTCTCAA CCGATACCCA GGCCCCTTTT GGCGGCATTA TTGCCTTTAA CCATCCTCTC
GACATGGAAG CGGCAACGGC GGTCAATGAG ATTTTTACCG AGATTCTTAT TGCTCCGGCA
TTTGAGGATG GCGTGCTTGA GATGCTGATG AAGAAAAAAG ATCGCAGGCT TGTGCGGCAG
ACGAGTGCCC TGCCCAAAGG TGGTTGGGAG TTCAAGTCTA CTCCGTTCGG GATGCTTGTT
CAGGAACGTG ACAGCAAAAT CGTCACAAAA GAGGATCTGA CTGTTGTGAC CAAACGGCAG
CCAACAGAAG AGGAGGTTGC AGACATGATG TTTGCCTGGA AAATCTGCAA GCACATCAAG
TCAAACACGA TTCTTTATGT TAAAAATCGC CAGACCTTTG GAGTTGGTGC CGGTCAGATG
TCCCGTGTTG ACTCTTCAAA AATCGCGCGT TGGAAAGCTT CTGAAGTCAA TCTCGATCTG
CATGGCTCGG TGGTTGCTTC AGATGCGTTT TTCCCGTTTG CCGATGGTCT TCTTGCCGCA
GCAGAAGCAG GCGTTACCGC AGTTATTCAG CCAGGCGGTT CGATCAGGGA TAACGAGGTG
ATTGAAGCGG CAGACGCTAA CAATCTTGCC ATGGTTTTTA CAGGAATGCG CCACTTCAAA
CACTGA
 
Protein sequence
MSDPVIKRAL VSVSDKTGIV DFCRELSLLG VEVFSTGGTL KTLQDAGIAA ASISTITGFP 
EIMDGRVKTL HPKIHGGLLA VRENPDHVNQ ANENGISFID LVVVNLYPFE ATVAKPDVTF
EDAIENIDIG GPSMLRSAAK NNESVTVVTD SADYALVLQE MRNNNGATKR ETRLALALKV
FELTSRYDRA IASYLAGAQH EADSSMTVKL ERELDMRYGE NPHQSAGLYR LTDENGTRSF
SDYFEKLHGK ELSYNNMLDI AAAVSLIEEF RGEEPTVVII KHTNPCGVAQ APTLAEAYRR
AFSTDTQAPF GGIIAFNHPL DMEAATAVNE IFTEILIAPA FEDGVLEMLM KKKDRRLVRQ
TSALPKGGWE FKSTPFGMLV QERDSKIVTK EDLTVVTKRQ PTEEEVADMM FAWKICKHIK
SNTILYVKNR QTFGVGAGQM SRVDSSKIAR WKASEVNLDL HGSVVASDAF FPFADGLLAA
AEAGVTAVIQ PGGSIRDNEV IEAADANNLA MVFTGMRHFK H