Gene Clim_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1749 
SymbolpurH 
ID6354577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1925445 
End bp1927022 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content56% 
IMG OID642669353 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001943769 
Protein GI189347240 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATC CTGTCATCAA GCGGGCGCTG GTATCTGTAT CCGATAAAAC CGGAATTGTG 
GATTTCTGCC GGGAGCTTTC GCTCCTCGGC GTCGAGATTT TTTCAACGGG CGGAACCCTG
AAAACGCTTC AGGATGCCGG AGTTGCTGCA GCCTCAATTT CAACCATTAC CGGATTTCCC
GAGATTATGG ACGGGCGGGT CAAGACGCTG CATCCTAAAA TTCATGGCGG TCTGCTTGCC
GTGAGAGAGA ACGCCGATCA TGTGAAACAG GCACTGGAGA ACGGTATCGG GTTTATCGAC
ATGGTGGTGG TGAACCTGTA TCCTTTCGAG GCTACGGTGG CAAAGCCGGA TGTTACCTTC
GAGGATGCTA TCGAGAATAT CGATATAGGT GGTCCCTCGA TGCTCCGCAG CGCTGCGAAA
AACAACGAGT CGGTGACCGT TTTGACCGAT AGCGCTGATT ACGCGCAGGT GCTTGGTGAG
ATGCGTGCCG GTGCCGGCGC TACGACTCGC GCAACCCGTC TTATGCTGGC TCGCAAGGTG
TTTGCACTCA CCTCCCGTTA CGACAGGGCT ATTGCGGCCT ACCTTACCGG CGCTGCAGGT
GCAGAGGTGG AGGGCGCGGC AGCAGGCATG ACCGTATCGC TCGAAAAAGA GCTTGACATG
CGTTATGGCG AGAACCCGCA TCAGAATGCG GGCTTTTACC GTCTTACCGA CAGCGAGGGC
AGCCGTTCGT TCGGCGCCTG TTTCGAGAAG CTGCACGGTA AGGAGCTGTC TTACAATAAT
ATGCTCGATA TTGCCGCGGC AACCTCGCTT ATCGAGGAGT TTCGCGGAGA GGATCCTTCA
GTCGTGATCG TCAAACATAC CAATCCGTGC GGTGTCGCTC AGGCCGATAC GCTTGTGGAG
GCTTACCGCA GGGCGTTCTC GACCGATACG CAGGCTCCTT TCGGTGGCAT CATCGCTTTC
AACCGTCCGC TCGATATGGA TACCGCAGTT GCGGTTAACG GGATATTTAC CGAAATCCTG
ATCGCTCCTG CTTTCGAGGA TGGCGTGCTC GATCTGCTTA TGAAGAAGAA AGACCGCAGG
CTGGTGCTGC AGAAGCAGCC GCTGCCGAAA GGCGGATGGG AGTTCAAGTC CACACCCTTC
GGCATGCTTG TTCAGGAGCG CGATGCCCGT ATCGTTGCCG TCGAGGATCT GAAGGTGGTC
ACCAAACGGC AGCCGACTGA AGAGGAACTT TCCAACCTGA TGTTTGCCTG GAAGATCTGC
AAGCATATCA AATCGAACAC TATCCTGTAT GTGAAGAACC GCCAGACCTA TGGCGTAGGA
GCGGGGCAGA TGTCCCGTGT GGATTCGTCG AAAATCGCGC GCTGGAAGGC TTCGGAGGTC
AATCTCGACC TGAACGGATC GGTTGTGGCT TCAGATGCGT TTTTCCCGTT CGCTGACGGG
CTGCTTGCCG CTGCCGAGGC TGGCGTGACC GCCGTGATCC AGCCGGGCGG TTCGATCAGG
GATAACGAGG TGATCGAGGC TGCGGATGCC AACAATCTGG CTATGGTGTT TACGGGGATG
AGGCATTTCA AACACTGA
 
Protein sequence
MSDPVIKRAL VSVSDKTGIV DFCRELSLLG VEIFSTGGTL KTLQDAGVAA ASISTITGFP 
EIMDGRVKTL HPKIHGGLLA VRENADHVKQ ALENGIGFID MVVVNLYPFE ATVAKPDVTF
EDAIENIDIG GPSMLRSAAK NNESVTVLTD SADYAQVLGE MRAGAGATTR ATRLMLARKV
FALTSRYDRA IAAYLTGAAG AEVEGAAAGM TVSLEKELDM RYGENPHQNA GFYRLTDSEG
SRSFGACFEK LHGKELSYNN MLDIAAATSL IEEFRGEDPS VVIVKHTNPC GVAQADTLVE
AYRRAFSTDT QAPFGGIIAF NRPLDMDTAV AVNGIFTEIL IAPAFEDGVL DLLMKKKDRR
LVLQKQPLPK GGWEFKSTPF GMLVQERDAR IVAVEDLKVV TKRQPTEEEL SNLMFAWKIC
KHIKSNTILY VKNRQTYGVG AGQMSRVDSS KIARWKASEV NLDLNGSVVA SDAFFPFADG
LLAAAEAGVT AVIQPGGSIR DNEVIEAADA NNLAMVFTGM RHFKH