Gene Lcho_0532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_0532 
SymbolpurH 
ID6160749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp579463 
End bp581070 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content69% 
IMG OID641663281 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001789572 
Protein GI171057223 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACCG CCCTGATTTC CGTCTCCGAC AAGACCGGCA TCGTCGAGCT TGCCCAGGCG 
CTGCACGCCA CCGGCGTCAA GCTGCTGTCC ACCGGCGGCA CCGCCAAACT GCTGGCCGAC
GCCGGCCTGC CGGTCACCGA GGTGGCCGAA CACACCGGTT TCCCGGAGAT GCTCGACGGC
CGCGTCAAGA CGCTGCACCC GAAGATCCAC GGCGGCCTGC TGGCGCGGCG CGACCTGCCC
GAGCACATGG CGGCGCTGGC GGCGCACGGC ATCGAGACCA TCGACCTGCT GATCGTCAAC
CTCTACCCCT TCGAGGCCAC CGTGGCCAAG GCCGGCTGCA CGCTCGACGA CGCGATCGAG
AACATCGACA TCGGCGGCCC GGCGATGGTG CGCAGCGCCG CCAAGAACTG GAAGGACGTG
GCCGTGCTCA CCGACGCCAG CCAGTACGCG GGCGTGATCG CCGAGCTGCA GCAGGCCGGT
ACGGTGAGCC GCGCGACGCG TTTCGCGCTG TCGGTGGCGG CCTTCAACCG CATCAGCAAC
TACGACGCCG CGATCTCCGA CTACCTGTCG TCGATCACCG ACGGTGGTGC CGGTGACGGC
GATGCGGCCC CGGCGCGCAT CGAGTTCCCG GGCCAGAGCA ACGGCCGCTT CGTCAAGCTG
CAAGACCTGC GTTACGGCGA GAACCCGCAT CAGGCCGCCG CCTTCTACCG CGACCTCTAC
CCCGCGCCCG GCTCGCTGGT CAGCGCCGTG CAGCTGCAGG GCAAGGAGCT GTCGTACAAC
AACATCGCCG ACGCCGACGC GGCGTGGGAG TGCGTGAAGA GCTTCGACAC CCCCGCCTGC
GTGATCGTCA AGCACGCCAA CCCTTGCGGC GTGGCGCTCG GTGCGGACGC CGGCGCGGCC
TACGCCAAGG CGTTCAAGAC CGATCCGACC TCGGCCTTCG GCGGCATCAT CGCCTTCAAC
ACGGTGGTCG ACAAAACCGC CGCCGAGCAG GTCGCCAAGC AGTTCGTCGA GGTGCTGATC
GCGCCGGCCT ACACCGACGA GGCGCGCGCC ATCTTCGCCG CCAAGGCCAA CACCCGCGTG
CTGCTGATCG ATCTGAGCCA GGTCCAGCGT GACGGCGCCA GCGCCTGGGC GCGCGGCCAG
AACGCGCACG ACATCAAGCG CATCGGCTCG GGCCTGCTGA TCCAGAGCGC CGACAACCAC
GTGCTCAAGC GCGAAGACCT GAAGATCGTC ACGAAGCTGG CGCCGACCGC GCAGCAGATC
GACGACCTGA TGTTCGCCTG GAGCGTGGCC AAGTTCGTCA AGAGCAATGC GATCGTGTTC
TGCAGCGGCG GCATGACGGT GGGCGTGGGC GCGGGCCAGA TGAGCCGGCT CGACTCGGCG
CGCATCGCCA GCATCAAGGC TGGCCACGCC GGTCTGACCC TGGCCGGCAG CGCGGTGGCG
AGCGACGCGT TCTTCCCGTT CCGCGACGGC CTCGACGTGG TGGCCGATGC CGGCGCCACC
TGCGTGATCC AGCCGGGCGG CTCGATGCGC GACCAGGAGG TGATCGACGC GGCCAACGAG
CGTGGCATCG CGATGGTGTA CACCGGCGTG CGGCATTTCC GGCATTGA
 
Protein sequence
MPTALISVSD KTGIVELAQA LHATGVKLLS TGGTAKLLAD AGLPVTEVAE HTGFPEMLDG 
RVKTLHPKIH GGLLARRDLP EHMAALAAHG IETIDLLIVN LYPFEATVAK AGCTLDDAIE
NIDIGGPAMV RSAAKNWKDV AVLTDASQYA GVIAELQQAG TVSRATRFAL SVAAFNRISN
YDAAISDYLS SITDGGAGDG DAAPARIEFP GQSNGRFVKL QDLRYGENPH QAAAFYRDLY
PAPGSLVSAV QLQGKELSYN NIADADAAWE CVKSFDTPAC VIVKHANPCG VALGADAGAA
YAKAFKTDPT SAFGGIIAFN TVVDKTAAEQ VAKQFVEVLI APAYTDEARA IFAAKANTRV
LLIDLSQVQR DGASAWARGQ NAHDIKRIGS GLLIQSADNH VLKREDLKIV TKLAPTAQQI
DDLMFAWSVA KFVKSNAIVF CSGGMTVGVG AGQMSRLDSA RIASIKAGHA GLTLAGSAVA
SDAFFPFRDG LDVVADAGAT CVIQPGGSMR DQEVIDAANE RGIAMVYTGV RHFRH