Gene Caul_4782 details


Gene Information

Locus tag: Caul_4782
Symbol: purH
ID: 5902244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5164375
End bp: 5165964
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 70%
IMG OID: 641565302
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001686400
Protein GI: 167648737
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
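
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5164375
END_BP = 5165964
GENE_LENGTH_BP = 1590
PROTEIN_LENGTH_AA = 529


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions, 5165964 - 5164375 + 1 gives 1590 bp, and 1590 / 3 - 1 gives 529 residues, matching the listed Gene Length and Protein Length.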


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.391725
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGCCG CCCCCGACTA TCCGTCCGCC CCCGACCTCG TCGCCCCCAA GCGCGCCCTG 
CTGTCGGTCT CCGACAAGAC CGGCCTGGTG GAGGCCGCCC AGATCCTGCA CGCGGCCGGT
GTCGAGCTGG TCTCGACCGG CGGCACCAAG GCGGCGATCG CGGCTGCCGG GATCCCGGTC
AAGGACGTCT CCGACCTGAC GGGCTTCCCG GAGATGATGG ACGGACGGGT CAAGACCCTG
CACCCCGTCG TCCATGGCGG GCTGCTGGGC GTCCGCGACG CCCCCGAGCA CGCCAAGGCC
ATGGCCGACC ACGGCATCGG CGGCATCGAT ATCCTCTATG TGAACCTCTA TCCGTTCGAG
GCCACGGTCG CGAAGGGCGG AACCTACGCC GAGTGCGTCG AGAACATCGA CATCGGCGGC
CCGGCGATGA TCCGCTCGGC GGCCAAGAAC CACGGCTATG TCGCCGTCTG CACCGATCCG
TCGGACCTGG CCGAGGTGCT GGACGCGCTG AAGGCCGGCG GCACGACCCT GGCGCTGCGC
CAGACCCTGG CGGCCCGCGC CTATGCTCGC ACGGCGGCCT ATGACGCGGC GATCTCCACC
TGGTTCGCCG CCCAGTTGGG CCAGGACTTC CCGGCTCGCA AGACCATCGC CGGCCAATTG
CGCCAGACGA TGCGCTACGG CGAGAACCCG CACCAGAAGG CGGCCTTCTA CACCTTCGCC
AATCCGCGCA CCGGCGTGGC CACGGCCACC CAGCTGCAGG GCAAGGAACT CAGCTACAAC
AACATCAACG ACACCGACGC GGCCTTCGAA CTGATCGCCG AGTTCGATCC GGCGGCCGGC
CCGGCGGTGG CGATCATCAA GCACGCCAAT CCCTGCGGCG TGGCCGTGGG CGCCAGCCAG
CGCGAGGCCT ATGAGCGCGC CCTGGCCTGC GACCCGACCT CGGCGTTCGG CGGCATCGTC
GCCGTCAACA GCCGCCTGAC CCGCGACGCG GCCCTGGCGA TGGTCGAGAT CTTCACCGAG
GTGGTGATCG CCCCGGAAGC CGACGACGAC GCCGTCGCGG TGTTCGCCGC CAAGAAGAAC
CTGCGCCTGC TGGTGACCGG CGGCCTGCCC GACGCCCTGT CGAGCGGCGA CACCTTCAAG
TCGGTGGCCG GCGGCTTCCT GGTGCAATCC CGGGATGACG CGCGGATCAC GGCTTCGGAC
CTGAAGATCG TCACCAAGCG TCAGCCTACG GAGGAAGAGG TGCGCGACAT GCTGTTCGCC
TTCACCGTCG GCAAGCACGT CAAGTCCAAC GCCATCGTCT ATGCCCGCGA AGGCCAGACC
CTGGGCGTCG GCGCCGGCCA GATGAACCGC AAGGACAGCG CCCGGATCGC GGCCCTGCGC
GCCGCCGATT TCGGCCTGGA CCTGAAGGGC TGCGCCTGCG CCTCCGAAGC CTTCTTCCCG
TTCGCCGACG GCCTGATCCA GGCGGCGGAG GCCGGAGCGA CGGCGATCAT CCAGCCCGGC
GGCTCGATGC GCGACCCCGA GGTGATCGAG GCCGCCGACA AGCTGGGCCT TACAATGGCC
TTCACGGGTG TGCGAGTGTT CCGCCACTAA
 
Protein sequence
MPAAPDYPSA PDLVAPKRAL LSVSDKTGLV EAAQILHAAG VELVSTGGTK AAIAAAGIPV 
KDVSDLTGFP EMMDGRVKTL HPVVHGGLLG VRDAPEHAKA MADHGIGGID ILYVNLYPFE
ATVAKGGTYA ECVENIDIGG PAMIRSAAKN HGYVAVCTDP SDLAEVLDAL KAGGTTLALR
QTLAARAYAR TAAYDAAIST WFAAQLGQDF PARKTIAGQL RQTMRYGENP HQKAAFYTFA
NPRTGVATAT QLQGKELSYN NINDTDAAFE LIAEFDPAAG PAVAIIKHAN PCGVAVGASQ
REAYERALAC DPTSAFGGIV AVNSRLTRDA ALAMVEIFTE VVIAPEADDD AVAVFAAKKN
LRLLVTGGLP DALSSGDTFK SVAGGFLVQS RDDARITASD LKIVTKRQPT EEEVRDMLFA
FTVGKHVKSN AIVYAREGQT LGVGAGQMNR KDSARIAALR AADFGLDLKG CACASEAFFP
FADGLIQAAE AGATAIIQPG GSMRDPEVIE AADKLGLTMA FTGVRVFRH
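
The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates that on the first and last lines of the gene sequence only. The helper `translate_cds`, the `is_start` flag, and the `START_CODONS` set (a subset of the table 11 initiators) are hypothetical names introduced for illustration; this is not the tool that produced the record.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common table 11 initiators are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna, is_start=True):
    """Translate a DNA fragment in frame 0, stopping at the first stop codon.

    If is_start is True and the first codon is an accepted initiator,
    it is emitted as 'M' regardless of its usual amino acid.
    """
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (start of the CDS).
first_line = "GTGCCCGCCG CCCCCGACTA TCCGTCCGCC"
# Last 30 bases of the gene sequence (in frame, ends with the stop codon).
last_line = "TTCACGGGTG TGCGAGTGTT CCGCCACTAA"

print(translate_cds(first_line))                  # MPAAPDYPSA
print(translate_cds(last_line, is_start=False))   # FTGVRVFRH
```

The two outputs match the first ten and last nine residues of the protein sequence listed above.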