Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4782 |
Symbol | purH |
ID | 5902244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5164375 |
End bp | 5165964 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641565302 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001686400 |
Protein GI | 167648737 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.391725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCCG CCCCCGACTA TCCGTCCGCC CCCGACCTCG TCGCCCCCAA GCGCGCCCTG CTGTCGGTCT CCGACAAGAC CGGCCTGGTG GAGGCCGCCC AGATCCTGCA CGCGGCCGGT GTCGAGCTGG TCTCGACCGG CGGCACCAAG GCGGCGATCG CGGCTGCCGG GATCCCGGTC AAGGACGTCT CCGACCTGAC GGGCTTCCCG GAGATGATGG ACGGACGGGT CAAGACCCTG CACCCCGTCG TCCATGGCGG GCTGCTGGGC GTCCGCGACG CCCCCGAGCA CGCCAAGGCC ATGGCCGACC ACGGCATCGG CGGCATCGAT ATCCTCTATG TGAACCTCTA TCCGTTCGAG GCCACGGTCG CGAAGGGCGG AACCTACGCC GAGTGCGTCG AGAACATCGA CATCGGCGGC CCGGCGATGA TCCGCTCGGC GGCCAAGAAC CACGGCTATG TCGCCGTCTG CACCGATCCG TCGGACCTGG CCGAGGTGCT GGACGCGCTG AAGGCCGGCG GCACGACCCT GGCGCTGCGC CAGACCCTGG CGGCCCGCGC CTATGCTCGC ACGGCGGCCT ATGACGCGGC GATCTCCACC TGGTTCGCCG CCCAGTTGGG CCAGGACTTC CCGGCTCGCA AGACCATCGC CGGCCAATTG CGCCAGACGA TGCGCTACGG CGAGAACCCG CACCAGAAGG CGGCCTTCTA CACCTTCGCC AATCCGCGCA CCGGCGTGGC CACGGCCACC CAGCTGCAGG GCAAGGAACT CAGCTACAAC AACATCAACG ACACCGACGC GGCCTTCGAA CTGATCGCCG AGTTCGATCC GGCGGCCGGC CCGGCGGTGG CGATCATCAA GCACGCCAAT CCCTGCGGCG TGGCCGTGGG CGCCAGCCAG CGCGAGGCCT ATGAGCGCGC CCTGGCCTGC GACCCGACCT CGGCGTTCGG CGGCATCGTC GCCGTCAACA GCCGCCTGAC CCGCGACGCG GCCCTGGCGA TGGTCGAGAT CTTCACCGAG GTGGTGATCG CCCCGGAAGC CGACGACGAC GCCGTCGCGG TGTTCGCCGC CAAGAAGAAC CTGCGCCTGC TGGTGACCGG CGGCCTGCCC GACGCCCTGT CGAGCGGCGA CACCTTCAAG TCGGTGGCCG GCGGCTTCCT GGTGCAATCC CGGGATGACG CGCGGATCAC GGCTTCGGAC CTGAAGATCG TCACCAAGCG TCAGCCTACG GAGGAAGAGG TGCGCGACAT GCTGTTCGCC TTCACCGTCG GCAAGCACGT CAAGTCCAAC GCCATCGTCT ATGCCCGCGA AGGCCAGACC CTGGGCGTCG GCGCCGGCCA GATGAACCGC AAGGACAGCG CCCGGATCGC GGCCCTGCGC GCCGCCGATT TCGGCCTGGA CCTGAAGGGC TGCGCCTGCG CCTCCGAAGC CTTCTTCCCG TTCGCCGACG GCCTGATCCA GGCGGCGGAG GCCGGAGCGA CGGCGATCAT CCAGCCCGGC GGCTCGATGC GCGACCCCGA GGTGATCGAG GCCGCCGACA AGCTGGGCCT TACAATGGCC TTCACGGGTG TGCGAGTGTT CCGCCACTAA
|
Protein sequence | MPAAPDYPSA PDLVAPKRAL LSVSDKTGLV EAAQILHAAG VELVSTGGTK AAIAAAGIPV KDVSDLTGFP EMMDGRVKTL HPVVHGGLLG VRDAPEHAKA MADHGIGGID ILYVNLYPFE ATVAKGGTYA ECVENIDIGG PAMIRSAAKN HGYVAVCTDP SDLAEVLDAL KAGGTTLALR QTLAARAYAR TAAYDAAIST WFAAQLGQDF PARKTIAGQL RQTMRYGENP HQKAAFYTFA NPRTGVATAT QLQGKELSYN NINDTDAAFE LIAEFDPAAG PAVAIIKHAN PCGVAVGASQ REAYERALAC DPTSAFGGIV AVNSRLTRDA ALAMVEIFTE VVIAPEADDD AVAVFAAKKN LRLLVTGGLP DALSSGDTFK SVAGGFLVQS RDDARITASD LKIVTKRQPT EEEVRDMLFA FTVGKHVKSN AIVYAREGQT LGVGAGQMNR KDSARIAALR AADFGLDLKG CACASEAFFP FADGLIQAAE AGATAIIQPG GSMRDPEVIE AADKLGLTMA FTGVRVFRH
|
| |