Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcal_0449 |
Symbol | purP |
ID | 4909352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | - |
Start bp | 431249 |
End bp | 432250 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640124201 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_001055348 |
Protein GI | 126459070 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCG CGCTGAAGAG GTACGACTTG GATAAGTTGG CCGTGGCCAC GGTGGCCTCC CACACCGCGC TCCAAATCCT CAGGGGAGCC AAGAGGTACG GCTTTAGGAC AATAGCCGTG GCACAGAGAA ACGCCGACTT CTACCGCCAG TTCTCCTTCA TCGACGAAGT GTGGACCGCC GACTTCTCCA ACTTTAGGCA CGTCGCTGAG AAGCTCGTGG AGAAAAACGC CTTGTTCATA CCCCACGGCT CATACGTCGA GTACGTGGGG TGGAGACAGG CGCTGGAGGC CCCAGTGCCC ACCCTAGGCT GTAGAGAGCT GTTGCGCTGG GAGGCCGACC AGTACAAGAA GATGGAGCTG CTGGCCGCCG CGGGGATACC CACGCCGAGG TACTACAAGA GGGCCGAAGA GGCCGAGGGC CCCGTAATAG TCAAACTCTT CGGCGCCAAG GGCGGCAGGG GGTACTTCGT GGCAAAGAAC AGAGAGGAGT TGGCCAAAAG GATCAAGGCG GTGGAGGGCG ACTACATAAT TCAGGAATAC GTCTTCGGCG TGCCCGCCTA CTACCACTAC TTCGCCTCGC CGGTGTACAA CAGAGTGGAG ATCTTCGGCA TGGACATCAG ATACGAGACC AATGTAGATG GGAGAACCTT CGGCTGGGTA GAGCCCACCT TCGTAGTGGT GGGGAATCTC CCGCTGGTGT TAAGGGAGTC TCTGCTCCCC GTGGTGCACA AGTACGGAGT CGACTTCGCC AAGGCAGTGA GAGAAAAGGT CAGCTGCGAG CTGGCGGGAC CCTACTGTCT CGAGAGCATA ATAAGAGACG ACATGACCAT CGTCGTCTTT GAATTCTCGG GGAGGATCGT GGCTGGGACA AACGTCTACA TGGGCGTGGG CTCCCCCTAC TCTGTCCTCT ACTTCGACGA GCCCATGGAC ATGGGAGAGA GGATAGCCCA CGAGATAAAA GAGGCCGCTG CGAGGGGCAT ATTGGAGAAA CTTTTCACAT AA
|
Protein sequence | MSAALKRYDL DKLAVATVAS HTALQILRGA KRYGFRTIAV AQRNADFYRQ FSFIDEVWTA DFSNFRHVAE KLVEKNALFI PHGSYVEYVG WRQALEAPVP TLGCRELLRW EADQYKKMEL LAAAGIPTPR YYKRAEEAEG PVIVKLFGAK GGRGYFVAKN REELAKRIKA VEGDYIIQEY VFGVPAYYHY FASPVYNRVE IFGMDIRYET NVDGRTFGWV EPTFVVVGNL PLVLRESLLP VVHKYGVDFA KAVREKVSCE LAGPYCLESI IRDDMTIVVF EFSGRIVAGT NVYMGVGSPY SVLYFDEPMD MGERIAHEIK EAAARGILEK LFT
|
| |