Gene Information |
Locus tag | Pars_2262 |
Symbol | purP |
ID | 5054848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2025209 |
End bp | 2026252 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640469814 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase-like protein |
Protein accession | YP_001154458 |
Protein GI | 145592456 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
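The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record, although both are consistent with its values. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2025209
END_BP = 2026252
GENE_LENGTH_BP = 1044
PROTEIN_LENGTH_AA = 347


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1044 bp, which is 348 codons, or 347 residues plus one stop codon.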
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.141593 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0244292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTTT CAGTGGCCGT GTTGGCCAGC CACAGCGCCC TCGACGTGCT AGACGGCGCC AAGGACGAGG GGTTGAGGAC GGTAGCTGTG GCGAAGAAGG GCCGCGACCG GGCCTACAGG GAGTTCCCCG TAGTGGACAA GCTCATAGTT CTTGACGACT ATGTAGACAT CTTGTATATT GTCGATATGC TTAAGGCTGA AGGCTCCGTG TTTGTGCCGA ACCGCTCATT CGCCGTGTAC GTCGGCTACG ACAACATAGA GAGGAGGTTC CCCGTCCCCG TCTTCGGGAA CAGATATCTG CTGAGGTGGG AGGAGCGGAC AGGTCCACAG AGCTACTACC GCCTCCTAGA CGAGGCTGGG GTCAAAAGAC CTAGGACCTT CCGCCCCGAC GAGGTGGACC GCCCCGTCAT AGTGAAGATG CCAGAGGCCG AGCGGAGGGT CGAGCGGGGC TTCTTCGTGG CGAGGGACAG AGACGACTTG TGGAGGAAGG CCAAAAGGCT GGCGGAGGCC GGGATCATAA GGCTCGAGGA CCTAGAAGCC GCCTCCATTG AGGAGTTGGT GCTGGGGGCC CACTTCAACG CCAACTACTT CTACTCCCCC CTCCGCAAGA GGCTTGAGCT ACACAGCTTC GACAGGAGGA TCCAGTCTAA CCTAGACGGG GTATTCCGCC TCCCAGCGAG GGACCAGCTA GACCTCGATC CAGATGTGCG CTACATCGAG GTGGGCCATG AACCAGCCAC AATTAGGGAA TCCCTCCTCG AAAAGGTCTT CGACATTGGG TACCGCTTCT TGGAGGCTAC CAGAAGGCTG GTGCCGCCGG GCGTGATCGG CCCCTTCACC CTACAGTTCA TAGTAACACC CCAGCTAGAC CTCGTGGTGT ACGACGTGGC TCCGCGCATC GGGGGAGGCA CCAACGTCTA CATAGGGATC GGGGGGCAGT ACTCGAAGCT CTACTTCGGC AAGCCCATAT CCATTGGGAG GAGGATTGCA ATGGAGATAA GAGAGGCTGC CGAGCAGAAG AGGCTGGAGG AGGTCACGAC TTGA
|
Protein sequence | MAVSVAVLAS HSALDVLDGA KDEGLRTVAV AKKGRDRAYR EFPVVDKLIV LDDYVDILYI VDMLKAEGSV FVPNRSFAVY VGYDNIERRF PVPVFGNRYL LRWEERTGPQ SYYRLLDEAG VKRPRTFRPD EVDRPVIVKM PEAERRVERG FFVARDRDDL WRKAKRLAEA GIIRLEDLEA ASIEELVLGA HFNANYFYSP LRKRLELHSF DRRIQSNLDG VFRLPARDQL DLDPDVRYIE VGHEPATIRE SLLEKVFDIG YRFLEATRRL VPPGVIGPFT LQFIVTPQLD LVVYDVAPRI GGGTNVYIGI GGQYSKLYFG KPISIGRRIA MEIREAAEQK RLEEVTT
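The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, its GC fraction computed, and the gene translated is given below. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard table (the two differ in permitted start codons). The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(field: str) -> str:
    """Remove the spacing between blocks and upper-case the result."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the Gene sequence field.
    sample = clean_sequence("ATGGCGGTTT CAGTGGCCGT GTTGGCCAGC")
    print(translate(sample))
    print(round(gc_fraction(sample), 3))
```

Passing the full Gene sequence field through `clean_sequence` and `translate` gives a way to compare the result with the Protein sequence field, and `gc_fraction` on the full gene can be compared with the reported GC content.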