Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_0095 |
Symbol | purP |
ID | 4616974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | - |
Start bp | 88692 |
End bp | 89762 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639783176 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase-like protein |
Protein accession | YP_929621 |
Protein GI | 119871614 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATAA ACTGTTGCTT ACTTCTGTCT ATGGTTTCTG TGGCTGTGTT GGCGTCCCAT AGCGCATTAG ACGTGTTAGA TGGCGCCAGA GACGAAGGGC TGAAGACAGT GGCTATAGCA AAGAAGGGGA GGGAGAGGGC CTACAGAGAG TTCCCCGTGG TAGACAAGCT AATTGTATTA GATGACTATA GAGACATATT GAAGATCGTA GACTTATTAA AGGCCGAGGA GGCGGTTTTT GTCCCAAATA GATCTTTCGC AGTATATGTG GGCTACGACG CCATAGAGAG AGAGTTCCCA GTGCCGATCT TCGGGAACCG GTTCCTACTA AGGTGGGAGG AGAGGACGGG GCCTCAGAAC TACTACCGTT TGCTAGACGA GGCAGGGATA AGGCGGCCTA GGACTTTTAG ACCGGACGAG GTGGACCGCC CAGTTATCGT CAAAATGCCA GAGGCGGAGA GGAGGGTCGA GAGGGGGTTC TTCATCGCCC GCGACCGCGA CGACCTATAC AGAAAGGCCA AACGTCTGGC AGACGCCGGA GTGATAAAGC TGGAGGACTT AGAGCGGTCT TCCATCGAGG AGCTGGTCCT CGGGGCCCAT TTCAACGCCA ACTACTTCTA CTCAGTCATG AGGAGACGGC TTGAGTTACA CAGTTTCGAC AGGAGGATAC AGAGCAACCT AGATGGGGTG TTCCGCCTGC CTGCCAGAGA CCAGCTGGAG GTAGACCCTG AGGTGAGGTA TATAGAGGTG GGCCACGAGC CTGCGACAAT ACGAGAGTCC CTCCTCGAGA AGGTGTTCGA CGTCGGCTAC AGATTTGTGG AGGCGGCACG CCGTCTCGTC CCCCCGGGGG TGATCGGGCC TTTCACCCTT CAGTTCATTG TGACACCCCA GCTGGACCTC GTGGTTTACG ACGTCGCGCC GAGGATAGGC GGCGGCACTA ACGTATACAT TGGGATCGGG GGGCAGTACT CCAAGCTCTA CTTCGGCAAA CCGATATCGA TAGGGAGGAG GATAGCTATG GAGATCCGCG AAGCCGCCGA GCAGGGGAGA CTCCCTGAAA TTACTACGTG A
|
Protein sequence | MSINCCLLLS MVSVAVLASH SALDVLDGAR DEGLKTVAIA KKGRERAYRE FPVVDKLIVL DDYRDILKIV DLLKAEEAVF VPNRSFAVYV GYDAIEREFP VPIFGNRFLL RWEERTGPQN YYRLLDEAGI RRPRTFRPDE VDRPVIVKMP EAERRVERGF FIARDRDDLY RKAKRLADAG VIKLEDLERS SIEELVLGAH FNANYFYSVM RRRLELHSFD RRIQSNLDGV FRLPARDQLE VDPEVRYIEV GHEPATIRES LLEKVFDVGY RFVEAARRLV PPGVIGPFTL QFIVTPQLDL VVYDVAPRIG GGTNVYIGIG GQYSKLYFGK PISIGRRIAM EIREAAEQGR LPEITT
|
| |