Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_0099 |
Symbol | |
ID | 4616959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | - |
Start bp | 93081 |
End bp | 94157 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639783181 |
Product | phosphoribosylaminoimidazole carboxylase |
Protein accession | YP_929625 |
Protein GI | 119871618 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0100977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCTAT TGGTTCTCGG AGGGGGCCAG CTGGCACTTA TGATGTCCTG GGCAGCCGCC AGACTGCCGC TGGACTTTGT CGTCTACGAC CCAGACCCCA GAGCCCCTGC GTATAAATAT GCTATTAAGG CGAGCGACCC GCTGCGGGAG GTAGAGAGGG CGGACTACGT CACCTTCGAG TTTGAAAATG TGGAATGGGA GGTGGCGGAA TACGCCCACA AACTGGGCAA GCTGAGGCCC CATCTGGACT ACCTTAGGGT GAAGAAGAGT AGGATATGGG AGAGGGAGGC GCTCGGCGAG CTGGGGGTGC CGACCCCCCG CTGGGCTGTG GCTAGAGATG GCGCAGAGGC TATGGAACTG GCGGCCAAGT GGGGGAGAGC CATCGCCAAG GTGCCCTCCG GCGGCTACGA CGGCAAGGGC CAGTATCTGC TCCCCAGAGA GAGGCCGCCC GCGGAGGGCC CCCTCCTGGT GGAGGAATAC GTAAACATCG CGAGGGAGTT CTCAATAATA GCGGCTAGGT CTGAAGACGG CGACGTGTAC TTCTACCCGC CCGCCCAGAA CTACTACGTC CAAGGCATCT TGGTGTGGAA CTACGCCCCC GCCGAGGCGC CGCCCGAGGC CTATAGATAC GTGGAGAAGA TCCTGGAGTG GCGTAGATAC GTAGGCATCC TCGCCGTGGA GTTCTTCGAA GACAGAAGCG GCAGGATTCT CGTCAATGAG ATCGCGCCCC GCGTCCACAA CACAGGCCAC TGGACCCTAG AGACAGACGC CTCCCAGTTC GAAAACCACG TCAGAGCAGC CGTCGGCCTC CCCCTCAGGA GGCCACGCGC CTTAGCGCCC ACCGCCATGG TGAACCTACT GGGCGTCGCC AGCCCACCCA TTAGGGAGCT GGAGAGGCTC GGCAAGGTGT ACTGGTACGG CAAGGCGGAG GCGCGGCCCA GGCGCAAGAT GGGCCACGTA AACATCACAG CAGACACCAC GGCGGAGGCC ATAGCCAAGG CCCGAGAGGC GATGAGGATA ATCTACGGCA GAAAATTCCC AGAACTCGTC CTCAAAAACT ACGGCAGTAG ACAATAG
|
Protein sequence | MRLLVLGGGQ LALMMSWAAA RLPLDFVVYD PDPRAPAYKY AIKASDPLRE VERADYVTFE FENVEWEVAE YAHKLGKLRP HLDYLRVKKS RIWEREALGE LGVPTPRWAV ARDGAEAMEL AAKWGRAIAK VPSGGYDGKG QYLLPRERPP AEGPLLVEEY VNIAREFSII AARSEDGDVY FYPPAQNYYV QGILVWNYAP AEAPPEAYRY VEKILEWRRY VGILAVEFFE DRSGRILVNE IAPRVHNTGH WTLETDASQF ENHVRAAVGL PLRRPRALAP TAMVNLLGVA SPPIRELERL GKVYWYGKAE ARPRRKMGHV NITADTTAEA IAKAREAMRI IYGRKFPELV LKNYGSRQ
|
| |