Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1483 |
Symbol | |
ID | 5054244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1341496 |
End bp | 1342683 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640469023 |
Product | imidazolonepropionase |
Protein accession | YP_001153692 |
Protein GI | 145591690 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCATAA TAAGGGCGAA GCAACTTGTA ACTGCTTTAA ACATGCCCTG GAGATACAGA GACGAAGTCG CCGTAATTAA CGACGCCGCT GTCGTGATTA GAGACGGCGT GATAATAGAC GTCGGCACGT GGGAAGAAAT AAAACGAAGA CATCCACATG CCAATATTTG GGATTTCGGC GACAATCTCA TAACGCCAGG CCTAGTGGAC CCACATACGC ACTTACTTTT TGCAGGTTCG CGAGAAGACG AGCTTGAAAG AAAGCTACAG GGCGAGTCGT ACGAAGAAAT AACGAGAAAA GGCGGCGGCA TATACAAAAC CGTGAAATAT ACGAAAGAGA CAAGCGACCA AGAGCTGTTG AATATCCTAC AGAAAAGAAT TCAATTAGCT ACATCTTTTG GCACAACAAC AGTTGAGGTA AAAACTGGAT ATGGGCTAGA CATAGATCAA GAACTGAGAC TCGCCAGAAT TTTAAAGAGC GTGAAAAGCC CCATAGACGT AGTCACAACA TTTCTAGTAC ACATCCCGCC GCCGGCAGGA AGAGAAAATT ATGTAAAAGA AGTGCTTAAG GCCATTCCAC ATGCTGGCAC AACATATGTA GACGTCTTCT GTGACTCTAT AGCGTTTAAT GTGGAGGAGA CAAGAACTAT TTTGAAGAAG GCCGCCGAGG CCGGCTACAA GCTTAGGCTA CATGCAGACG AGCTTGAGTA CATAGGATGT AGCGACTTGG TAGAAGAATT GCCTATAGAC TCAGCCGACC ATCTCCTAAA TACGCCGCCT GAGAATGTAA GAAAAATCGC AAAATCCGGA ACAGTGGCGA CTCTGCTACC GGTTACTATT CTTACACTCA GAACGTCTAA AAAACCGCCC ATTGATGAGA TGAGGCGACT TAGAGTTCCC ATAGCTATAG GAACAGACTT TAGCCCTAAC AGCTGGTGTC TAAACATGCA AACGGCAATA GAACTCGCAG TATATCTCCT GGGGCTCACC CCCTTAGAGG CGCTGATTGC CGCTACGGCT AACGCCGCCT ACAGCCTACG TCTTACAGAC AGGGGGATTA TCCAGCCGGG CAAAATTGCA GACTTGGTAA TATGGGATGT ACCGAACTAC CACTGGCTCG CGTATGAAAT TGGCAGAAAT AAGGCGAAGC TTGTACTGAA AAAAGGGGAG CCACTACGGT TTCTCTAA
|
Protein sequence | MVIIRAKQLV TALNMPWRYR DEVAVINDAA VVIRDGVIID VGTWEEIKRR HPHANIWDFG DNLITPGLVD PHTHLLFAGS REDELERKLQ GESYEEITRK GGGIYKTVKY TKETSDQELL NILQKRIQLA TSFGTTTVEV KTGYGLDIDQ ELRLARILKS VKSPIDVVTT FLVHIPPPAG RENYVKEVLK AIPHAGTTYV DVFCDSIAFN VEETRTILKK AAEAGYKLRL HADELEYIGC SDLVEELPID SADHLLNTPP ENVRKIAKSG TVATLLPVTI LTLRTSKKPP IDEMRRLRVP IAIGTDFSPN SWCLNMQTAI ELAVYLLGLT PLEALIAATA NAAYSLRLTD RGIIQPGKIA DLVIWDVPNY HWLAYEIGRN KAKLVLKKGE PLRFL
|
| |