Gene Information |
Locus tag | Pars_1944 |
Symbol | |
ID | 5054790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1744528 |
End bp | 1745418 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640469490 |
Product | phosphoesterase domain-containing protein |
Protein accession | YP_001154143 |
Protein GI | 145592141 |
COG category | [R] General function prediction only |
COG ID | [COG0618] Exopolyphosphatase-related proteins |
TIGRFAM ID | |
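
The coordinates and lengths listed above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(1744528, 1745418)
print(length, protein_length(length))  # 891 296
```

Both values match the Gene Length (891 bp) and Protein Length (296 aa) fields of this record.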
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.754503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGAGA AGCTGAGGGA GCTGGTTCAA GGCGCCAAGA GGGTGGCAAT AGTAACCCAC AAGAGGGCGG ATGCCGATGC GCTTGCCTGC GCCAAGGTGC TAGAGCTTGT TCTGACACGG CTTGGGCTTG AGGTCGCCGG GGTGTTCTGT CCCGAAGGCT CTCCAATAAG GGGTTGCGAC AAGGAGGTGC CGCTTGGCGT AGATCTGTAC GTGTTGGCAG ACGTGGCGTC CATGAGCCAG ATACCCCCCA TATGCGGTAG ATGTATTAGG GTAGACCACC ACGTTGCTGG GGACGATCTT CCTGGCATTG TGGCGGAGAG GCCCAGCTGT ACAGAGGTGG CGCTGGAGCT AGCCGAGGAG GTAGGTGTGG AGATCCCGCC CGACGTTGCG AAGCTTGCGG TGCTTGGTAT TTATACAGAT ACGGGGAGGC TGAGGCGTGC AGACGCCAAG ACTTTCAGGT TGCTGGCGCA ACTTCTAGAA AAAACAGGCG GCGTGTTGGG GGACTTAACA GGCTCCGAGG AGGGAGTAAG AGAGGAGCCC GCGGTCTTGG CCCTGTTAAA GGGAATGCAG AGAGTTGAGT TTTACAAATC GCAAATAGGC TTAATATGTA CATCCCATGT AAGTGCCTAC GAGGCGGATC TTGCCACGTT GCTAGTGTCG GCAGGTTGCC GCATCGCAAT AGTGGCATCG AGGAAGGACG ACGGCATACA CATAGTCTTT AGATCAAGGG GGGTAGACGT GGCTACATTG GCGAAATCAA TAGGCGCCGG AGGAGGGCAC CGGGAGGCGG CGGTGTCCGT AATAAGTGAG AGACTTCCCA AAAGCCAGTT ACCCGACTTC CTAAGAAGTC TCGTTAAGAG GCTGTTCCAA AACGCAACCC CCCTAGTCTA A
|
Protein sequence | MLEKLRELVQ GAKRVAIVTH KRADADALAC AKVLELVLTR LGLEVAGVFC PEGSPIRGCD KEVPLGVDLY VLADVASMSQ IPPICGRCIR VDHHVAGDDL PGIVAERPSC TEVALELAEE VGVEIPPDVA KLAVLGIYTD TGRLRRADAK TFRLLAQLLE KTGGVLGDLT GSEEGVREEP AVLALLKGMQ RVEFYKSQIG LICTSHVSAY EADLATLLVS AGCRIAIVAS RKDDGIHIVF RSRGVDVATL AKSIGAGGGH REAAVSVISE RLPKSQLPDF LRSLVKRLFQ NATPLV
|
| |
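
The relationship between the gene sequence and the protein sequence can be sketched with a small translator. This is a minimal illustration, not the pipeline used to build the record. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code, and it assumes the input contains only A, C, G, and T (any other base raises a `KeyError`). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGCTGGAGA AGCTGAGGGA GCTGGTTCAA"))  # MLEKLRELVQ
print(translate("AACGCAACCC CCCTAGTCTA A"))           # NATPLV
```

The two outputs match the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.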