Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0512 |
Symbol | |
ID | 5056068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 463928 |
End bp | 464698 |
Gene Length | 771 bp |
Protein Length | 256 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640468074 |
Product | HAD family hydrolase |
Protein accession | YP_001152759 |
Protein GI | 145590757 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.425077 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTA CAACTTATTT CGTTGACGTA CAAGGGACGC TTGTAAGAAG GAATCCAAAG ACGTTAAAAA GCCAGCTTAT AGGCGGAGTG AAGGCCTTTG AGAAAATACG GGGGGCTGGG GGGAGGATAT ACATTTTATC CAACGCCCCA AGACTGACCG AGGAGGTGCA TAAAGATCTC TTATCTGTGG GCCTCCCCGT GGACATAGAG CAAATTATCA CCTCAGCTCA AGTTACTGGC GAGTACATCG CCAAGAAATT CGGCCCCTCG CGGCTTTATG TAATTGGGTC GGATAGCTTT AAACAAGAGC TGTCAAAATA CGGCCACACC GTTGTGGAGG AGGGCGCCGA TGTGGTAGTA GTCGGCATAG ATAGGCAGCT AACCTTTGAA AAGCTTAATA AAGCCATGCA ACTGATCATG GCGGGGGCAA AGCTAGTGGC CGCGGGGATG TCCCGATATA TCCCGGAGGA GAAGCCGACC ATCTCCATAG GCCCAATCGC CATGGCGCTA GCATACGCCA CTGGAGTCAA GCCGATAAAC ACGGGGAAGC CCTCGCGCAT AATGTACACC TACGCCTTAG TCCGGGCGAG GGCAGTGCCT GAGGAAAGCG CCGTGATAAG CGACGACCTA GAGGATTTAA TATACGCAAA AAGGATGGGC TTGGCCACTG TGCTCGTCTT GACGGGGGCC ACCACGCCTG AGAAGTTAAA GGCGTCCGGC TTCCAGCCCG ATTACGTCGT CAACAACATA GACGAGTTAA ACCCGTGGTG A
|
Protein sequence | MKITTYFVDV QGTLVRRNPK TLKSQLIGGV KAFEKIRGAG GRIYILSNAP RLTEEVHKDL LSVGLPVDIE QIITSAQVTG EYIAKKFGPS RLYVIGSDSF KQELSKYGHT VVEEGADVVV VGIDRQLTFE KLNKAMQLIM AGAKLVAAGM SRYIPEEKPT ISIGPIAMAL AYATGVKPIN TGKPSRIMYT YALVRARAVP EESAVISDDL EDLIYAKRMG LATVLVLTGA TTPEKLKASG FQPDYVVNNI DELNPW
|
| |