Gene Information |
Locus tag | Pars_0862 |
Symbol | |
ID | 5054238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 765041 |
End bp | 766006 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640468422 |
Product | phosphoesterase, DHHA1 |
Protein accession | YP_001153099 |
Protein GI | 145591097 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0608] Single-stranded DNA-specific exonuclease |
TIGRFAM ID | |
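The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 765041
END_BP = 766006
GENE_LENGTH_BP = 966
PROTEIN_LENGTH_AA = 321


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for an unspliced CDS: codons minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 766006 - 765041 + 1 = 966 bp; 966 / 3 - 1 = 321 aa
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```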
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGGC GGTTGACAAT ACTGGCACAC GGCGACGCAG ACGGGGTCTG CTCGGCGGCA CTAGTGAAGG CGGCGCTAAG AGATCAGTAC AGCGAAATTC AGGTGGTATT CACCCACCCT GTGGATCTGG TCAAGGACTT CCAGCAATAC GCAAGGGGAG ACGTCTACAT AGTAGACGTG GCGATAGATG AGAAGGCGGC CCAGGAGGTG CAAAAACTAT TTAGAGCCTA CGGCGGCAGG GTCGTGTACA TCGACCACCA CCCCCTGTCA GCTGATTTGG CCGGAGCTGA GGTGATCCAC GAAGAGGGCC CCTCCGCCTC GGAGCTCACC TATAGGAAGC TAGGCGGCTT GTTGCCTCCT TCTTATAGCC GGGTTGCGCT GTACGGCGCC ATTAGCGACT ACATGGACTA CACAGAGTGG GTCAAGTCGG CGCTGGAAAA GTGGGACAAG CGTATTGTAT ACTTCGAGGC CGGGGTCCTA ATGCAGGGCC TAGAGCGGGC GCGCAAAGAC CACGACTTCA AGCGGGCGGT GGTGGACCAC CTAGCCGAGA ACAGGACGCC CTCCTCCATG GAGAGGCTCA TGAAGCTGGC GGAGGAGCAA GCGGGGATAA ACGAGGCCCT GGTGGGCTGG GTTGAGAGAT ACGTGGCTAA GAGAGGCGGC GTCGCCTTCG TCGTCAATCC ACCAGGCCCC CTCGGCCTCG CGGCCAACTT AGCGAGGGGT CTCACAGACT CGCCCGTGGG AATAGCGGCG GAGGAGCGGG GTGACATATA CGTGATGAGC CTCCGCTCTG TCCAAGTGGA CCTAAACCAG TTTCTAAGGG ATTTTGCCCG GCGCTATTCC GTATCTGGCG GCGGCCACAA AAACGCCGCG GGGGCGCGCA TCCCGAAGCG CCTTTTCGAC GTCTTCGTAG AGGAGCTCAG CTCGTATATA TCACGGCTGA GATGGGGCCC TGCTTTCTCT CAATAG
|
Protein sequence | MGRRLTILAH GDADGVCSAA LVKAALRDQY SEIQVVFTHP VDLVKDFQQY ARGDVYIVDV AIDEKAAQEV QKLFRAYGGR VVYIDHHPLS ADLAGAEVIH EEGPSASELT YRKLGGLLPP SYSRVALYGA ISDYMDYTEW VKSALEKWDK RIVYFEAGVL MQGLERARKD HDFKRAVVDH LAENRTPSSM ERLMKLAEEQ AGINEALVGW VERYVAKRGG VAFVVNPPGP LGLAANLARG LTDSPVGIAA EERGDIYVMS LRSVQVDLNQ FLRDFARRYS VSGGGHKNAA GARIPKRLFD VFVEELSSYI SRLRWGPAFS Q
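The gene and protein sequences above can be related with a short translation routine. This is a minimal sketch using only the Python standard library, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in permitted start codons), and it assumes the input is the coding strand in 10-base blocks as shown above. The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Any trailing partial codon is ignored.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence above.
    print(translate("ATGGGCAGGC GGTTGACAAT"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) gives a quick consistency check, and `gc_fraction` can be compared against the GC content field.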