Gene Information |
Locus tag | Pars_1255 |
Symbol | |
ID | 5055791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1134028 |
End bp | 1135524 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640468798 |
Product | restriction endonuclease |
Protein accession | YP_001153471 |
Protein GI | 145591469 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
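The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch like the one below. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1134028
end_bp = 1135524

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which does not
# contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1497 498
```

Both results agree with the Gene Length (1497 bp) and Protein Length (498 aa) fields.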
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.185862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.543832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGTCC TGGATATCTT AACGTCTCTC AGCTCTGAAG AGTTTGAAAA ATATGTAGCT GACTATGTGC TACCGGTCTT GGGCTTAAGA GTTCACAACG TTGTGGGTGG GCCGTACGAC AGAGGTTGTG ACATAATTGC TGAGGACACG CGGTTTGGGA GTAGGGTATG CGTCCAGGTT AAGAGGTACT CGCCGGAGAG GAAAGTGACG GAGAAAGATG TCAGAAACGT TTTGTTCGGC ATGGAGCAAC ACCGCTGTGA CCGCGGGCTC ATTGTCACCA CCTCTGATCT CAACGGACCT GCGCTGAGCT TAGCGAGGCA GTACCGGATA GACTACATAA ACGGCGCGAG GCTTGCCAGG ATGGTGGAGG AGCAGTTAAT TCCCCTGGTG ATGCCCAAGG CCGTGGTCGC AGCGGTGCAT CAAGAAGAGG GGAGTCATGA GGCCGTGGAG AGGGAGGTGA GGGATGACGG GGTGTTTATC CCGCTGGGAG TCACCAACGC CGTGGAGGTT GCAAGGGCCT ATCTGAAGTC CAAGGGGGCG CTACATCCTC AGCTCGGGGG CGTATCCGCA CTTTTGAAAA GGCTCTACGT GTTTAAAGCC AAGGCGAGCT ACAAGCTGGG GAGGAGGAGG TCGGAGGAGG CCGTAATTTC GGTGGACGCA GAGGGTGAGG TATACGAGGG CGTGCCGCCT CTGATCAATA CGGTTAACTT CTATGTGGAG TACGAGACGA GCAGGGAGGA CTACTACTCG GCCCGGGAAA TCGCCATCCG CTATATCACA AGTAGAATAG TCCCGGAGGG GGCGCAAGAT ATCAAGATCC AGCTCAAGAA CCACGCCCTT GCGTGGGTTG CGGCGCTGTA CGCCATACGC TTTAAGGTGG GGCTCGTTGA CGTGGTTGTA CACGTCGATA AGAAGGGAAG AGTTGTGAAA ATGGAGCGAG GCCGCCTTAC CGATGACTTA GTGAGGGGAG CGTACGGCGG CGAGGTGGTG AGAGGCGATG GTTACAAGGT GAGGCTAGAT CAGGGCAATT TCGTGGAGGA GCTAAAGCTC AACGAGTTTG GAGAGGTTGT GGCAAGGGCT CGGGCAGTCA AGGAGAGCTA CGCCGTGGAG GTGGCATCTA AGTTTTTCGG CATTGCCGGG GAGGACGTGA GGTATAAACG CGAAGGCGGC GCGGTAAAAG TAGACATTTT TCTAAACGGC CACCACCACC TCGCCAAAGT GGACGAAAAC GGCGAGGTGG TTGACTACGT GGTGGTGCCC GACGCAGAAA TTTACGAAGG CTTCGAAAAG GGGTATAACA TAAGGATGAG GGCGCTCATT GTGAAGACAG TGGAAGATGG CGAAGAAGTG GTGCGGGTTG TGACAAGCGA AGGCGTAGTT GACGAAAAAA GAGCGAAGAG GTCTCTTCTA AGGAAAATCG GGAGCAGTCT AGCGGGGCTT GTCAAGAAGT CGGAGGAGTA CTCAATAGAT ACGGCTGACC CCCTCAACTT AATCTAG
|
Protein sequence | MGVLDILTSL SSEEFEKYVA DYVLPVLGLR VHNVVGGPYD RGCDIIAEDT RFGSRVCVQV KRYSPERKVT EKDVRNVLFG MEQHRCDRGL IVTTSDLNGP ALSLARQYRI DYINGARLAR MVEEQLIPLV MPKAVVAAVH QEEGSHEAVE REVRDDGVFI PLGVTNAVEV ARAYLKSKGA LHPQLGGVSA LLKRLYVFKA KASYKLGRRR SEEAVISVDA EGEVYEGVPP LINTVNFYVE YETSREDYYS AREIAIRYIT SRIVPEGAQD IKIQLKNHAL AWVAALYAIR FKVGLVDVVV HVDKKGRVVK MERGRLTDDL VRGAYGGEVV RGDGYKVRLD QGNFVEELKL NEFGEVVARA RAVKESYAVE VASKFFGIAG EDVRYKREGG AVKVDIFLNG HHHLAKVDEN GEVVDYVVVP DAEIYEGFEK GYNIRMRALI VKTVEDGEEV VRVVTSEGVV DEKRAKRSLL RKIGSSLAGL VKKSEEYSID TADPLNLI
|
| |
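The link between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG even though the Strand field is "-") and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. The `translate` helper and the constant names are hypothetical, written for this example only; only the first 30 bases of the record are used as input.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Translation table 11 shares these amino acid assignments with the
# standard code; it differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGGCGTCC TGGATATCTT AACGTCTCTC"))  # MGVLDILTSL
```

The output matches the first ten residues of the protein sequence listed above.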