Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1198 |
Symbol | |
ID | 5055983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1085469 |
End bp | 1086944 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640468746 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001153419 |
Protein GI | 145591417 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.135202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.356317 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATTT TGCTCGCGGC GGCGGGGCTC GTGCTGGTAT ATGCGGGTAT TGTCTCTTTG GTAAGTCCAG GCGAGCCCGC CGCCTCTCCA CAACCTTGGG GATTCAGCAG AGGCATAGCC GTGAGGCCCC CGGAGGTGAG CTTGTCGGGT GGGCTGTACT TCAAGACCTA TCTCTGGGCC CGGGGCGGGG GCGGGGTCTT GTACCTACGT TGCTACGTGT ATAGTAGGTA CGAGGGGGGG ACGTGGTTGC CAACTCCGGA GGAGTACCCA GTCCTCGGCG TCTACTCCGT GACGGTGGGG AGAGGTCCTT TTACAGAGGG CGGAGCTTTG AACTTAACTC TGCCGCTTAT GGGCGGCTGC GTCCCGGTGG CTACACCTTC TGTAGACGGG CTTCAGCTGG GGTCGATAAA GGTGTCCGCT CCCGGCGCCA ACTTAGCCGC CAGCCGCACA GGCCTGTACG TAGCATACGC CGGGGGGAGA TTGGGCGAGG TGACCTCTTA CTACGGCCCA GGTGATTTGC CGCCTTCGCC GGATGACTTG TCGCTCCCTT CCGGGCAGGC CGGGGTTCTG CTGGCTCTTG CCCGAAACAT AACCGCCGGT TGTGGAGATG TGTCGTGTAA GGTTGAGAGG ATTAAAGAAT TTCTAAAGGG GTTTACCTAC GACGGGACAA TGGATGCGCC ATGGCCCCAT ATCCCCCCCG GCGTGGACCC CTTGATGTGG TTTCTCCAAA ACAAGAGGGG GGTCTGCGTC CACTTCGCCA CAGCCTTTGT AATGTTGGCG AGGGCTTCCG GGGTGTATGT CAGGCTCGTG GTTGGCTACA TGAGCGATGG CCCCGTGCCG ACGCAGTGGT CGCTGACGGC CTTCAGCCCC CACGCCTGGG CTGAGTACTA CCAGCCGGGC GTGGGCTGGA TCGGGGTAGA GGCGACTCCT CCCATGGGCG CCCCATCGCC GGCGGCGCCG CCGCCTCGCG AGACGCCTAC CCCAGCCGCG ACGACGCCGG AGGTACCGCC CGCCTCGCCG GGTCCCTACC AGTGGCCATC TATTGGCCTC GGCGTGTTTC TGCCTCTGGC TGGGATGGCC GCAGCGGCGC TGGTGGGTGG GGCCCTCTTC AAGAAGAGGG TGGTAATCAC TGTGGGGGAG GCGCTGAGGG TGGGCGCACC CAGGGGGTTT TGGGTGTATG TTAACAGAAG GCGAGTGGGA CGCGCGCCTG TGGAGATCGT CTTCGACAAG CCTGGCCTCT ACCTCGTAGC GGTGGGGCCC TTTGTGCGGG TGGTGAGGGT TGTGGACTAC AGGTCTATGG CTGGGAGGGC TTTTGAGAAG CTCCTCAAGA AGCTTAAACT CCCCCCTTCA GCTACGCCGC GGGAGGTCGC TGAGAGGTTT CCGCAGTACC GCGAAGTGGC GCTTCTCGTC GAGAAGCTTA GATTCGGCCC ACGGGCCGAC AATGAAGACT ACAGGCGGCT AAGGGAGATG TTATGA
|
Protein sequence | MRILLAAAGL VLVYAGIVSL VSPGEPAASP QPWGFSRGIA VRPPEVSLSG GLYFKTYLWA RGGGGVLYLR CYVYSRYEGG TWLPTPEEYP VLGVYSVTVG RGPFTEGGAL NLTLPLMGGC VPVATPSVDG LQLGSIKVSA PGANLAASRT GLYVAYAGGR LGEVTSYYGP GDLPPSPDDL SLPSGQAGVL LALARNITAG CGDVSCKVER IKEFLKGFTY DGTMDAPWPH IPPGVDPLMW FLQNKRGVCV HFATAFVMLA RASGVYVRLV VGYMSDGPVP TQWSLTAFSP HAWAEYYQPG VGWIGVEATP PMGAPSPAAP PPRETPTPAA TTPEVPPASP GPYQWPSIGL GVFLPLAGMA AAALVGGALF KKRVVITVGE ALRVGAPRGF WVYVNRRRVG RAPVEIVFDK PGLYLVAVGP FVRVVRVVDY RSMAGRAFEK LLKKLKLPPS ATPREVAERF PQYREVALLV EKLRFGPRAD NEDYRRLREM L
|
| |