Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1519 |
Symbol | |
ID | 5054307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1377741 |
End bp | 1379153 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640469059 |
Product | ATP-dependent protease La |
Protein accession | YP_001153725 |
Protein GI | 145591723 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4930] Predicted ATP-dependent Lon-type protease |
TIGRFAM ID | [TIGR02653] conserved hypothetical protein [TIGR02688] conserved hypothetical protein TIGR02688 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGC TGAATAACAA GGTGAAGAGG TGTTTCGGCG ACTACGCCGT GGATAAGAGA CTCGCCTACG AGCTTGAGCT GGCCAAGTTG CCTAGATACG TGGCGGAGTT CCTCATCTCG GAGTTTATGA TTCAAGGCGG GGACTGGGAG GGCAAGCTGA GGAGCTTCAT TAGGGAGCGC TACTACGAGC CTGAGGAGAA GGAGGTGGTT AAGCACAAGC TGGTGACGGA GGGGGTGGTG GAGCTTATCG ACGAGCTTAG GGTATACGTA GATGTGGAGA CGGGGGCCCA CATAGGCGTC ATACACTCTC TTGATATATG GGCTGAGGTG CCGGTGGACA TCGTCGAGAG GAACAGGGCA ACGCTGACAA CCGGCATGTG GGGGTTGATA ACTCTGCAAC GGTGGGAGGG GGCCAAGGAG GTTTTGGGGA GGCCCACGTC CGTCGTTATA ACCGACTTCA AGCCCTTCCA GGCGCCGGAT ACAGATCCCA AAATCCTGGA GGAGGGGCGG AGGTGCTTCA CGCTGGAGGA GTGGGTAGAG GTTTTGATAA ATACCATAGG TCTCGACCCC GCTGTGTACA GCCCCCGGCA GAGGCTCCTC CTCCTCGCCC GACTAGTCCC CTTAGTGGAG GGGAATGTAA ATATGGCTGA GTTTGGGCCT AGGCAGACTG GCAAGACGTA TCTCTACAGA AATGTGAGCA ACTATGTCAG GATAATCTCA GGCGGCGTCA TATCCCCAGC CGCCTTGTTC TACAATTTGA GGACTAAGGT GCCGGGGGAG CTGGCCCTCA AGGACGCGGT GGTTTTTGAC GAGGTGAGTA AGGTGAGGTT TCCCAACCCC GACGAGATGA TGGGCAAGCT TAAGGACTAC ATGGAGAGCG GCCACTACGA GAGGGGGGAC AAAAAGGTGG TGTCCGACGC CTCTCTGGTC TTCATGGGCA ACGTGTCGGT GGAGCACACG TCGGAGGGCT ACGTGCCGGT GGAGGACTTG ACCTACGTCT TGCCGGAGCC TATGAGGGAT TCGGCGTTTA TTGACAGGAT ACACGGTCTT CTGCCAGGTT GGGAGTTTCC TAAAATATCG CAGAGCAAGT ACCACCTTTC TAAGAGCTAC GGCGTAGCAT CCGACTACTT CGCCGAGGCG TTGCACGGCA TGAGGAAGGA GAGCTTGTCA GGACTTGTTG GGAGGCACGT GGAGCTTTCC GAAAACTTCA AAATTAGGGA CGAGAAGAGT TTTAAGAGAA TTACCAGCGG TTTGTTAAAG CTTCTATTTC CCGACAAGAC TTTTGACAAG AAGGAGCTTA AAACCATCGC GGAGTTCGCG CTAGAGATGA GGCAGAGGGT CAGAGACTGG TTGCACAAAA TCGCACCGGG GGAATTCCCA CGCGAAATCC TCAGCGTGGG AGTTCTGCCA TAA
|
Protein sequence | MSELNNKVKR CFGDYAVDKR LAYELELAKL PRYVAEFLIS EFMIQGGDWE GKLRSFIRER YYEPEEKEVV KHKLVTEGVV ELIDELRVYV DVETGAHIGV IHSLDIWAEV PVDIVERNRA TLTTGMWGLI TLQRWEGAKE VLGRPTSVVI TDFKPFQAPD TDPKILEEGR RCFTLEEWVE VLINTIGLDP AVYSPRQRLL LLARLVPLVE GNVNMAEFGP RQTGKTYLYR NVSNYVRIIS GGVISPAALF YNLRTKVPGE LALKDAVVFD EVSKVRFPNP DEMMGKLKDY MESGHYERGD KKVVSDASLV FMGNVSVEHT SEGYVPVEDL TYVLPEPMRD SAFIDRIHGL LPGWEFPKIS QSKYHLSKSY GVASDYFAEA LHGMRKESLS GLVGRHVELS ENFKIRDEKS FKRITSGLLK LLFPDKTFDK KELKTIAEFA LEMRQRVRDW LHKIAPGEFP REILSVGVLP
|
| |