Gene Pars_1519 details


Gene Information

Locus tag: Pars_1519
Symbol:
ID: 5054307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1377741
End bp: 1379153
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 55%
IMG OID: 640469059
Product: ATP-dependent protease La
Protein accession: YP_001153725
Protein GI: 145591723
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4930] Predicted ATP-dependent Lon-type protease
TIGRFAM ID: [TIGR02653] conserved hypothetical protein
            [TIGR02688] conserved hypothetical protein TIGR02688
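
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the database record. It assumes the start and end positions are inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1377741
end_bp = 1379153
protein_length_aa = 470

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons. Assumption: the last codon is the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp, remainder, expected_protein_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1413 bp) and Protein Length (470 aa).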


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGC TGAATAACAA GGTGAAGAGG TGTTTCGGCG ACTACGCCGT GGATAAGAGA 
CTCGCCTACG AGCTTGAGCT GGCCAAGTTG CCTAGATACG TGGCGGAGTT CCTCATCTCG
GAGTTTATGA TTCAAGGCGG GGACTGGGAG GGCAAGCTGA GGAGCTTCAT TAGGGAGCGC
TACTACGAGC CTGAGGAGAA GGAGGTGGTT AAGCACAAGC TGGTGACGGA GGGGGTGGTG
GAGCTTATCG ACGAGCTTAG GGTATACGTA GATGTGGAGA CGGGGGCCCA CATAGGCGTC
ATACACTCTC TTGATATATG GGCTGAGGTG CCGGTGGACA TCGTCGAGAG GAACAGGGCA
ACGCTGACAA CCGGCATGTG GGGGTTGATA ACTCTGCAAC GGTGGGAGGG GGCCAAGGAG
GTTTTGGGGA GGCCCACGTC CGTCGTTATA ACCGACTTCA AGCCCTTCCA GGCGCCGGAT
ACAGATCCCA AAATCCTGGA GGAGGGGCGG AGGTGCTTCA CGCTGGAGGA GTGGGTAGAG
GTTTTGATAA ATACCATAGG TCTCGACCCC GCTGTGTACA GCCCCCGGCA GAGGCTCCTC
CTCCTCGCCC GACTAGTCCC CTTAGTGGAG GGGAATGTAA ATATGGCTGA GTTTGGGCCT
AGGCAGACTG GCAAGACGTA TCTCTACAGA AATGTGAGCA ACTATGTCAG GATAATCTCA
GGCGGCGTCA TATCCCCAGC CGCCTTGTTC TACAATTTGA GGACTAAGGT GCCGGGGGAG
CTGGCCCTCA AGGACGCGGT GGTTTTTGAC GAGGTGAGTA AGGTGAGGTT TCCCAACCCC
GACGAGATGA TGGGCAAGCT TAAGGACTAC ATGGAGAGCG GCCACTACGA GAGGGGGGAC
AAAAAGGTGG TGTCCGACGC CTCTCTGGTC TTCATGGGCA ACGTGTCGGT GGAGCACACG
TCGGAGGGCT ACGTGCCGGT GGAGGACTTG ACCTACGTCT TGCCGGAGCC TATGAGGGAT
TCGGCGTTTA TTGACAGGAT ACACGGTCTT CTGCCAGGTT GGGAGTTTCC TAAAATATCG
CAGAGCAAGT ACCACCTTTC TAAGAGCTAC GGCGTAGCAT CCGACTACTT CGCCGAGGCG
TTGCACGGCA TGAGGAAGGA GAGCTTGTCA GGACTTGTTG GGAGGCACGT GGAGCTTTCC
GAAAACTTCA AAATTAGGGA CGAGAAGAGT TTTAAGAGAA TTACCAGCGG TTTGTTAAAG
CTTCTATTTC CCGACAAGAC TTTTGACAAG AAGGAGCTTA AAACCATCGC GGAGTTCGCG
CTAGAGATGA GGCAGAGGGT CAGAGACTGG TTGCACAAAA TCGCACCGGG GGAATTCCCA
CGCGAAATCC TCAGCGTGGG AGTTCTGCCA TAA
 
Protein sequence
MSELNNKVKR CFGDYAVDKR LAYELELAKL PRYVAEFLIS EFMIQGGDWE GKLRSFIRER 
YYEPEEKEVV KHKLVTEGVV ELIDELRVYV DVETGAHIGV IHSLDIWAEV PVDIVERNRA
TLTTGMWGLI TLQRWEGAKE VLGRPTSVVI TDFKPFQAPD TDPKILEEGR RCFTLEEWVE
VLINTIGLDP AVYSPRQRLL LLARLVPLVE GNVNMAEFGP RQTGKTYLYR NVSNYVRIIS
GGVISPAALF YNLRTKVPGE LALKDAVVFD EVSKVRFPNP DEMMGKLKDY MESGHYERGD
KKVVSDASLV FMGNVSVEHT SEGYVPVEDL TYVLPEPMRD SAFIDRIHGL LPGWEFPKIS
QSKYHLSKSY GVASDYFAEA LHGMRKESLS GLVGRHVELS ENFKIRDEKS FKRITSGLLK
LLFPDKTFDK KELKTIAEFA LEMRQRVRDW LHKIAPGEFP REILSVGVLP
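
The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be used. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. The codon table is built from the amino-acid string that NCBI lists for translation table 11, which assigns the same amino acids as the standard code. The sketch does not handle alternative start codons, which is harmless here because the gene starts with ATG.

```python
from itertools import product

# Codon order follows NCBI's layout: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0 until the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGAGCGAGC TGAATAACAA GGTGAAGAGG"))  # MSELNNKVKR
```

The first ten residues produced this way match the start of the listed protein sequence. Passing the full gene sequence to the same functions would give the complete translation and a GC fraction to compare with the record.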