Gene Pars_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1643 
Symbol 
ID5054588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1482688 
End bp1483701 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content62% 
IMG OID640469186 
Productpeptidase M48, Ste24p 
Protein accessionYP_001153848 
Protein GI145591846 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.691275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGTGG GGGAGGTACT GCTGGTATAT ATAGCGGTTC TGGGGGCGTC TATATACCTA 
GCACCGCGAG TCGTAGGGAC TAGGCGGTGG AAGCTCGGCT TCTACGGGCT AATGGCGCTG
GCCGTTGCCG GAATAGTCCT CACCGCATAC TACGTCTTGT CTCCCCTCCT CTTACCCATA
GTCCTGCTGT TCCAAGCCGC CACCGGCATT AAGGACTACG TACTGGCGTT CGTAGCCCTC
GTGGCCACCT CCGCGTTTAT TATGTACCTG GTAGCGCCGT TCCTCATAAA CGCCGCCTTC
TCCCCGCGCC CCGACCCCTA CCTACAAGGC GTCGTCGATG AGGTCGCCGC CAGGATAGGC
AGGCGGGTGA GGGCCAAGGC CGTGGTCGTG GACGGGCCGC CCAACGCCTT CGCCTACGGC
AACTTCCTCT CCGGCCGATA CGTGGCGGTT ACGACGGGGC TACTGAAAAT CGCAAACCAA
GACGAGCTGA GGGCCGTGAT AGGCCACGAG CTGGGCCACC ACGCCAACCG CGACAACGAG
GTCATGTCAG CCCTGGGGAT CCTCCCATCG CTGGCCTACT ACACCGGAGC CGCCGCCATA
GCCATCGGCC TAGCCAACAG GGAGAGGCCC GGGCTCCTGG CCGTTGCATA CGGCGTCGTT
ATGATAGTCG TGTCCTTCAT AATCCAGCTC CTGGTCATGG CCTTTAGCAG GCTCCGGGAG
TACTACGCCG ACATGCACGG CGCCCGCGCC GCGGGGAAAG AGGCGATGAT GTCAGCCCTC
GCCAAGATAC ACCAGTATTA TAAAAACGCC CCAGAAGAGC TACAAGCCGC GCCCAAGACC
TCCGGCTTCA AAGCCCTATT CATATACGCC CTCGTCGAGG CCGCCGCCAG CCCATTCGCA
GACCAGATCC GCCTCCTCAT GAACGAGCGC ACCTCCTGGC TCGAGGAGCT ACTATCCTCC
CATCCACCCA TACCCAAGAG GCTGAGATTC CTCGCCGCGT TGCCCGCCCT CTAA
 
Protein sequence
MGVGEVLLVY IAVLGASIYL APRVVGTRRW KLGFYGLMAL AVAGIVLTAY YVLSPLLLPI 
VLLFQAATGI KDYVLAFVAL VATSAFIMYL VAPFLINAAF SPRPDPYLQG VVDEVAARIG
RRVRAKAVVV DGPPNAFAYG NFLSGRYVAV TTGLLKIANQ DELRAVIGHE LGHHANRDNE
VMSALGILPS LAYYTGAAAI AIGLANRERP GLLAVAYGVV MIVVSFIIQL LVMAFSRLRE
YYADMHGARA AGKEAMMSAL AKIHQYYKNA PEELQAAPKT SGFKALFIYA LVEAAASPFA
DQIRLLMNER TSWLEELLSS HPPIPKRLRF LAALPAL