Gene Pars_0863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0863 
Symbol 
ID5054563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp765957 
End bp767615 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content60% 
IMG OID640468423 
Productpeptidase S49 
Protein accessionYP_001153100 
Protein GI145591098 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGCAACA TGGTCGATAG GTGGGTTGTG GCTGGCCTGG CTCTAGCCTT GGCCGTCAGC 
GCCGCCGCGC TTTTTGTTGC ATATACACAC TGCGAGAAGG GACAGACCCC CGCGGCTCCT
CCGAAGCCTA AGATAGTCCT GGTGCCTATC GACTTCGTGC TAGACTCGTC CCCAGTTGAC
GGGGTTATAC GCGACCTTGT TAGCCTTGCT CAGCGCAAGG ACGTGGCTGG GGTGGTGCTG
GTGATAAACT CGCCTGGCGG GACGGTGTCG GCAACTGAGG CGCTCTATTC GGCGCTGGCC
GGGCTGAACA AAACCAAGTA CGCCGTGGTC AACGGCCTCG CCGCGTCCGG CGCCTACTAC
GTCGCGATGG CCGCCGAAAA GATATACGCA ACGCCGTCGA GCTGGGTGGG GAGCATCGGC
GTGGTGGCCG TGATGTGGCC CGACGAGTAT TTCTACGACA TCCCCGACTA CGTCTACACC
ACAGGCCCGC TGAAGTACTA CGGCAAGGAG CTCACCGACT ACTACAACGA CATAGAGAGA
GTGAGGATGA ACTTCGTGCA GGCCGTGCTC AGGGGCAGGG CTGGGAGGAT AAAGGCAGAT
CCCCAGGTGT TCGAGACTGC AGCTATCTTC ACGGCCGAGG ATGCGCTACG GCTGGGCCTT
GTAGACGCAG TGGGCGGCGT CTTCGACGCC GTGAGGGATA TGGCCCAGAG GCTTGGGCTT
AAGGAATACG AGGTAAAGTT CCTTCGCGAG CTCGGCAACG CCTCGGCTTC TCCAGCCGCC
GCGTTTAGGG TTGACCTGGA GAGGCTGATG AACTCCACAC CGGTCCCCAT CTTCTACATC
CTCCCCACAG CGGTGCAGTG GCGGGTGACT ACAAACGGGG CCTCCAACGC CACGGCGGCT
ACCCCTGCCA GGCCCGAGAA GCCCTACGTC CTGCTAGACC TGGCCCACAA CAACATGGTG
CCTAGGAGCT TTATAGAGGT CCTCAGGGCA GAGCTGGCGC AGAGGGGGTA CGTCTTAGTA
ACCGCCTCTA GCGAGTACCA GCTCACCACG CTCCTCTCAA ACGCCACCGC CCTCGTGGTG
GTGAACCCCA CAGCTCCTTT CAGCAAAGAC GCCGTTAGGG CCGTGCTGAA CGCCACGGCT
AGGGGGGTCA GAGTCGCCTA CTTCTACGAC CTCAGGGCAA GCGCCATAAT CACAGTCAGC
GGCGTCTCCT ACGTGGCCCC CTATTCGTCA TATACCATCT TTGACCCCCT GCCAATGTAC
TACAACATGT CGGGGCTGAG GGCCGTCTAC AACTTCACAG CTGGCGGGCT GAACTACACC
CAGAACTGGC AGTATGTCGA GGCCAGGCCC AGGGGCGACT GGCCGTTACT GAGAGGCGTG
GAGAGGCTCC TCCTCTTCAG CCCGTCGGCT GTGTCGACAA ACGCGCCGTA TAGGCTAGAG
GTCCGGGGCT ACGTCTTCGG CTACGGCTGG GGGAACTACA CAGTGGCGGC GCAGACCGGC
AACTTCACCT TTATAGGCGC CGTGCGCTCC TTCACACCAT ACTTCATAAC GCTGGCCGAC
AACTGGCACT TCTTTAAGAA CATTGTGGAC TGGCTTGTCG AACCAAGACC TATTGAGAGA
AAGCAGGGCC CCATCTCAGC CGTGATATAT ACGAGCTGA
 
Protein sequence
MCNMVDRWVV AGLALALAVS AAALFVAYTH CEKGQTPAAP PKPKIVLVPI DFVLDSSPVD 
GVIRDLVSLA QRKDVAGVVL VINSPGGTVS ATEALYSALA GLNKTKYAVV NGLAASGAYY
VAMAAEKIYA TPSSWVGSIG VVAVMWPDEY FYDIPDYVYT TGPLKYYGKE LTDYYNDIER
VRMNFVQAVL RGRAGRIKAD PQVFETAAIF TAEDALRLGL VDAVGGVFDA VRDMAQRLGL
KEYEVKFLRE LGNASASPAA AFRVDLERLM NSTPVPIFYI LPTAVQWRVT TNGASNATAA
TPARPEKPYV LLDLAHNNMV PRSFIEVLRA ELAQRGYVLV TASSEYQLTT LLSNATALVV
VNPTAPFSKD AVRAVLNATA RGVRVAYFYD LRASAIITVS GVSYVAPYSS YTIFDPLPMY
YNMSGLRAVY NFTAGGLNYT QNWQYVEARP RGDWPLLRGV ERLLLFSPSA VSTNAPYRLE
VRGYVFGYGW GNYTVAAQTG NFTFIGAVRS FTPYFITLAD NWHFFKNIVD WLVEPRPIER
KQGPISAVIY TS