Gene Pars_2056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2056 
Symbol 
ID5054847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1836604 
End bp1837782 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content57% 
IMG OID640469605 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001154254 
Protein GI145592252 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.317337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.564481 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAGA TGTTGTATGT GTTGGCGCTG GCGTTGCTAG TCGTGTATAT ACAAGCGGCG 
AGGGTGGTTG TGGGCTACGC TGATGCGCCT CCCGACATGG CTCACATCAA CGCCACGGGC
GATGTGAAGG TTTTGAAACA TCTAAAAGAG ATAAAGGCGC TTGTGCTGAA CATCCCGGAT
CACAAGGTCG GCGAGTTGAA GAAGTTGAAG GGGATTAAAT ACGTCGAGGA GGACAAGATT
GCGAAGGCCT TCGGGTTCGG CGACTATGCA GACGTGCAGT GGAACGTGAA GATGATAAAT
GCGCACCTCG TGTGGGATCA GTACTTCGTT ACAATAGGCG ATGCCGCATT TGGCTACGGG
GTGACTGTGG CTGTGTTGGA TACCGGCATA GACTACAAAC ACCCACAGCT CTCAGGCAAG
GTGGTGTACT GCATATACAC AGTGGGTACT AGGCTCTACA AGGGGACAAA CCTAGGCAAC
TGTGCCGATA GAAATGGCCA CGGGACGCAC GTGGCTGGCA TAATCGCCGC CTCGCTTGAC
AACGTCGGCG TTGCCGGCGT GGCGCCGAAG GTCAAGCTTA TAGCTGTGAA GGTGCTCTCG
GACTCGGGCT CTGGATACTA CAGCGACATA GCGGAGGGAA TTATAGAGGC GGTAAAAGCC
GGTGCGACGA TCCTCTCTAT GTCGCTGGGC GGGCCCTCAG ACTCTTCTGT GCTTAGAGAT
GCCTCGTATT GGGCGTACCA GCAAGGCGCC GTCCAGGTAG TGGCCGCTGG CAACTCGGGC
GATGGTAACC CATCGACGGA CAACGTCTCG TACCCCGCTA GGTACAGCTG GGTAATTGCG
GTAGCCGCCG TTGACCAAAA CGGCGTCGTC CCCACGTGGA GTAGCGACGG CCCCGAGGTA
GACGTGGCCG CGCCCGGCGT AAACATCTTA TCCACGTATC CAGGCGGCAG ATATGCGTAT
ATGTCCGGCA CATCTATGGC CACGCCGCAT GTCACAGGCG TAGTGGCGCT GATTCAAGCC
ATCAGGTTGG CCTACGGCAA GAGGTTACTC ACGCCGGACG AGGTGTATCA AGTCCTCACC
ACGACTGCAA GGGACATAGG GACGCCGGGA TTTGACGTAT TCACCGGATA CGGCCTCGTG
GACGCCTACG CCGCGGTTGC CTCCGCCCTT GCTAAGTAG
 
Protein sequence
MSKMLYVLAL ALLVVYIQAA RVVVGYADAP PDMAHINATG DVKVLKHLKE IKALVLNIPD 
HKVGELKKLK GIKYVEEDKI AKAFGFGDYA DVQWNVKMIN AHLVWDQYFV TIGDAAFGYG
VTVAVLDTGI DYKHPQLSGK VVYCIYTVGT RLYKGTNLGN CADRNGHGTH VAGIIAASLD
NVGVAGVAPK VKLIAVKVLS DSGSGYYSDI AEGIIEAVKA GATILSMSLG GPSDSSVLRD
ASYWAYQQGA VQVVAAGNSG DGNPSTDNVS YPARYSWVIA VAAVDQNGVV PTWSSDGPEV
DVAAPGVNIL STYPGGRYAY MSGTSMATPH VTGVVALIQA IRLAYGKRLL TPDEVYQVLT
TTARDIGTPG FDVFTGYGLV DAYAAVASAL AK