Gene Information |
Locus tag | Pars_2056 |
Symbol | |
ID | 5054847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1836604 |
End bp | 1837782 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640469605 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001154254 |
Protein GI | 145592252 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
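
The coordinate and length fields above can be cross-checked against each other. The sketch below is only a consistency check on the values in this table. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 1836604
end_bp = 1837782
gene_length_bp = 1179
protein_length_aa = 392

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
residues = codons - 1

print("span (bp):", span_bp)
print("codons:", codons, "remainder:", remainder)
print("residues excluding stop:", residues)
```

Under these assumptions the span matches the listed Gene Length, and the codon count minus the stop codon matches the listed Protein Length.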
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.317337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.564481 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAGA TGTTGTATGT GTTGGCGCTG GCGTTGCTAG TCGTGTATAT ACAAGCGGCG AGGGTGGTTG TGGGCTACGC TGATGCGCCT CCCGACATGG CTCACATCAA CGCCACGGGC GATGTGAAGG TTTTGAAACA TCTAAAAGAG ATAAAGGCGC TTGTGCTGAA CATCCCGGAT CACAAGGTCG GCGAGTTGAA GAAGTTGAAG GGGATTAAAT ACGTCGAGGA GGACAAGATT GCGAAGGCCT TCGGGTTCGG CGACTATGCA GACGTGCAGT GGAACGTGAA GATGATAAAT GCGCACCTCG TGTGGGATCA GTACTTCGTT ACAATAGGCG ATGCCGCATT TGGCTACGGG GTGACTGTGG CTGTGTTGGA TACCGGCATA GACTACAAAC ACCCACAGCT CTCAGGCAAG GTGGTGTACT GCATATACAC AGTGGGTACT AGGCTCTACA AGGGGACAAA CCTAGGCAAC TGTGCCGATA GAAATGGCCA CGGGACGCAC GTGGCTGGCA TAATCGCCGC CTCGCTTGAC AACGTCGGCG TTGCCGGCGT GGCGCCGAAG GTCAAGCTTA TAGCTGTGAA GGTGCTCTCG GACTCGGGCT CTGGATACTA CAGCGACATA GCGGAGGGAA TTATAGAGGC GGTAAAAGCC GGTGCGACGA TCCTCTCTAT GTCGCTGGGC GGGCCCTCAG ACTCTTCTGT GCTTAGAGAT GCCTCGTATT GGGCGTACCA GCAAGGCGCC GTCCAGGTAG TGGCCGCTGG CAACTCGGGC GATGGTAACC CATCGACGGA CAACGTCTCG TACCCCGCTA GGTACAGCTG GGTAATTGCG GTAGCCGCCG TTGACCAAAA CGGCGTCGTC CCCACGTGGA GTAGCGACGG CCCCGAGGTA GACGTGGCCG CGCCCGGCGT AAACATCTTA TCCACGTATC CAGGCGGCAG ATATGCGTAT ATGTCCGGCA CATCTATGGC CACGCCGCAT GTCACAGGCG TAGTGGCGCT GATTCAAGCC ATCAGGTTGG CCTACGGCAA GAGGTTACTC ACGCCGGACG AGGTGTATCA AGTCCTCACC ACGACTGCAA GGGACATAGG GACGCCGGGA TTTGACGTAT TCACCGGATA CGGCCTCGTG GACGCCTACG CCGCGGTTGC CTCCGCCCTT GCTAAGTAG
|
Protein sequence | MSKMLYVLAL ALLVVYIQAA RVVVGYADAP PDMAHINATG DVKVLKHLKE IKALVLNIPD HKVGELKKLK GIKYVEEDKI AKAFGFGDYA DVQWNVKMIN AHLVWDQYFV TIGDAAFGYG VTVAVLDTGI DYKHPQLSGK VVYCIYTVGT RLYKGTNLGN CADRNGHGTH VAGIIAASLD NVGVAGVAPK VKLIAVKVLS DSGSGYYSDI AEGIIEAVKA GATILSMSLG GPSDSSVLRD ASYWAYQQGA VQVVAAGNSG DGNPSTDNVS YPARYSWVIA VAAVDQNGVV PTWSSDGPEV DVAAPGVNIL STYPGGRYAY MSGTSMATPH VTGVVALIQA IRLAYGKRLL TPDEVYQVLT TTARDIGTPG FDVFTGYGLV DAYAAVASAL AK
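
The sequences above are printed in space-separated blocks of 10 characters, so the spaces need to be stripped before any downstream use. The sketch below shows one way to do that and to compute GC content. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGTCCAAGA TGTTGTATGT"
seq = clean_sequence(first_blocks)

print(len(seq), "bases, GC =", gc_percent(seq), "%")
```

Applying the same two helpers to the full gene sequence would give a value that can be compared with the GC content field reported in the Gene Information table.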