Gene Information |
Locus tag | Pars_1960 |
Symbol | |
ID | 5054593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1758371 |
End bp | 1760044 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640469507 |
Product | peptidase S16, lon domain-containing protein |
Protein accession | YP_001154159 |
Protein GI | 145592157 |
COG category | [R] General function prediction only |
COG ID | [COG1750] Archaeal serine proteases |
TIGRFAM ID | |
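The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1758371
end_bp = 1760044

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1674 bp
print(protein_length, "aa")  # 557 aa
```

Both results agree with the Gene Length and Protein Length rows of the record.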
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAC TGGTATTAAT CGCCGTGCTG GCGGTGTTCC TAGTCGCAAC TACCTGGCAG TCCTACACTG TGAGGGTTTC GTCTGTGGAG ATCAACGCGC TGGCTGTGGG CCCTACCGGT GGCGCTGTCC TGCCAATAGA GGTCACCCTC ATAACCCCGG GAGACGGCAG GGCATATGTC GCAGGAGTGC CTGAGGCGGG CCAAGGGTTT GGCCCCTCGG CCCAGATTGC GCTTTACACG GCGTCTCGCC TAGCCGGCGT CCCGTACGCG AACTACACAG CCCTCCTTAG GGTGTTGTCC GCGGATGCCC AGGTTGGCGG GCCGTCGGCC AGCGGCTACA TAACAGTTGC GCTCTACGCG CTGATGAAGG GGCTGAACCT TAAAAACGAC ACTGCTATGA CGGGCATCAT ACTGCCAGAT GGCCTAATAG GGCCAGTGGG AGGCGTGTCG CAAAAGGTGG GCGCTGCGGC TGAGAAGGGC ATAAAAACTG TGTTGGTGCC GATAGGGGAG GCCCCGTCGG GGGCGCAAGG GGTTAGAGTT ATTGAGATAG GAACAGTTGA GGAGGCGATT TACTACCTAA CGGGAGAGAG GATAGACACT CCTCCGCCTA GTAGCGTAGA CGACGCCGTT TTTAAGAATA TATCAAAAAA GTTGTTCAAC GACATATACA GCTACTACAA TAGCACAATT GGCCGCGGCT ATGTCGACGT TGGCTTGATA AACAAGCTGA AGGCGCAGGG CTACTACTAC GCCGCCGCTT CTTTGATATA TCAAGGGATT GTGAACTACT ACCGCGACCA AGCGGCCTCT TCTAGGAGGA CATACCGCGA TCTATACGAC AAGGCGCTTC AGATGGCCAA GGCGGCTGAG TCGGAGCTCT CAAAAATCCC CGTTACGATC AACAACCTTG ACTTAGTCGT GGCAAGCTAC ACGAGGATAT ACGAGGTCTA CTTCATGGCA AACTCGTCGA ATCCCGATGC TGGCGCTATG TACGCCAGGG CAGTAACTCT GAGTTCTTGG GTAGGCGAAG CCAGGCAGAT GGCGTACGGG CCTGCCCTTA ACGACACCGG ACTTGCGGAA GTTGCTTTGC TGTATCTAGA CTACGCCAAG ACAATGGAGG CTTATCTTGA GACCACATAC GGCGTCGTTG CCACAAGCAA CCTCCCCTCC GTCGTAGAAG TTGCCCAAGA CCTCTACCGC AGGGGTCACT ACTTGGCATC AATGGCGAAT TCCATCGAGG TAATAGCCCA GTCGGCCTCA GCCCTTATGG CCGCCGCACC GGACAAGTAC CTCACCGTGG CGAGGGAGAG GGCGTTGACA AACATGGCAA GGGCAGCCGC TTGTGGCTAT ACCAATACGT TGCCGCTTAG CTATCTGCAG TTTGGAGACT ACTACCGCAC TCAGCAAGAC GGCGCATCGC TCGCTCTTGG GTATTACATA ATGGCCTCGA TATACTCCAC GGCGATGGGC GACGCGGCGT GCACAGCCGT AAAGAGCGGC GCCGTGATAC CGAGGCCTAG CCTATCTCCT GTCTACACCA CGCTGCCTCC CACCACTGAG CAGGCCACCA CTAGGACGGC GTCTAACATG GGTGGAGAGG AAAAAACGGT GATATTGCCA CTCGTGCTGG CGTTGCTGGC TGCAGTAGCA CTTGTCTACG CCTCTAGACG GTGA
|
Protein sequence | MRKLVLIAVL AVFLVATTWQ SYTVRVSSVE INALAVGPTG GAVLPIEVTL ITPGDGRAYV AGVPEAGQGF GPSAQIALYT ASRLAGVPYA NYTALLRVLS ADAQVGGPSA SGYITVALYA LMKGLNLKND TAMTGIILPD GLIGPVGGVS QKVGAAAEKG IKTVLVPIGE APSGAQGVRV IEIGTVEEAI YYLTGERIDT PPPSSVDDAV FKNISKKLFN DIYSYYNSTI GRGYVDVGLI NKLKAQGYYY AAASLIYQGI VNYYRDQAAS SRRTYRDLYD KALQMAKAAE SELSKIPVTI NNLDLVVASY TRIYEVYFMA NSSNPDAGAM YARAVTLSSW VGEARQMAYG PALNDTGLAE VALLYLDYAK TMEAYLETTY GVVATSNLPS VVEVAQDLYR RGHYLASMAN SIEVIAQSAS ALMAAAPDKY LTVARERALT NMARAAACGY TNTLPLSYLQ FGDYYRTQQD GASLALGYYI MASIYSTAMG DAACTAVKSG AVIPRPSLSP VYTTLPPTTE QATTRTASNM GGEEKTVILP LVLALLAAVA LVYASRR
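As a rough illustration of how the two sequence fields relate, the sketch below translates the first three 10-base blocks of the gene sequence and compares the result with the start of the protein sequence. It assumes the sequence is stored as whitespace-separated blocks, as shown above. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons), so the standard assignments are used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database API.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
fragment = "ATGAGAAAAC TGGTATTAAT CGCCGTGCTG"
print(translate(fragment))  # MRKLVLIAVL
```

Applying `gc_fraction` to the full gene sequence, rather than this short fragment, is what would be comparable to the GC content row of the record.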