Gene Pars_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1960 
Symbol 
ID5054593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1758371 
End bp1760044 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content56% 
IMG OID640469507 
Productpeptidase S16, lon domain-containing protein 
Protein accessionYP_001154159 
Protein GI145592157 
COG category[R] General function prediction only 
COG ID[COG1750] Archaeal serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAC TGGTATTAAT CGCCGTGCTG GCGGTGTTCC TAGTCGCAAC TACCTGGCAG 
TCCTACACTG TGAGGGTTTC GTCTGTGGAG ATCAACGCGC TGGCTGTGGG CCCTACCGGT
GGCGCTGTCC TGCCAATAGA GGTCACCCTC ATAACCCCGG GAGACGGCAG GGCATATGTC
GCAGGAGTGC CTGAGGCGGG CCAAGGGTTT GGCCCCTCGG CCCAGATTGC GCTTTACACG
GCGTCTCGCC TAGCCGGCGT CCCGTACGCG AACTACACAG CCCTCCTTAG GGTGTTGTCC
GCGGATGCCC AGGTTGGCGG GCCGTCGGCC AGCGGCTACA TAACAGTTGC GCTCTACGCG
CTGATGAAGG GGCTGAACCT TAAAAACGAC ACTGCTATGA CGGGCATCAT ACTGCCAGAT
GGCCTAATAG GGCCAGTGGG AGGCGTGTCG CAAAAGGTGG GCGCTGCGGC TGAGAAGGGC
ATAAAAACTG TGTTGGTGCC GATAGGGGAG GCCCCGTCGG GGGCGCAAGG GGTTAGAGTT
ATTGAGATAG GAACAGTTGA GGAGGCGATT TACTACCTAA CGGGAGAGAG GATAGACACT
CCTCCGCCTA GTAGCGTAGA CGACGCCGTT TTTAAGAATA TATCAAAAAA GTTGTTCAAC
GACATATACA GCTACTACAA TAGCACAATT GGCCGCGGCT ATGTCGACGT TGGCTTGATA
AACAAGCTGA AGGCGCAGGG CTACTACTAC GCCGCCGCTT CTTTGATATA TCAAGGGATT
GTGAACTACT ACCGCGACCA AGCGGCCTCT TCTAGGAGGA CATACCGCGA TCTATACGAC
AAGGCGCTTC AGATGGCCAA GGCGGCTGAG TCGGAGCTCT CAAAAATCCC CGTTACGATC
AACAACCTTG ACTTAGTCGT GGCAAGCTAC ACGAGGATAT ACGAGGTCTA CTTCATGGCA
AACTCGTCGA ATCCCGATGC TGGCGCTATG TACGCCAGGG CAGTAACTCT GAGTTCTTGG
GTAGGCGAAG CCAGGCAGAT GGCGTACGGG CCTGCCCTTA ACGACACCGG ACTTGCGGAA
GTTGCTTTGC TGTATCTAGA CTACGCCAAG ACAATGGAGG CTTATCTTGA GACCACATAC
GGCGTCGTTG CCACAAGCAA CCTCCCCTCC GTCGTAGAAG TTGCCCAAGA CCTCTACCGC
AGGGGTCACT ACTTGGCATC AATGGCGAAT TCCATCGAGG TAATAGCCCA GTCGGCCTCA
GCCCTTATGG CCGCCGCACC GGACAAGTAC CTCACCGTGG CGAGGGAGAG GGCGTTGACA
AACATGGCAA GGGCAGCCGC TTGTGGCTAT ACCAATACGT TGCCGCTTAG CTATCTGCAG
TTTGGAGACT ACTACCGCAC TCAGCAAGAC GGCGCATCGC TCGCTCTTGG GTATTACATA
ATGGCCTCGA TATACTCCAC GGCGATGGGC GACGCGGCGT GCACAGCCGT AAAGAGCGGC
GCCGTGATAC CGAGGCCTAG CCTATCTCCT GTCTACACCA CGCTGCCTCC CACCACTGAG
CAGGCCACCA CTAGGACGGC GTCTAACATG GGTGGAGAGG AAAAAACGGT GATATTGCCA
CTCGTGCTGG CGTTGCTGGC TGCAGTAGCA CTTGTCTACG CCTCTAGACG GTGA
 
Protein sequence
MRKLVLIAVL AVFLVATTWQ SYTVRVSSVE INALAVGPTG GAVLPIEVTL ITPGDGRAYV 
AGVPEAGQGF GPSAQIALYT ASRLAGVPYA NYTALLRVLS ADAQVGGPSA SGYITVALYA
LMKGLNLKND TAMTGIILPD GLIGPVGGVS QKVGAAAEKG IKTVLVPIGE APSGAQGVRV
IEIGTVEEAI YYLTGERIDT PPPSSVDDAV FKNISKKLFN DIYSYYNSTI GRGYVDVGLI
NKLKAQGYYY AAASLIYQGI VNYYRDQAAS SRRTYRDLYD KALQMAKAAE SELSKIPVTI
NNLDLVVASY TRIYEVYFMA NSSNPDAGAM YARAVTLSSW VGEARQMAYG PALNDTGLAE
VALLYLDYAK TMEAYLETTY GVVATSNLPS VVEVAQDLYR RGHYLASMAN SIEVIAQSAS
ALMAAAPDKY LTVARERALT NMARAAACGY TNTLPLSYLQ FGDYYRTQQD GASLALGYYI
MASIYSTAMG DAACTAVKSG AVIPRPSLSP VYTTLPPTTE QATTRTASNM GGEEKTVILP
LVLALLAAVA LVYASRR