Gene Pisl_1412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1412 
Symbol 
ID4616977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1281223 
End bp1282227 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content62% 
IMG OID639784497 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_930913 
Protein GI119872906 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.256157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.274347 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTGG CGGGAGCTCG CCGCGGTGAA GACGCAGACG GAGAAAAACC GCGTCCCCGC 
CCTAAAGGGC GAACAGTTGT AAGTCACTGC GTTTTTGCCG ACAGATATGT TAAAAATAGT
GGCGCACCTT TGTATATGGA TTTGAGTAAT TTAGTTGAAA AGGTGGCGCG TTCTGTCGTC
GGCGTTGTGA CGAGGGGGTT TGGGGCCTTT GGCGAGGGCT TCGGCTCCGC CTTCGCCATA
GACCGGGGGG TCTACGCCAC GGCATACCAC GTCGTGGCGC AGGCGGGGGA GGTGGCGTTG
ATCACCCCCG AGGGGGAGGT GGCCGACGCC GTGGTGGCGG CGGCGGATCC CGCCGAGGAT
CTAGCCATAC TCTACTCCGA CCTCTACGCC GTCCCGCTGG CCCTTGGGAG CGCGCTGAGG
CTGAGGGTCG GGCAGGGGGT AGTCGCCGTG GGCTTCCCCC TAGCCCTCCT TGACAAGCCC
ACTGCGACCT TCGGCATCGT AAGCGCCGTG GGGAGGAGCT TGAGGGCTGG CGATAGGTTT
TTCGAGTACC TCGTCCAGAC AGACGCGGCG ATCAACCCCG GCAACTCGGG CGGCCCGCTC
GTGAACCTCT CCGGAGAGGC GGTGGGGGTC TGCTCGGCCG TAATCGCCGG GGCCCAGGGC
CTGGGCTTCG CGGTGCCTAT AGACCTAGTC AGAATCATGT ACCAGATGGT GAAGAGATAC
GGGAGATACG TAAGGCCGGC GCTCGGGGTA TACGTCGTCG CGTTGAACAA AGCTCTGAAA
GCCCTATACG GCCTCCCCAC AGACAGAGGG CTCCTCGTTG TCGACGTCAT GCCTAACTCG
CCCGCCGAAG AGATGGGCAT CGCCCGAGGC GACATCTTAA CCAAGGTCGA CAGCCGCGAG
GTGGCCAACG TCTTCGAACT CCGCCTGTTG ATAGGCGAAG CGCTGGTCCA GGGCAGAACC
CCCAGGATAG AGGTCATCAG AGGCGGAAGG AGTATAGAGC TCTAA
 
Protein sequence
MALAGARRGE DADGEKPRPR PKGRTVVSHC VFADRYVKNS GAPLYMDLSN LVEKVARSVV 
GVVTRGFGAF GEGFGSAFAI DRGVYATAYH VVAQAGEVAL ITPEGEVADA VVAAADPAED
LAILYSDLYA VPLALGSALR LRVGQGVVAV GFPLALLDKP TATFGIVSAV GRSLRAGDRF
FEYLVQTDAA INPGNSGGPL VNLSGEAVGV CSAVIAGAQG LGFAVPIDLV RIMYQMVKRY
GRYVRPALGV YVVALNKALK ALYGLPTDRG LLVVDVMPNS PAEEMGIARG DILTKVDSRE
VANVFELRLL IGEALVQGRT PRIEVIRGGR SIEL