Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1412 |
Symbol | |
ID | 4616977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 1281223 |
End bp | 1282227 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639784497 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_930913 |
Protein GI | 119872906 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.256157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.274347 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTGG CGGGAGCTCG CCGCGGTGAA GACGCAGACG GAGAAAAACC GCGTCCCCGC CCTAAAGGGC GAACAGTTGT AAGTCACTGC GTTTTTGCCG ACAGATATGT TAAAAATAGT GGCGCACCTT TGTATATGGA TTTGAGTAAT TTAGTTGAAA AGGTGGCGCG TTCTGTCGTC GGCGTTGTGA CGAGGGGGTT TGGGGCCTTT GGCGAGGGCT TCGGCTCCGC CTTCGCCATA GACCGGGGGG TCTACGCCAC GGCATACCAC GTCGTGGCGC AGGCGGGGGA GGTGGCGTTG ATCACCCCCG AGGGGGAGGT GGCCGACGCC GTGGTGGCGG CGGCGGATCC CGCCGAGGAT CTAGCCATAC TCTACTCCGA CCTCTACGCC GTCCCGCTGG CCCTTGGGAG CGCGCTGAGG CTGAGGGTCG GGCAGGGGGT AGTCGCCGTG GGCTTCCCCC TAGCCCTCCT TGACAAGCCC ACTGCGACCT TCGGCATCGT AAGCGCCGTG GGGAGGAGCT TGAGGGCTGG CGATAGGTTT TTCGAGTACC TCGTCCAGAC AGACGCGGCG ATCAACCCCG GCAACTCGGG CGGCCCGCTC GTGAACCTCT CCGGAGAGGC GGTGGGGGTC TGCTCGGCCG TAATCGCCGG GGCCCAGGGC CTGGGCTTCG CGGTGCCTAT AGACCTAGTC AGAATCATGT ACCAGATGGT GAAGAGATAC GGGAGATACG TAAGGCCGGC GCTCGGGGTA TACGTCGTCG CGTTGAACAA AGCTCTGAAA GCCCTATACG GCCTCCCCAC AGACAGAGGG CTCCTCGTTG TCGACGTCAT GCCTAACTCG CCCGCCGAAG AGATGGGCAT CGCCCGAGGC GACATCTTAA CCAAGGTCGA CAGCCGCGAG GTGGCCAACG TCTTCGAACT CCGCCTGTTG ATAGGCGAAG CGCTGGTCCA GGGCAGAACC CCCAGGATAG AGGTCATCAG AGGCGGAAGG AGTATAGAGC TCTAA
|
Protein sequence | MALAGARRGE DADGEKPRPR PKGRTVVSHC VFADRYVKNS GAPLYMDLSN LVEKVARSVV GVVTRGFGAF GEGFGSAFAI DRGVYATAYH VVAQAGEVAL ITPEGEVADA VVAAADPAED LAILYSDLYA VPLALGSALR LRVGQGVVAV GFPLALLDKP TATFGIVSAV GRSLRAGDRF FEYLVQTDAA INPGNSGGPL VNLSGEAVGV CSAVIAGAQG LGFAVPIDLV RIMYQMVKRY GRYVRPALGV YVVALNKALK ALYGLPTDRG LLVVDVMPNS PAEEMGIARG DILTKVDSRE VANVFELRLL IGEALVQGRT PRIEVIRGGR SIEL
|
| |