Gene Information |
Locus tag | Ssol_2801 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 2561989 |
End bp | 2563809 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | Peptidase S53 propeptide |
Protein accession | ACX92883 |
Protein GI | 261603280 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
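The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 2561989
end_bp = 2563809

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute
# an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1821
print(remainder)       # 0
print(protein_length)  # 606
```

Under these assumptions the computed values agree with the reported Gene Length (1821 bp) and Protein Length (606 aa).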
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGTA AGAATGTAAT ATTAAAAAGG GTAATGTTAC TTCTAGTGTT GATTTTAAGC ACTACAACTT TCCTAACAAT TATAGCGCAA AGTCAAGCAC AATACTATTA TATACAAACA TCTTCTCCAC AATACACAAT AATTCCCGGA TCAGTATTTG TAGAACCCCT CAACAGTAGT CAAACCTTAT ACATAGCAGT TCTCTTAAAT TTCACTAATT TAGCCTCTTT ACAATCATAC CTTAACGAAA TTTACCTCTC TGCCCCACAG TTCCATCACT GGTTGACTCC ATCACAGTTT AGAGAATATT ATTATCCTTC AAGGTCCTAT GTAAACTCAC TAATAAAGTA TCTGGAATCT TATAACTTAC AATTTTTAGG TAATTATGGT TTAATACTAG TATTTAGTGG AACTGTGGGG AATATAGAGA AAGCATTCAA CACTTACATT AACGTTTACT ACTATCCATT CAAGAACCTC TATTGGTTTG GTCTACTAGG AATTAAGAAC ATTGGTCCAT TTTACTACTA CTCAAATAAC GTTACTCCAT CATTACCATT TAATATTGGA AAATATGTAT TAGGAGTAGT TGGGATAGAT AGTCTAGATC CCAAGGTAGT TAACGTGGTT ACACAAACAT GGCATTTACC TATGGTTAAA GCCCAAAGCG GACTGGTTTC AAAAGCCATA ATTTCACCGA TAACAATAGA GCAATATTTT AACTTTACCT TAGCCTATGA GCGAGGTTAT ACTGGCGGAG GTAGTAATAT TGCGATTGAG GGAGTACCTG AGTCCTTTGT AAACGTATCA GACATCTATA GTTTTTGGCA ACTTTATGGT ATACCTAGAA CTGGTCATCT AAACGTTATA TATTTCGGGA ATGTTACAAC TGGAGGGCAA TCAGGAGAGA ATGAGCTTGA TGCGGAATGG TCTGGTGCCT TCGCACCAGC AGCTAACGTT ACAATAGTCT TCAGTAACGG TTACGTGGGC GGTCCCCAGC TAGTGGGCAA TTTACTAAAC TATTATTATG AGTATTATTA CATGGTTAAC TACTTAAATC CTAACGTCAT TTCAATTTCT GTAACCGTTC CAGAAAGTTT TCTAGCAGCA TACTATCCAG CAATGTTAGA CATGATTCAT AACATAATGT TGCAAGCTGC AGCGCAAGGA ATTTCTGTCT TAGCAGCCTC TGGAGACTGG GGATATGAGA GTGATCACCC GCCTCCTAAT TTCCATATCG GAACATATAA TACGATATGG TACCCTGAGT CTGATCCCTA CGTAACGTCA GTTGGCGGGA TATTTCTTAA TGCGTCGTCT AATGGTAGTA TTGTGGAAAT TAGTGGGTGG GATTATAGTA CTGGAGGTAA TAGTGTTGTT TATCCAGCAC AAATTTATGA AATAACTTCA CTGATTCCAT TTACTCCCGT TATTGTAAGG ACTTATCCAG ATATCGCATT CGTCTCAGCT GGGGGTTATA ATATTCCAGA ATTCGGTTTC GGTCTGCCTT TAGTATTTCA AGGTCAATTG TTCGTATGGT ATGGAACCAG TGGAGCTGCA CCAATGACTG CTGCAATGGT AGCCTTAGCT GGTACCAGAT TAGGTGCACT CAACTTCGCA TTGTATCACA TTTCGTATCA AGGTATAATA GAATCTCCAC TAGGCAATTT TGTCGGTAAG GTTGCCTGGA TACCAATAAC TAGTGGAAAT AATCCACTTC CAGCCCATTA TGGATGGAAC TATGTCACAG GTCCAGGAAC ATATAATGCG TACGCAATGG TTTACGATTT GTTGCTATAT 
TCTGGCTTAA TTGAAAGTTA A
|
Protein sequence | MESKNVILKR VMLLLVLILS TTTFLTIIAQ SQAQYYYIQT SSPQYTIIPG SVFVEPLNSS QTLYIAVLLN FTNLASLQSY LNEIYLSAPQ FHHWLTPSQF REYYYPSRSY VNSLIKYLES YNLQFLGNYG LILVFSGTVG NIEKAFNTYI NVYYYPFKNL YWFGLLGIKN IGPFYYYSNN VTPSLPFNIG KYVLGVVGID SLDPKVVNVV TQTWHLPMVK AQSGLVSKAI ISPITIEQYF NFTLAYERGY TGGGSNIAIE GVPESFVNVS DIYSFWQLYG IPRTGHLNVI YFGNVTTGGQ SGENELDAEW SGAFAPAANV TIVFSNGYVG GPQLVGNLLN YYYEYYYMVN YLNPNVISIS VTVPESFLAA YYPAMLDMIH NIMLQAAAQG ISVLAASGDW GYESDHPPPN FHIGTYNTIW YPESDPYVTS VGGIFLNASS NGSIVEISGW DYSTGGNSVV YPAQIYEITS LIPFTPVIVR TYPDIAFVSA GGYNIPEFGF GLPLVFQGQL FVWYGTSGAA PMTAAMVALA GTRLGALNFA LYHISYQGII ESPLGNFVGK VAWIPITSGN NPLPAHYGWN YVTGPGTYNA YAMVYDLLLY SGLIES
|
| |
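The gene and protein sequences above can be related with a short translation routine. This is a minimal sketch, not the tool used to produce the record: the function names are hypothetical, the input is assumed to be the coding strand in 5' to 3' orientation as displayed, and the codon assignments are those of the standard code, which translation table 11 shares for elongation codons (the tables differ only in which start codons are permitted).

```python
# Compact codon table in the NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Raises ValueError on any base other than A, C, G, or T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGGAAAGTA AGAATGTAAT ATTAAAAAGG"
print(translate(first_blocks))  # MESKNVILKR
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.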