Gene Ssol_0013 details

Gene Information

Locus tag: Ssol_0013
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 9461
End bp: 10789
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: Peptidase A5, thermopsin
Protein accession: ACX90319
Protein GI: 261600716
COG category:
COG ID:
TIGRFAM ID:
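
The length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it is consistent with the reported 1329 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(9461, 10789)
    print(length, "bp;", protein_length_aa(length), "aa")
```

With the coordinates listed here, this gives 1329 bp and 442 aa, matching the Gene Length and Protein Length fields.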


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAAGG TTCTGCTCAT AATAATTCTA TTATTGCCAT TATCAATGCC CTTGAGTATA 
CCCACTACTT CACAACCTTC AGCTTTAGCT TTTCCCTCAG GAGTGACTAG TTATCCTTTG
AATACAATAA TCTATACAGA TTTCGTTATG GGTAGGATCA ATATTTCATA TTTAAATATA
GGTAGCTCGT ACTTACCAGG AGGAGAATAT TTCACTACTG GAAACGCATC GCTTCAGTTA
AACGCTATGG TATTAGGAGA GTATTGGGCA CAAAATGTGA TTCTATTTCA TCAAATATCA
AATAATACCT TTTATGCTAC ACTGATAGTA AATCTTTGGA ATCTTTCTGG CCCCTTTAGT
AATACAACAA GTAATTCGTT AGTATATCAA GGTCTAGGTG TAATTTGCTA TCAAGGTCCA
ACGTTCAAGG TAACCTTACC CCTTTCCATT AGCCTATTTA TGGAAATAGT TAATTCTACA
TTAAACTTTG GATATAATAT AAATGGGCAG AAGGGAATCT ATTTCAGATA CCCTATAATA
GGTTTATTCC AGTTAGGTGG TCTTTCACTA TTAGGGTTGC CAAATGATCT AGAGTTAGTT
TGGGGAGGAC CAGGTGGTGG AAGTGTGGTA TTTATGAATG TGAGTAGTAT AGCCAATTTG
TACTATTTCA ATGGGAATAC TTTAACTATT GTACCCAACG CTTACTCTAT AGGATTTGAT
ACTGCAGAAT CGGCTTACGG GGTAAAGGTA TACTCCACTT TTCCTAGTGT ATTTTCACCT
ATAGTGATTG AGACAAGTGG GGTTAACGTA CCTTCAGTAT TATGGCCAAT TCCTCCCCAC
GTTTTAGTTA ATCAGACTAG TAATAAAATA ACTGTGAAGT TGTCCATAAG TAATAAGTCC
TTATCAGGGC AAGCGGTCTA TTTGGAAACC GGATTTCCTC CTTCGGTCAT ATCTTCTGCA
GTGACAAATT CCTCTGGAAT TGCGGTATTT CCTAATAACA ATTATTCGTT TTATGTAGTT
TATTTTCCAG GCAATTTCAC TCTATCTTCG ACCTACTACT TCTCCTCACC AATCCTTAAT
TCACTTTCTA GTAAGTTTCG ATCTTATTAC CAAGATTTAT TGAATTTTCT AAACTCGGCC
CAGAATTCCT TTAAGAAAGG TATAAAGTCT GTACTATCTA AGCAAGAAAC TTCCATAACT
ACTACCACGT TAACTTCTAC TACTTCAAGT TCTTCCCAAT TTGGGGTTAA CTTGTATATC
GTACTTTATA TCTTAGCTTT TGTAATAGGT ATGGTAATTT CAGCAATATT AATAAGGTTC
AAATTATAG
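
The gene sequence above is printed in blocks of ten bases, so the whitespace has to be removed before any calculation. As a minimal sketch, GC content could be computed as follows. The helper names are hypothetical, and only the first row of the sequence is used as sample input; pasting the full sequence into the same function would give the value for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as printed (60 bases).
FIRST_ROW = "ATGTATAAGG TTCTGCTCAT AATAATTCTA TTATTGCCAT TATCAATGCC CTTGAGTATA"

if __name__ == "__main__":
    print(len(clean_sequence(FIRST_ROW)), "bases")
    print(round(gc_percent(FIRST_ROW), 1), "% GC in the first row")
```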
 
Protein sequence
MYKVLLIIIL LLPLSMPLSI PTTSQPSALA FPSGVTSYPL NTIIYTDFVM GRINISYLNI 
GSSYLPGGEY FTTGNASLQL NAMVLGEYWA QNVILFHQIS NNTFYATLIV NLWNLSGPFS
NTTSNSLVYQ GLGVICYQGP TFKVTLPLSI SLFMEIVNST LNFGYNINGQ KGIYFRYPII
GLFQLGGLSL LGLPNDLELV WGGPGGGSVV FMNVSSIANL YYFNGNTLTI VPNAYSIGFD
TAESAYGVKV YSTFPSVFSP IVIETSGVNV PSVLWPIPPH VLVNQTSNKI TVKLSISNKS
LSGQAVYLET GFPPSVISSA VTNSSGIAVF PNNNYSFYVV YFPGNFTLSS TYYFSSPILN
SLSSKFRSYY QDLLNFLNSA QNSFKKGIKS VLSKQETSIT TTTLTSTTSS SSQFGVNLYI
VLYILAFVIG MVISAILIRF KL
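
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string used for the standard genetic code. Translation table 11 assigns the same amino acids and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The function name and the choice of "X" for unrecognised codons are illustrative assumptions.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases and last 9 bases of the gene sequence, as printed above.
    print(translate("ATGTATAAGG TTCTGCTCAT AATAATTCTA"))
    print(translate("AAATTATAG"))
```

The two fragments translate to MYKVLLIIIL and KL, which agree with the first ten and last two residues of the protein sequence.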