Gene Ssol_2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2801 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2561989 
End bp2563809 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content38% 
IMG OID 
ProductPeptidase S53 propeptide 
Protein accessionACX92883 
Protein GI261603280 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGTA AGAATGTAAT ATTAAAAAGG GTAATGTTAC TTCTAGTGTT GATTTTAAGC 
ACTACAACTT TCCTAACAAT TATAGCGCAA AGTCAAGCAC AATACTATTA TATACAAACA
TCTTCTCCAC AATACACAAT AATTCCCGGA TCAGTATTTG TAGAACCCCT CAACAGTAGT
CAAACCTTAT ACATAGCAGT TCTCTTAAAT TTCACTAATT TAGCCTCTTT ACAATCATAC
CTTAACGAAA TTTACCTCTC TGCCCCACAG TTCCATCACT GGTTGACTCC ATCACAGTTT
AGAGAATATT ATTATCCTTC AAGGTCCTAT GTAAACTCAC TAATAAAGTA TCTGGAATCT
TATAACTTAC AATTTTTAGG TAATTATGGT TTAATACTAG TATTTAGTGG AACTGTGGGG
AATATAGAGA AAGCATTCAA CACTTACATT AACGTTTACT ACTATCCATT CAAGAACCTC
TATTGGTTTG GTCTACTAGG AATTAAGAAC ATTGGTCCAT TTTACTACTA CTCAAATAAC
GTTACTCCAT CATTACCATT TAATATTGGA AAATATGTAT TAGGAGTAGT TGGGATAGAT
AGTCTAGATC CCAAGGTAGT TAACGTGGTT ACACAAACAT GGCATTTACC TATGGTTAAA
GCCCAAAGCG GACTGGTTTC AAAAGCCATA ATTTCACCGA TAACAATAGA GCAATATTTT
AACTTTACCT TAGCCTATGA GCGAGGTTAT ACTGGCGGAG GTAGTAATAT TGCGATTGAG
GGAGTACCTG AGTCCTTTGT AAACGTATCA GACATCTATA GTTTTTGGCA ACTTTATGGT
ATACCTAGAA CTGGTCATCT AAACGTTATA TATTTCGGGA ATGTTACAAC TGGAGGGCAA
TCAGGAGAGA ATGAGCTTGA TGCGGAATGG TCTGGTGCCT TCGCACCAGC AGCTAACGTT
ACAATAGTCT TCAGTAACGG TTACGTGGGC GGTCCCCAGC TAGTGGGCAA TTTACTAAAC
TATTATTATG AGTATTATTA CATGGTTAAC TACTTAAATC CTAACGTCAT TTCAATTTCT
GTAACCGTTC CAGAAAGTTT TCTAGCAGCA TACTATCCAG CAATGTTAGA CATGATTCAT
AACATAATGT TGCAAGCTGC AGCGCAAGGA ATTTCTGTCT TAGCAGCCTC TGGAGACTGG
GGATATGAGA GTGATCACCC GCCTCCTAAT TTCCATATCG GAACATATAA TACGATATGG
TACCCTGAGT CTGATCCCTA CGTAACGTCA GTTGGCGGGA TATTTCTTAA TGCGTCGTCT
AATGGTAGTA TTGTGGAAAT TAGTGGGTGG GATTATAGTA CTGGAGGTAA TAGTGTTGTT
TATCCAGCAC AAATTTATGA AATAACTTCA CTGATTCCAT TTACTCCCGT TATTGTAAGG
ACTTATCCAG ATATCGCATT CGTCTCAGCT GGGGGTTATA ATATTCCAGA ATTCGGTTTC
GGTCTGCCTT TAGTATTTCA AGGTCAATTG TTCGTATGGT ATGGAACCAG TGGAGCTGCA
CCAATGACTG CTGCAATGGT AGCCTTAGCT GGTACCAGAT TAGGTGCACT CAACTTCGCA
TTGTATCACA TTTCGTATCA AGGTATAATA GAATCTCCAC TAGGCAATTT TGTCGGTAAG
GTTGCCTGGA TACCAATAAC TAGTGGAAAT AATCCACTTC CAGCCCATTA TGGATGGAAC
TATGTCACAG GTCCAGGAAC ATATAATGCG TACGCAATGG TTTACGATTT GTTGCTATAT
TCTGGCTTAA TTGAAAGTTA A
 
Protein sequence
MESKNVILKR VMLLLVLILS TTTFLTIIAQ SQAQYYYIQT SSPQYTIIPG SVFVEPLNSS 
QTLYIAVLLN FTNLASLQSY LNEIYLSAPQ FHHWLTPSQF REYYYPSRSY VNSLIKYLES
YNLQFLGNYG LILVFSGTVG NIEKAFNTYI NVYYYPFKNL YWFGLLGIKN IGPFYYYSNN
VTPSLPFNIG KYVLGVVGID SLDPKVVNVV TQTWHLPMVK AQSGLVSKAI ISPITIEQYF
NFTLAYERGY TGGGSNIAIE GVPESFVNVS DIYSFWQLYG IPRTGHLNVI YFGNVTTGGQ
SGENELDAEW SGAFAPAANV TIVFSNGYVG GPQLVGNLLN YYYEYYYMVN YLNPNVISIS
VTVPESFLAA YYPAMLDMIH NIMLQAAAQG ISVLAASGDW GYESDHPPPN FHIGTYNTIW
YPESDPYVTS VGGIFLNASS NGSIVEISGW DYSTGGNSVV YPAQIYEITS LIPFTPVIVR
TYPDIAFVSA GGYNIPEFGF GLPLVFQGQL FVWYGTSGAA PMTAAMVALA GTRLGALNFA
LYHISYQGII ESPLGNFVGK VAWIPITSGN NPLPAHYGWN YVTGPGTYNA YAMVYDLLLY
SGLIES