Gene Ssol_0017 details


Gene Information

Locus tag: Ssol_0017
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 13412
End bp: 16450
Gene Length: 3039 bp
Protein Length: 1012 aa
Translation table: 11
GC content: 32%
IMG OID:
Product: conserved hypothetical protein
Protein accession: ACX90323
Protein GI: 261600720
COG category:
COG ID:
TIGRFAM ID:
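
The start, end, and length fields above are related by simple arithmetic. A minimal sketch of that consistency check follows, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that adds no residue to the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 13412
end_bp = 16450

# Assumption: 1-based inclusive coordinates, so both endpoints count as bases.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the terminal stop codon
# is assumed not to contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 3039 0 1012
```

Under these assumptions the result agrees with the listed Gene Length (3039 bp) and Protein Length (1012 aa).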


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.164377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTTGTT CTAAGAGAAT AAGCGAAACA GCATCTGACT CCTCTTATAT CTATCAAATT 
TTCTCTTGCA TTACAGAGAA CTCATTATAC ATAAATAGAT TAAGATTTTT CAAGCAAGTT
AAGGATGAGA TAGACAGGAT ATCTAAGATT AAACAATCCC CAGAAATAAT ATCGCTTGTC
GGCGATTGGG GACAAGGAAA AAGTACGTTC CTTGACATCA TTGAGGAATA TGCAAAAAAT
AGAAATATTA ATGTAATAAA GATACCCTTT GTAGAACTTT TAAGCCGAAC CGAAGATTTG
TTAACGTTCA AAAATAATGT GTACCTTATT GATGAAGTGG AAAGTTCTGT AGATTACTTC
GCAGAGTATC AAAACGAGAT AAAGGACTTT TGGAGCAAGG TAAAGGAATT AGCTAATTCA
ACTGGAAATT CAATAATATA CCTTTCCATG ACCCCAAGCG CATATTCTAA AATATTTGGA
ACTGGAGGCA TAATTTATAA CTTATTTTCA GAAACTTACC CTTCACTTCT AGAAAGAATA
AGAAAAGTTA GTATAGAAAA TCCATCTAAA TTAGAGTTTC TATTAATGTT AAAGTGTATG
CTAAATATGG CTAACATAAA TGACTTAAAG ATACTACAGT ACATGGATTT GCCTTACTGG
GTAATAGATC AAGAAAGAAG AAAATACGTA AAGTTCTTCA ACGATATAGT ATGCGATAAT
CTTCCTAACG TAGATAGAAT TTTCAACGAA CTGGCTAGAT CGGATAAGGG AATTAATCTA
AACTCAGAAG GAGAGACCGT TAGACTAGAT ATGTTAACAA AATTAGAGAA CGAAATGGAC
TCCCAAGAGT TAAGTAAATT ATACAAAGTC TTAATGAGCA GAATATTTAC TGACGAAAAA
CTAGTCATAA AGAAATTAGA AGGTCATGTA ATAAAAGGGG TACTGATACC ATATCTAAAA
TGGATTGAGA TGTTCCCTAA AGGTCAAGAA TACGTTGAGG ACTTTCTTTT AACATATTAT
CAAGACGATT TTCATGTATT TATATCCGAC AATATAGAAA CTATACTCCA TGAAAGCATA
GATATAAGTA AAATAAAGGA GAATGTAAAA AAACTATCAC TATTCGGTAA AACTGATGCA
TATGCAATAT CCTGGAGCTT TTTTGAGAGC ATAGCGAATA CTAATATAGG AGGATTAATA
GTAGAGTTTA AGTCAAGAGA AATTAGAGAT AAAGCCTTAC AGTTCGTGAA TACGTATATT
ACAGATAGAG AGAAGGAACT TGAATCATTA GAATATTTAA TGGAAGTACT AGGAATAAAG
GTAGATAGTG TAAATAGAAG CAAAGACTAC ATAAGGTTTT TAAAACTTAT AATGGATAAT
AAGAAAATTA CTATTATTTT GGCAAATCCA GCTAACGAGG ATGAAATTAA GAACTTAATA
AAAGAGATCA ACGAAAGTGA CGAGCTAATT CACGGTCTTA TTCTAATAGA GCCTCAAATA
AGGAAGGAAG AACTATCCAA AACTCTAGAC GGACTATCAA TACCGTTGAT CGAACTAAAA
ATGACCACAC CAAAGAAAAG ACAGTTGCTA TATCTACTAT TCTCTAAAAT CTACGGACAA
AGTAGAATTA GGCTTGATTC CATAGAACTA AGACTAGGAG ATCTGAAAAA TTCGATATCC
TCCCTACTGC TAAAAATAAG AGATAATCTT AACTTGAACC AACTTCCCAT ACCTAGAAAT
AAGAGATTAA TACAATCATT CAATTGGATA ATATTTTACC CATCAATAAA AATGGTAAAT
GCTAATGAGT TATTCGAGAA AGTAAATGAG ATAATTAATG AGAAATTTAG AATGTATGGA
AGTAAGCAGT TCCATTTAGA AGATATAGAA ACATCCAACA CGTTTGTTGA TGACATTATT
ACTTACTTCT ATGGTAATCG TATAATAAAG ATTAGATCCA ATTACATAGA TTTCGAAGAC
CTAGCGGGAG AATCGTTATC CTCATTCGCT AAGCTGTTTG CGGGTCTTAT TAGGCAAAAA
TACAAGCAAG AAGCAGAGGA GGTTGTATTT AATTATATAA TGTATTATGT AAGTCCACAA
GATAACAAAA GAAAAGATAA TAAAAATAAT CCCTTAGTCT TTGCGTATCA AATTTTTAGC
CCCGATAAAA AAATTGGACA AAATCCCACC TTAGACTTCT TAGTTTACTC GTCAATAGTT
AGTGGCGAAA TTGCTAAATA TTTAAATAAA GATGTAATTT ATCTGAAAAT AGACGAGCAA
ATAAGAAAAA TAAAGGAAAA ACTAGATAAT CCATACTCGA CCTATGGGTA CTTCATAACA
GCTAAAAAGA GGGGAGCTGC AATAAGAAGC CTTGAGGAAA TGAGAGAGGT CATAGAAGCT
TATGAGAAAT CTTGCACTGA AAATAAAGAC ATAAGGCTAT GTTATGATTA TCTATATTTA
TCGAATATAT ATTTGGAATT ATTAAGAAAG ACCGAAGAAT CAGTGATTGA GACAGACAAG
ATCGTTGAGG AAATCTACAA AAAATTGGAG GTTGTAGAGA AAGCCAAGAG GCATGTAAAG
ATAAACGAGA AAATTGAGGA AATAGAGAAG GTATATGAGA TAATTCATGA ACTTAAAGAC
AACTTCAAAA TGCAAATGGA CAAGTTAGTG AGGAAAATTC AAGAGATAAA TGAGAGGGGA
CAAACAGAGT CGTTTAAACG ATATCTCGAT TACTTACTTG CTACTATACA CGTTGAAGAT
AACTCAAACC TTTATTTCAT ATTATTCAAA TTGCTGAAAG AGATACTAAA TGGCGTATCA
ATATCTGGAG ATGAGTTAAA AGATACGATA ATTGAGGAAA TCGCATCATT AGGTAAAGTC
GGTATCCAAT TAAATAATAT TGAGATGATT GTAAATGATC TAGAAAAGAT AAGTCCAGAA
TTACCTAAGT TGAGAGAAAA CGTTGAGAGA AATACACAAA AAATAACACA ATTAATTCAA
GAAATAAAGG AGGTTCTAGA AGAGTATGGA TTTAGCTGA
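
The GC content listed in the Gene Information fields can be recomputed from the gene sequence. The sketch below is one way to do it, assuming the figure is the plain fraction of G and C bases over the full gene; how the database rounds the value is not stated in the record. The function name `gc_fraction` is hypothetical.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, pasted with its 10-base grouping intact.
first_line = "GTGAGTTGTT CTAAGAGAAT AAGCGAAACA GCATCTGACT CCTCTTATAT CTATCAAATT"
print(f"{gc_fraction(first_line):.1%}")
```

To check the record's value, paste all lines of the gene sequence into one string and pass it to `gc_fraction`; the whitespace between blocks is stripped before counting.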
 
Protein sequence
MSCSKRISET ASDSSYIYQI FSCITENSLY INRLRFFKQV KDEIDRISKI KQSPEIISLV 
GDWGQGKSTF LDIIEEYAKN RNINVIKIPF VELLSRTEDL LTFKNNVYLI DEVESSVDYF
AEYQNEIKDF WSKVKELANS TGNSIIYLSM TPSAYSKIFG TGGIIYNLFS ETYPSLLERI
RKVSIENPSK LEFLLMLKCM LNMANINDLK ILQYMDLPYW VIDQERRKYV KFFNDIVCDN
LPNVDRIFNE LARSDKGINL NSEGETVRLD MLTKLENEMD SQELSKLYKV LMSRIFTDEK
LVIKKLEGHV IKGVLIPYLK WIEMFPKGQE YVEDFLLTYY QDDFHVFISD NIETILHESI
DISKIKENVK KLSLFGKTDA YAISWSFFES IANTNIGGLI VEFKSREIRD KALQFVNTYI
TDREKELESL EYLMEVLGIK VDSVNRSKDY IRFLKLIMDN KKITIILANP ANEDEIKNLI
KEINESDELI HGLILIEPQI RKEELSKTLD GLSIPLIELK MTTPKKRQLL YLLFSKIYGQ
SRIRLDSIEL RLGDLKNSIS SLLLKIRDNL NLNQLPIPRN KRLIQSFNWI IFYPSIKMVN
ANELFEKVNE IINEKFRMYG SKQFHLEDIE TSNTFVDDII TYFYGNRIIK IRSNYIDFED
LAGESLSSFA KLFAGLIRQK YKQEAEEVVF NYIMYYVSPQ DNKRKDNKNN PLVFAYQIFS
PDKKIGQNPT LDFLVYSSIV SGEIAKYLNK DVIYLKIDEQ IRKIKEKLDN PYSTYGYFIT
AKKRGAAIRS LEEMREVIEA YEKSCTENKD IRLCYDYLYL SNIYLELLRK TEESVIETDK
IVEEIYKKLE VVEKAKRHVK INEKIEEIEK VYEIIHELKD NFKMQMDKLV RKIQEINERG
QTESFKRYLD YLLATIHVED NSNLYFILFK LLKEILNGVS ISGDELKDTI IEEIASLGKV
GIQLNNIEMI VNDLEKISPE LPKLRENVER NTQKITQLIQ EIKEVLEEYG FS
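
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table in the compact NCBI layout (bases ordered T, C, A, G); translation table 11 uses the same codon-to-residue mapping as the standard code for elongation. Treating a leading GTG or TTG as methionine is an assumption made here to cover the GTG start of this gene, and `translate` is a hypothetical helper, not a tool used by the database.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: these alternative start codons are read as Met at position 0.
ALT_STARTS = {"GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in ALT_STARTS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("GTGAGTTGTT CTAAGAGAAT AAGCGAAACA"))  # MSCSKRISET
print(translate("TTTAGCTGA"))                         # FS
```

The two fragments reproduce the first ten residues (MSCSKRISET) and the last two residues (FS) of the protein sequence listed above.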