Gene Ssol_0167 details

Gene Information

Locus tag: Ssol_0167
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 142458
End bp: 144074
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 32%
IMG OID:
Product: Protein of unknown function DUF2070, membrane
Protein accession: ACX90463
Protein GI: 261600860
COG category:
COG ID:
TIGRFAM ID:
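
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 142458
END_BP = 144074
GENE_LENGTH_BP = 1617
PROTEIN_LENGTH_AA = 538


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one uncounted stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

With these assumptions, 144074 - 142458 + 1 gives 1617 bp, and 1617 / 3 - 1 gives 538 aa, matching the listed values.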


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACATGG ATACTGAAAA TTTGACAAGG AAATACTATG GTTATCTTAA AACTCTGCCT 
AGTATAAAAA TATTTGCAAC AACTTTTTCA GTAGAATCAT TGCTTATACT GTTAAGGAGT
TTCCAACTCA CTTTCGATTA TCTATTTTCG TTTGTGCTCT ATTCAATTCT CTTGATAATA
ATTTTTAAGA ATAAAATAAA AATAGCCTTG TTTATGATGA ATCTAACTGC GATACCTTAT
CTTCTACTTT CTCTGTTACC TATAACTCCA TTTTACGCGT TTGGATTTTT CATGCCTTTA
ATGGCGTATA TTCTCTTAGG TAGTTATAAA GAAATTCCTT CAATAGTGTT ATCTGGAATT
ACATCATATG TTCCTATAAT ATTTTACTTT AAATATTCGA TTATTTTTTT ACTATATATA
CTTATTATAG GGTTAATATT TCATTTCTAT ATATATACTG TTAATCGAAA GGGCGTAAAA
ATACTTGGGC TAAAATCAAC GCAAGTAGCT GTTCCCTTCA TAACTGCTAT AACAGAAAAG
AATAAGGTCC CTTTGGAAAA TTTTCTAAAT CTTATCTCTG TAAAAACTAC TCTGAACGTA
TTTATGTACA AATTGGACGA TTTTTTGTTT ATGATACCCC AAATACATTT CGGAGTTTTC
GATAACGTTG GTAGTTCTAG ATTTGTCTAC GATATTGAAA AAGCCTTACG AAATAACATA
GTAACAGTAT TCCATGGACC AGGTAGTCAC GAATTAGATT TACCCTCATC AGCAGAAGTA
AATAAAGTAA TAGAAGCTAT CTCAAAAAGC ACTATAGAAC GTAATGATTG GAATAAAGCG
ACTTTTTATG GCATTTCAAT CGAGAGACGC TCTACTTTTG ATATTACATC ATTAGAATTC
GATAAGTTTA GAGTATCGTT TATGGAGAGA CCAGAGTTTG GGATAGACGA TTTACCATCT
TCCCTGTGGA AATATATGTT GTCTTCAAAC AATTACCTTA TTGATTGCCA TAACTCTTTC
TTAGAAAGAG AATATGATAG TCATGAAATA AATAGTCTAA AGGACTTCAT CGCGGATCAA
AGAGGGATTA AAAGCACAAG GAGACTTATG GTAGGATATT CGGAGGGAAA GTTAGGTAAA
GCATGTGATG GATTATGCGA CAACCGTATA CGTGTCTTCA CATTTGATGA TGGAGTAAAG
AGAGTGTCCA TCGTTTACAT TTACGCTAAT AACTCAACTA AAGAACTTAA TTACGCTATA
TCTAATGCAG TAAGACAGAT TGTGGACAAG GTAATTTTAG TCACGCCAGA TGACCATTCT
TGTACTGGAG TAAGTTTAGG GATCACATAT TCTCCGGCTA CTTTTTGTGA AGATCTTGTG
AATATTGCAT CTGAACTAAT TAAAAGGTCT ACAGAGAATA TGAAGGAAAT AAATAGAGTA
GAATATAAAG TAGTTAAAAT AAAAGGTGTA AAGATACTCG GAAAAATAAT ATCTATTATG
CTTAAGGCAC TGGAAGACGT AGGAAATTAC ACTTCAAAAA CGTTCTGGAT CCCCTTGATA
ACACCGTATG TTTTACTTAT AGTTATACTA CTTTTCCAAA GCTTTATTAA ATTCTAA
 
Protein sequence
MDMDTENLTR KYYGYLKTLP SIKIFATTFS VESLLILLRS FQLTFDYLFS FVLYSILLII 
IFKNKIKIAL FMMNLTAIPY LLLSLLPITP FYAFGFFMPL MAYILLGSYK EIPSIVLSGI
TSYVPIIFYF KYSIIFLLYI LIIGLIFHFY IYTVNRKGVK ILGLKSTQVA VPFITAITEK
NKVPLENFLN LISVKTTLNV FMYKLDDFLF MIPQIHFGVF DNVGSSRFVY DIEKALRNNI
VTVFHGPGSH ELDLPSSAEV NKVIEAISKS TIERNDWNKA TFYGISIERR STFDITSLEF
DKFRVSFMER PEFGIDDLPS SLWKYMLSSN NYLIDCHNSF LEREYDSHEI NSLKDFIADQ
RGIKSTRRLM VGYSEGKLGK ACDGLCDNRI RVFTFDDGVK RVSIVYIYAN NSTKELNYAI
SNAVRQIVDK VILVTPDDHS CTGVSLGITY SPATFCEDLV NIASELIKRS TENMKEINRV
EYKVVKIKGV KILGKIISIM LKALEDVGNY TSKTFWIPLI TPYVLLIVIL LFQSFIKF
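
The sequences above are laid out in space-separated blocks of ten characters. The sketch below shows one way such blocks might be joined, the GC fraction computed, and the DNA translated, using only the first line of the gene sequence as input. All function names are hypothetical. The codon table is the standard one, whose amino-acid assignments translation table 11 shares. Alternative start codons, which table 11 permits, are not handled here.

```python
BASES = "TCAG"
# Standard codon-to-amino-acid assignments in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from the record above.
first_line = "ATGGACATGG ATACTGAAAA TTTGACAAGG AAATACTATG GTTATCTTAA AACTCTGCCT"
dna = join_blocks(first_line)
print(len(dna), round(gc_fraction(dna), 3), translate(dna))
```

Applied to the full gene sequence, the same functions could be used to compare the result against the listed GC content and protein sequence. Only the first line is checked here.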