Gene Ssol_0663 details

Gene Information

Locus tag: Ssol_0663
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 612431
End bp: 613831
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: selenium-binding protein
Protein accession: ACX90933
Protein GI: 261601330
COG category:
COG ID:
TIGRFAM ID:
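
The coordinate and length fields in this record are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(612431, 613831)
print(length)                  # 1401
print(protein_length(length))  # 466
```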


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.151689
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATTAC CGTCAGTTTT TGCTCCCTTT AAGAGAGATC CAACGTTTTA TCCATCTCCA 
AAGATGGCCA TGAAATCACC TCCAGAGGAC TTAGCTTATG TTGCTTGCTT ATACACTGGA
ACTGGAATAA ATAGGCCAGA TTTCATAGCA GTAGTTGATG TAAAGCCTGA ATCGGAAACT
TATTCTAAGG TAATCCATAA GGTTGAATTA TCATATGTCA ATGATGAGCT GCACCATTTT
GGTTGGAATG CTTGTAGTTC CGCCTTATGC CCTAATGGAA GACCAAATTT TGAGAGAAGA
TTCTTAGTTG TACCCGGTTT ACGTTCCTCA AGGATTTATA TAGTAGATAC AAAATTAAAC
CCTAGACAGC CTAATATAGT TAAAACTATA GAACCAGAGG AGGTTAAGAA AGTAACGGGC
TACAGTAGGC TACATACAGT ACATTGTGGG CCAGATGGTA TCTACATAAG TGCTTTTGGC
AATGAAAACG GTGAGGGTCC AGGAGGAATT TTAATGTTAG ACCATTACAG TTTTGAACCT
TTAGGCAAGT GGGAGATAGA TAGGAGTGAC CAATATTTGG CTTACGATTT CTGGTGGAAT
TTACCAAATG AAGTAATGGT AACTAGTGAG TGGGCAGTGC CAAACACTAT TGAGAACGGG
CTTCGATTGG AACATCTTAA AGATAGATAT GGAAATAGGA TACACTTCTG GGACTTGAGG
AGAAGAAAGA AGGTATCAAG CGTAACCCTT GGTGAAGAGA ATAGGATGGC GTTAGAGCTT
AGACCCCTAC ATGACCCAAC TAAACTCATG GGATTCATAA ATATGGTAGT AAGCCTAAAG
GATCTGAGCA GTTCAATCTG GTTATGGTAC TACGAAGATG GTAAATGGAA TGGGGAAAAG
GTTATTGAAA TCCCTGCGGA ACCTACTGAG GGAGGACTGC CTGAGATATT GAAACCATTT
AAGGCTGTAC CACCATTAGT TACTGATATA GACTTAAGCC TTGATGATAA GTTCCTTTAC
GTTAGCTTAT GGGGTATAGG AGAGATTAGG CAGTACGACG TGAGTAATCC ATTTAAACCA
GTACTTACTG GAAAGGTAAA ATTGGGAGGT ATATTTCATA GGGCTGACCA TCCCTCAGAT
CATAAACTTA CTGGAGCTCC TCAGATGATT GAAATCAGTA GGGACGGAAA AAGAGTTTAC
GTTACCAATT CCCTATATAG TACTTGGGAT AATCAATTCT ATCCAGAGGG CTTAAAGGGA
TGGATGGTTA AACTAAATGC TAATCCAGAT GGAGGTCTAG ATGTGGATAA GGAGTTCTTC
GTGGATTTTG GAGAGGCTAG GTCGCATCAA GTTAGGTTAA GGGGAGGAGA TGCTTCCTCT
GACTCTTATT GCTATCCTTA G
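
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before the sequence is used. A minimal sketch of that cleanup, plus a GC-fraction calculation, is shown here. The helper names are hypothetical, and only the first row of the sequence is used as input to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence above, as displayed.
first_row = "ATGGAATTAC CGTCAGTTTT TGCTCCCTTT AAGAGAGATC CAACGTTTTA TCCATCTCCA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `clean_sequence` in the same way should give a length that agrees with the Gene Length field.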
 
Protein sequence
MELPSVFAPF KRDPTFYPSP KMAMKSPPED LAYVACLYTG TGINRPDFIA VVDVKPESET 
YSKVIHKVEL SYVNDELHHF GWNACSSALC PNGRPNFERR FLVVPGLRSS RIYIVDTKLN
PRQPNIVKTI EPEEVKKVTG YSRLHTVHCG PDGIYISAFG NENGEGPGGI LMLDHYSFEP
LGKWEIDRSD QYLAYDFWWN LPNEVMVTSE WAVPNTIENG LRLEHLKDRY GNRIHFWDLR
RRKKVSSVTL GEENRMALEL RPLHDPTKLM GFINMVVSLK DLSSSIWLWY YEDGKWNGEK
VIEIPAEPTE GGLPEILKPF KAVPPLVTDI DLSLDDKFLY VSLWGIGEIR QYDVSNPFKP
VLTGKVKLGG IFHRADHPSD HKLTGAPQMI EISRDGKRVY VTNSLYSTWD NQFYPEGLKG
WMVKLNANPD GGLDVDKEFF VDFGEARSHQ VRLRGGDASS DSYCYP
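
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hand-written translator rather than the tool used to build this record. It relies on translation table 11 assigning the same amino acid to each codon as the standard genetic code (the two tables differ in their permitted start codons), and the `translate` helper is a hypothetical name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGAATTAC CGTCAGTTTT TGCTCCCTTT"))  # MELPSVFAPF
```

The output matches the first block of the protein sequence. Running the full gene sequence through `translate` should likewise give a result that agrees with the Protein Length field.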