Gene Ssol_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2047 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1836811 
End bp1838079 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content44% 
IMG OID 
Producttransposase, IS605 OrfB family 
Protein accessionACX92253 
Protein GI261602650 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0187129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGAAGA ACTTAAGAAT TAGAAAATTT GAACCGGAAG AGGAATACGT GCACTTCACG 
TACTCTATCA AGAATAGTGA GAGGGAGAAG AGCAAAGAGT TAATTAAAGA ATACAGAACA
CTACTACAGA AAGCAATTGA CTACCTGTGG AATTTAACGA AAATACAAGT AAGAAAAAAG
AACGGTAATT ACAAGATAAC ACTACCGAAG AAGAAGGAAG TGTACAAACC ACTTAGGGAA
GAGTTGGAGA AGATCAACCA CCTCGCGTCA CACTACGTCG ATAAGGCAAT TAATGACGCA
TTCTCGATCT TGAAGTCGTG GAGGAAAAGG GCCATAAAGG GGAGAGCTTC GATTGAAAAA
CCAAGGGTGA AGAAGGCTTA CGTTAGGATA AAGACGACTC TGAGGAAGGT TGTGGGGGAA
AGCGTTAGAA TAACTGTAAG ACCTCACGAG TACATCACCT TCCCGTGGAG TAAGTCATGG
TTCTCAAGAA GGGTTAGGGA GTTGGAACTT GGCGAACCTA TAATTAAGGA GGAGAAAGTG
TATTTGCCAT TTCGTTACAA GTTACCGTGG GTAACACCAG TGAACTTTCT AGCTATTGAC
TCCAACCTTT ATACTCTAGA TGCTTATGAT GGTGAGAAAT TCGTTACAAT CTCTCTGAAG
CAGTTGTACT CCCTTAAGTA CTCTATGGAG GTGAAGAGGG CTAAGGTGCA ATCATTTGCA
TCAAAGCACA CGAAGAGGGG GAGAGAGTTG TTAAGGAAGT ATTCGCATAG GGAGAGGAAT
CGCGTTCTGG ACTTCGTTCA CAAGTTTGTA AACACTTTGT TGGACTTGTA CCCCATGACG
TTTTTCGCTG TGGAAAAGCT TAACAAAGAG AGTATGTTTA AGGATGCTAA TGGCTCTCTT
TCGAGGAAGA TTTCTAGGAC TGTTTGGAGG AGTATACATA GAGTGTTGAA GTACAAGGCT
CCGCTTTACG GTTCTTTCGT TAAGGAAGTG AACCCACACC TCACCTCGAG GTCTTGCCCC
AGATGTGGGT TTGTATCCCG AAAGGTTGGT AAGACCTTTG AGTGTGAGAG GTGTGGGTTC
AAGTTGGATA GGCAACTGAA CGCGTCACTG AATATTTATC TCAAGATGTG CGGTTTTCCT
CACATCCGTG AAATAGCGCG GGTGTGGGTT GGGGTTATCC CGCTAATGGG GCGGAGAGGG
ATGAACGTCC GCGACTTCGG TGAAGCCCAA GGGCTGAGGA TTGATATTAA ATATCATGAA
ATCCCATGA
 
Protein sequence
MLKNLRIRKF EPEEEYVHFT YSIKNSEREK SKELIKEYRT LLQKAIDYLW NLTKIQVRKK 
NGNYKITLPK KKEVYKPLRE ELEKINHLAS HYVDKAINDA FSILKSWRKR AIKGRASIEK
PRVKKAYVRI KTTLRKVVGE SVRITVRPHE YITFPWSKSW FSRRVRELEL GEPIIKEEKV
YLPFRYKLPW VTPVNFLAID SNLYTLDAYD GEKFVTISLK QLYSLKYSME VKRAKVQSFA
SKHTKRGREL LRKYSHRERN RVLDFVHKFV NTLLDLYPMT FFAVEKLNKE SMFKDANGSL
SRKISRTVWR SIHRVLKYKA PLYGSFVKEV NPHLTSRSCP RCGFVSRKVG KTFECERCGF
KLDRQLNASL NIYLKMCGFP HIREIARVWV GVIPLMGRRG MNVRDFGEAQ GLRIDIKYHE
IP