Gene Ssol_2444 details


Gene Information

Locus tag: Ssol_2444
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 2247106
End bp: 2248374
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 43%
IMG OID:
Product: transposase, IS605 OrfB family
Protein accession: ACX92594
Protein GI: 261602991
COG category:
COG ID:
TIGRFAM ID:
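
The start and end coordinates, gene length, and protein length listed in the gene information are mutually consistent. The short Python sketch below checks this. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted as a residue; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2247106
end_bp = 2248374

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1269
print(protein_length)  # 422
```

Both values agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields.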


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGAAGA ACTTAAGAAT TAGAAAATTT GAACCGGAAG AGGAATACGT ATATTTCACG 
TACTCCATCA AGAATAGTGA GAGGGAGAAG AGCAAAGAGT TAATTAAAGA ATACAGAACA
CTACTACAGA AAGCAATTGA CTACCTGTGG AGCTTAACGA AAATACAAGT AAGAAAAAAG
AACGGTAATT ACAAGATAAC ACTACCGAAG AAGAGGGAGG TGTACAAACC ACTTAGGGAC
GAACTAGAGA AAATCAATCA CCTCGCGTCA CACTACGTCG ATAAGGCAAT TAATAACGCA
TTCTCAATCA TCACATCATG GAGGAAAAGG GCCATAAAGG GGAGAGCTTC GATTGAAAAA
CCTACGTTAA AGAAGGCTTA CGTTAAGGTT AAGTCTACAC TTAGGAAGGT TGTTGGGGAA
AGCGTTAGGA TAACTGTAAG ACCTCATGAG TACATCACCT TCTCGTGGAG TAAGTCATGG
TTCTCAAGAA GGGTTAGGGA GTTGGAACTT GGTGAACCTA TAATTAAGGA GGAGAAGGTT
TACCTACCAT TTCGTTACAA GTTACCTTGG GCAACACCAG TGAACTTCCT GGCTATTGAC
TCCAACCTTT ATACTCTAGA TGCTTATGAT GGTGAGAAAT TCGTTACAAT CTCTCTAAAG
CAGTTGTACT CCCTTAAGTA CTCCATGGAG GTGAAGAGGG CTAAGGTGCA ATCATTTGCA
TCTAAGCACA CGAAGAGGGG GAGAGAGTTG TTAAGGAAGT ATTCGCATAG GGAGAGGAAT
CGCGTTCTGG ACTTCGTTCA CAAGTTTGTT AACACTTTGT TGGACTTGTA CCCCGTGACG
TTTTTCGCTG TGGAAAAGCT TGATAAAGAG AGTATGTTTA AGGATGCTAA TGACTCTCTT
TCGAGGAAGA TTTCTAGGAC TGTTTGGGGG AGTATACATA AAGTGTTGGA GTATAAGGCT
CCGCTTTACG GTTCTTTCGT TAAGGAAGTG AACCCGTACC TCACCTCGAG GTCTTGCCCC
AGATGTGGGT TTGTATCCCG AAAGGTTGGT AAGACCTTTG AGTGTGAGAG GTGCGGGTTC
AAGTTGGATA GGCAATTGAA TGCTTCACTG AATATTTATC TCAAGATGTG CGGATTCCCT
CACATCCGTG ACGTTCCACG GGTGTGGGTT GGGGTTATTC CGCTAATGGG GCGGAGAGGG
ATGAACGTCC GTGACTTTGG TGAAGCCCAA GGGCTGAGGA TTGATATTAA ATATCATGAA
ATCCTATGA
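
A GC content figure such as the one in the gene information can in principle be recomputed from the gene sequence. The Python sketch below shows one way to do so. The helper name `gc_content` is hypothetical, and how the database itself computes or rounds its figure is not stated in the record, so the result for the full sequence may differ slightly from the listed value.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # The record prints the sequence in space-separated blocks of 10,
    # so remove all whitespace before counting.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of the gene sequence: 4 G and 1 C out of 10 bases.
print(gc_content("GTGCTGAAGA"))  # 0.5
```

Passing the whole gene sequence, with or without the spaces and line breaks, gives the fraction for the full gene.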
 
Protein sequence
MLKNLRIRKF EPEEEYVYFT YSIKNSEREK SKELIKEYRT LLQKAIDYLW SLTKIQVRKK 
NGNYKITLPK KREVYKPLRD ELEKINHLAS HYVDKAINNA FSIITSWRKR AIKGRASIEK
PTLKKAYVKV KSTLRKVVGE SVRITVRPHE YITFSWSKSW FSRRVRELEL GEPIIKEEKV
YLPFRYKLPW ATPVNFLAID SNLYTLDAYD GEKFVTISLK QLYSLKYSME VKRAKVQSFA
SKHTKRGREL LRKYSHRERN RVLDFVHKFV NTLLDLYPVT FFAVEKLDKE SMFKDANDSL
SRKISRTVWG SIHKVLEYKA PLYGSFVKEV NPYLTSRSCP RCGFVSRKVG KTFECERCGF
KLDRQLNASL NIYLKMCGFP HIRDVPRVWV GVIPLMGRRG MNVRDFGEAQ GLRIDIKYHE
IL
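
The protein sequence corresponds to the gene sequence translated with translation table 11. The gene starts with GTG, yet the protein starts with M, because an alternative initiation codon is read as methionine at the first position. The Python sketch below illustrates this. The function name `translate` is hypothetical, the codon table is the standard code used by table 11, and the set of start codons is given from memory of the NCBI definition of table 11, so it should be treated as an assumption.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: initiation codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"               # any initiation codon is read as Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGCTGAAGA ACTTAAGAAT TAGAAAATTT"))  # MLKNLRIRKF
```

The output matches the first ten residues of the protein sequence. Note that the start-codon rule applies only to the first codon, so translating an internal fragment with this function can give a spurious leading M.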