Gene Ssol_0597 details


Gene Information

Locus tag: Ssol_0597
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 543994
End bp: 545118
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: transposase, IS605 OrfB family
Protein accession: ACX90872
Protein GI: 261601269
COG category:
COG ID:
TIGRFAM ID:
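
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 543994
end_bp = 545118

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons = gene_length // 3

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1125 bp) and Protein Length (374 aa) fields.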


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000997644
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAGTTAG CGTGCGAAAT CTACAACACC TTAAGGTGGG CAGACATATA CTTTTACCAA 
AGGGATGGGA AAGGACTAAC ACAAACTGAG TTAAGACAGT TGGCTCTAGA TCTGAGAAAA
CAAGATGATG AGTATAAGCA ACTCTACTCG CAAGTAGTTC AACAAATAGC TGACCGTTAT
TACGAAGCTA AGAAGAGGTT TTTCGAAGGT TTAGCACGTT TCCCGAAAGA AAAGAAACCT
CACAAATACT ACTCCCTAGT CTATCCCCAG TATGGTTGGA AAATACTTCA AGTTAGAGAA
ATAAGAAAAG GCAAGAAGAA ACTAATAACG CTTAAACTAT CAAATCTTGG TACGTTCAAG
GTAATAGTTC ACCGAGACTT TCCCCTTGAC AAAGTAAAGA GGGTAGTAGT GAAGCTAACA
AGATCTGAGA GGATTTACAT CACTTTCGTA GTTGATCACG AATTCCCCAA GTTACCTAAC
ACTGGTAAGG TAGTGGCGAT AGATGTTGGC ATAGAGAAGT TGATCGTAAC GTCGGATGGT
GAATATTTCC CCAATCTGAG ACCTTACGAG AAAGCGTTAT GGAAAGTGAA GCATCTACAC
AGAGAACTTT CAAGGAAGAA GTTTCTTTCA AATAACTGGT TTAAGGCTAA GGTTAAGCTT
GCTAGGGCTT ATGAGTATTT GAAGAATCTA AGAACGGATC TTTATATGAA GTTGGGTAAG
TGGTTTGCTG AGCATTATGA CGTTGTGGTG ATGGAGGACA TTCATGTTAA GCAGTTGATA
GGTAAGTCAT TAAGGTCTCT GAGGAGGAGA TTGAGTGACG TCGCGTTCAG CGAGCTTAGA
GATTTGATTA AGTATCAGTT GGAGAAATAC GGTAAGAAAC TCATCCTGGT CAACCCAGCA
TACACTTCCA AAACTTGTGC TAGGTGCGGG TACGTAAAAG AAGATCTGTC TCTATCTGAT
CGTGTTTTCG TTTGTTCCAA CTGTGGTTGG ATTGCAGATC GTGACTATAA TGCTTCTCTT
AACATTTTGA AGGGTGCGGG GTCGGAGCGA TCCTTAGTGC CTGTGGAACT CCGCCCTCTA
CCAGTACCAG CACTTCGGTA CTGGCATGGC AGAGCTGTGA AGTAG
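
A GC content figure such as the one in this record can be recomputed from a sequence pasted in the grouped format shown above. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases. The function name `gc_fraction` is hypothetical, and only the first line of the gene sequence is used as input, so the result is not the whole-gene value.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between 10-base groups) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence from this record (60 bases).
first_line = "TTGAAGTTAG CGTGCGAAAT CTACAACACC TTAAGGTGGG CAGACATATA CTTTTACCAA"
first_line_gc = gc_fraction(first_line)
print(f"{first_line_gc:.0%}")
```

Passing the full gene sequence to the same function would give the whole-gene figure to compare against the GC content field.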
 
Protein sequence
MKLACEIYNT LRWADIYFYQ RDGKGLTQTE LRQLALDLRK QDDEYKQLYS QVVQQIADRY 
YEAKKRFFEG LARFPKEKKP HKYYSLVYPQ YGWKILQVRE IRKGKKKLIT LKLSNLGTFK
VIVHRDFPLD KVKRVVVKLT RSERIYITFV VDHEFPKLPN TGKVVAIDVG IEKLIVTSDG
EYFPNLRPYE KALWKVKHLH RELSRKKFLS NNWFKAKVKL ARAYEYLKNL RTDLYMKLGK
WFAEHYDVVV MEDIHVKQLI GKSLRSLRRR LSDVAFSELR DLIKYQLEKY GKKLILVNPA
YTSKTCARCG YVKEDLSLSD RVFVCSNCGW IADRDYNASL NILKGAGSER SLVPVELRPL
PVPALRYWHG RAVK
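
The protein sequence can be related back to the gene sequence by translating codons. The following is a minimal sketch, assuming translation table 11 (as listed in the record), in which TTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The function name `translate` and its `first_is_start` parameter are hypothetical, and only the first ten and last five codons of the gene are translated.

```python
from itertools import product

# Codon table in TCAG order; tables 1 and 11 share these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        codon = bases[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons and last five codons of the gene sequence in this record.
head = translate("TTGAAGTTAG CGTGCGAAAT CTACAACACC")
tail = translate("AGAGCTGTGA AGTAG", first_is_start=False)
print(head, tail)
```

The translated fragments correspond to the first ten residues (MKLACEIYNT) and last four residues (RAVK) of the protein sequence above.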