Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0597 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 543994 |
End bp | 545118 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | transposase, IS605 OrfB family |
Protein accession | ACX90872 |
Protein GI | 261601269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000997644 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGTTAG CGTGCGAAAT CTACAACACC TTAAGGTGGG CAGACATATA CTTTTACCAA AGGGATGGGA AAGGACTAAC ACAAACTGAG TTAAGACAGT TGGCTCTAGA TCTGAGAAAA CAAGATGATG AGTATAAGCA ACTCTACTCG CAAGTAGTTC AACAAATAGC TGACCGTTAT TACGAAGCTA AGAAGAGGTT TTTCGAAGGT TTAGCACGTT TCCCGAAAGA AAAGAAACCT CACAAATACT ACTCCCTAGT CTATCCCCAG TATGGTTGGA AAATACTTCA AGTTAGAGAA ATAAGAAAAG GCAAGAAGAA ACTAATAACG CTTAAACTAT CAAATCTTGG TACGTTCAAG GTAATAGTTC ACCGAGACTT TCCCCTTGAC AAAGTAAAGA GGGTAGTAGT GAAGCTAACA AGATCTGAGA GGATTTACAT CACTTTCGTA GTTGATCACG AATTCCCCAA GTTACCTAAC ACTGGTAAGG TAGTGGCGAT AGATGTTGGC ATAGAGAAGT TGATCGTAAC GTCGGATGGT GAATATTTCC CCAATCTGAG ACCTTACGAG AAAGCGTTAT GGAAAGTGAA GCATCTACAC AGAGAACTTT CAAGGAAGAA GTTTCTTTCA AATAACTGGT TTAAGGCTAA GGTTAAGCTT GCTAGGGCTT ATGAGTATTT GAAGAATCTA AGAACGGATC TTTATATGAA GTTGGGTAAG TGGTTTGCTG AGCATTATGA CGTTGTGGTG ATGGAGGACA TTCATGTTAA GCAGTTGATA GGTAAGTCAT TAAGGTCTCT GAGGAGGAGA TTGAGTGACG TCGCGTTCAG CGAGCTTAGA GATTTGATTA AGTATCAGTT GGAGAAATAC GGTAAGAAAC TCATCCTGGT CAACCCAGCA TACACTTCCA AAACTTGTGC TAGGTGCGGG TACGTAAAAG AAGATCTGTC TCTATCTGAT CGTGTTTTCG TTTGTTCCAA CTGTGGTTGG ATTGCAGATC GTGACTATAA TGCTTCTCTT AACATTTTGA AGGGTGCGGG GTCGGAGCGA TCCTTAGTGC CTGTGGAACT CCGCCCTCTA CCAGTACCAG CACTTCGGTA CTGGCATGGC AGAGCTGTGA AGTAG
|
Protein sequence | MKLACEIYNT LRWADIYFYQ RDGKGLTQTE LRQLALDLRK QDDEYKQLYS QVVQQIADRY YEAKKRFFEG LARFPKEKKP HKYYSLVYPQ YGWKILQVRE IRKGKKKLIT LKLSNLGTFK VIVHRDFPLD KVKRVVVKLT RSERIYITFV VDHEFPKLPN TGKVVAIDVG IEKLIVTSDG EYFPNLRPYE KALWKVKHLH RELSRKKFLS NNWFKAKVKL ARAYEYLKNL RTDLYMKLGK WFAEHYDVVV MEDIHVKQLI GKSLRSLRRR LSDVAFSELR DLIKYQLEKY GKKLILVNPA YTSKTCARCG YVKEDLSLSD RVFVCSNCGW IADRDYNASL NILKGAGSER SLVPVELRPL PVPALRYWHG RAVK
|
| |