Gene Information |
Locus tag | Ssol_0912 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 853326 |
End bp | 854594 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | transposase, IS605 OrfB family |
Protein accession | ACX91157 |
Protein GI | 261601554 |
COG category | |
COG ID | |
TIGRFAM ID | |
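
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 853326
END_BP = 854594


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1269
print(protein_length(length_bp))  # 422
```

Under these assumptions both results agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields in the record.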
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000247505 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGAAGA ACTTAAGAAT TAGAAAATTT GAACCGGAAG AGGAATACGT GCACTTCACG TACTCCATCA AGAATAGTGA GAGGGAGAAG AGCAAAGAGT TAATTAAAGA ATACAGAACA CTACTACAGA AAGCAATTGA CTACCTGTGG AATTTAACGA AAATACAAGT AAGAAAAAAG AACGGTAGTT ACAAGATAAC ACTACCGAAG AAGAAGGAAG TATACAAACC ACTTAGGGAA GAGTTGGAGA AGATCAACCA CCTCGCGTCA CACTACGTCG ATAAGGCAAT TAATGACGCA TTCTCGATCT TGAAGTCGTG GAGGAAAAGG GCCATAAAGG GGAGAGCTTC GATTGAAAAA CCTACGTTAA AGAAGGCTTA CGTTAGGATA AAGACGACTC TGAGGAAGGT TGTGGGGGAA AGCGTTAGAA TAACGGTAAG ACCTTATGAG TACATCACCT TCTCGTGGAG TAAGTCATGG TTCTCAAGAA GGGTTAGGGA GTTGGAACTC GGCGAACCTA TAATTAAGGA GGAGAAGGTT TACCTACCAT TTCGTTACAA GTTACCGTGG GTAACACCAG TGAACTTTCT AGCTATTGAC TCCAACCTTT ATACTCTAGA TGCTTATGAT GGTGAGAAAT TCGTTACAAT CTCCCTAAAG CAGTTGTACT CCCTTAAGTA CTCCATGGAG GTGAAGAGGG CTAAGGTGCA ATCATTTGCA TCAAAGCACA CGAAGAGGGG GAGAGAGTTG ATGAGGAAGT ATTCGCATAG GGAGAGGAAT CGCGTTCTAG ATTTTGTTCA CAAGTTTGTT AACACTTTGT TGGACTTGTA CCCCATGACG TTTTTCGCTG TGGAAAAGCT TAACAAAGAG AGTATGTTTA AGGATGCTAA TGGCTCTCTT TCGAGGAAGA TTTCTAGGAC TGTTTGGAGG AGTATACACA GAGTGTTGAA GTATAAGGCT CCGCTTTACG GTTCTTTCGT TAAGGAAGTG AACCCACACC TCACCTCGAG GTCTTGCCCC AGATGTGGGT TTGTATCCCG AAAGGTTGGT AAGACCTTTG AGTGTGAGAG GTGTGGGTTC AAGTTGGATA GACAACTGAA CGCGTCATTG AATATTTATC TCAAGATGTG CGGTTTTCCT CACATCCGTG ATATTCCGCG GGTGTGGGTT GGGGTTATTC CGCTAATGGG GCGGAGAGGG ATGAACGTCC GCGACTTTGG TGAAGCCCAA GGGCTGAGGA TTGATATCAA ATATCATGAA ATCCTATGA
|
Protein sequence | MLKNLRIRKF EPEEEYVHFT YSIKNSEREK SKELIKEYRT LLQKAIDYLW NLTKIQVRKK NGSYKITLPK KKEVYKPLRE ELEKINHLAS HYVDKAINDA FSILKSWRKR AIKGRASIEK PTLKKAYVRI KTTLRKVVGE SVRITVRPYE YITFSWSKSW FSRRVRELEL GEPIIKEEKV YLPFRYKLPW VTPVNFLAID SNLYTLDAYD GEKFVTISLK QLYSLKYSME VKRAKVQSFA SKHTKRGREL MRKYSHRERN RVLDFVHKFV NTLLDLYPMT FFAVEKLNKE SMFKDANGSL SRKISRTVWR SIHRVLKYKA PLYGSFVKEV NPHLTSRSCP RCGFVSRKVG KTFECERCGF KLDRQLNASL NIYLKMCGFP HIRDIPRVWV GVIPLMGRRG MNVRDFGEAQ GLRIDIKYHE IL
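
The sequences above are printed in space-separated blocks of 10 characters, so they need to be normalized before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction; `normalize` and `gc_fraction` are hypothetical helper names, and the example input is only the first three blocks of the gene sequence, not the full record.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "GTGCTGAAGA ACTTAAGAAT TAGAAAATTT"

print(len(normalize(first_blocks)))  # 30
print(gc_fraction(first_blocks))
```

Applying `gc_fraction` to the complete gene sequence should give a value comparable to the GC content field in the record, assuming that field was computed over the same span.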