Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_2047 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 1836811 |
End bp | 1838079 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | transposase, IS605 OrfB family |
Protein accession | ACX92253 |
Protein GI | 261602650 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0187129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGAAGA ACTTAAGAAT TAGAAAATTT GAACCGGAAG AGGAATACGT GCACTTCACG TACTCTATCA AGAATAGTGA GAGGGAGAAG AGCAAAGAGT TAATTAAAGA ATACAGAACA CTACTACAGA AAGCAATTGA CTACCTGTGG AATTTAACGA AAATACAAGT AAGAAAAAAG AACGGTAATT ACAAGATAAC ACTACCGAAG AAGAAGGAAG TGTACAAACC ACTTAGGGAA GAGTTGGAGA AGATCAACCA CCTCGCGTCA CACTACGTCG ATAAGGCAAT TAATGACGCA TTCTCGATCT TGAAGTCGTG GAGGAAAAGG GCCATAAAGG GGAGAGCTTC GATTGAAAAA CCAAGGGTGA AGAAGGCTTA CGTTAGGATA AAGACGACTC TGAGGAAGGT TGTGGGGGAA AGCGTTAGAA TAACTGTAAG ACCTCACGAG TACATCACCT TCCCGTGGAG TAAGTCATGG TTCTCAAGAA GGGTTAGGGA GTTGGAACTT GGCGAACCTA TAATTAAGGA GGAGAAAGTG TATTTGCCAT TTCGTTACAA GTTACCGTGG GTAACACCAG TGAACTTTCT AGCTATTGAC TCCAACCTTT ATACTCTAGA TGCTTATGAT GGTGAGAAAT TCGTTACAAT CTCTCTGAAG CAGTTGTACT CCCTTAAGTA CTCTATGGAG GTGAAGAGGG CTAAGGTGCA ATCATTTGCA TCAAAGCACA CGAAGAGGGG GAGAGAGTTG TTAAGGAAGT ATTCGCATAG GGAGAGGAAT CGCGTTCTGG ACTTCGTTCA CAAGTTTGTA AACACTTTGT TGGACTTGTA CCCCATGACG TTTTTCGCTG TGGAAAAGCT TAACAAAGAG AGTATGTTTA AGGATGCTAA TGGCTCTCTT TCGAGGAAGA TTTCTAGGAC TGTTTGGAGG AGTATACATA GAGTGTTGAA GTACAAGGCT CCGCTTTACG GTTCTTTCGT TAAGGAAGTG AACCCACACC TCACCTCGAG GTCTTGCCCC AGATGTGGGT TTGTATCCCG AAAGGTTGGT AAGACCTTTG AGTGTGAGAG GTGTGGGTTC AAGTTGGATA GGCAACTGAA CGCGTCACTG AATATTTATC TCAAGATGTG CGGTTTTCCT CACATCCGTG AAATAGCGCG GGTGTGGGTT GGGGTTATCC CGCTAATGGG GCGGAGAGGG ATGAACGTCC GCGACTTCGG TGAAGCCCAA GGGCTGAGGA TTGATATTAA ATATCATGAA ATCCCATGA
|
Protein sequence | MLKNLRIRKF EPEEEYVHFT YSIKNSEREK SKELIKEYRT LLQKAIDYLW NLTKIQVRKK NGNYKITLPK KKEVYKPLRE ELEKINHLAS HYVDKAINDA FSILKSWRKR AIKGRASIEK PRVKKAYVRI KTTLRKVVGE SVRITVRPHE YITFPWSKSW FSRRVRELEL GEPIIKEEKV YLPFRYKLPW VTPVNFLAID SNLYTLDAYD GEKFVTISLK QLYSLKYSME VKRAKVQSFA SKHTKRGREL LRKYSHRERN RVLDFVHKFV NTLLDLYPMT FFAVEKLNKE SMFKDANGSL SRKISRTVWR SIHRVLKYKA PLYGSFVKEV NPHLTSRSCP RCGFVSRKVG KTFECERCGF KLDRQLNASL NIYLKMCGFP HIREIARVWV GVIPLMGRRG MNVRDFGEAQ GLRIDIKYHE IP
|
| |