Gene Ssol_1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1889 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1678023 
End bp1680029 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content32% 
IMG OID 
ProductDNA topoisomerase I 
Protein accessionACX92101 
Protein GI261602498 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.806156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTAT GTAATGTAAA CAACTATTAC CTTATAATTG CAGAAAAATC AAAAGCTGCT 
AAAAAAATCG CAGAAGCTCT TTCAGAGAAG CCTATTTTGT GTAGAAAATA TAACGTTAGT
TATTGGATAA TAAAAGATCA CAATAGCAGC AAATATGTTA TAGTTCCTGC AGCAGGACAT
CTTTTTGGGT TAAAAGGCGA GAGCGGTTTT CCGGTATATG ATGCGGACTG GAAACCTCTA
TGGGAAATTG ATAAGAATAG TTACTATACA AAAAGGTACT ATCAACTTAT CTCATCTCTT
AGCAAGTATG CTTTAGGTTT TATCAATGCT TGTGATTACG ATATAGAAGG TTCTGTAATC
GGCTATTTAA TAATCAAAAA TCTAGGCGAT ATTAAGAAGG CTAAACGAAT GAAATTTTCA
GCACTGACTA AAAGTGATAT ATTATCTGCA TTTAGAAACA TTTCTGCATT AGATTACGAT
ATGATAAACG CTGGAATAGC CAGACATAAA ATTGACTGGC TATGGGGAAT TAACGTTAGT
AGGGCTCTTA TGATCTCTCT ACAAGATTTC GCAAAGAAAA GAGTGATATT AAGTGCGGGT
AGGGTTCAAA GTCCAACTCT AGTTCAAGTT GTCAACTCGG AAATTGAAAG AAACCTATTT
ATTCCCCTAC CTAAATTTAC CGTTTCAATT ATCGTGAAAA TTAAAGATTA TTCATTAAAC
ATTAAGGTAA ATAAGGAATT CGAAAAAATT ACCGAAGCAA AGGAATTCTT AAACAAACTA
ATAAATAAAA CAGTAAAAGT TGTTGAAGTT GAAAATAGGG TTAGGTTATT AGAAAGACCC
TCTCCATTTA ATCTTACTGA TCTCCAAATA GAAGCTGGCA GAATATATGG TATATCCCCA
TATAACGTAG AACGTATAGC AGAAGATCTT TATTTGGACG GTCTAATAAG TTACCCAAGA
ACTAACAGTC AAAAAATTCC ATCAACTATC AGCATTTATA ATATAATCAA AGGCTTAGAG
AACAGTTCGT ATAGGAAACT AGTTGATTTA GTAAGGAAAA TCACTGGGGG GAAATATGTA
GTTAAGCAGG GCATTAAGGA TGATCCTGCA CACCCCGCAA TCCACCCTAC TGGTGAAGCT
CCCAAAAACT TACCTAATAG CAAATTCAAG ATATATGACT TAATAGCAAG AAGATTTTTG
GGGTCAGTAT CTGCTGATGC TAAATTATCT AATACTATTT ACACCTTGAA AGTTAGTGAT
TTCCCATTAG AGTTTACGGT CTCATATACA AAAATACTAG AAAGAAATTG GCTAGATATA
TATCACTTTC ATAATGTAAA AGAAGATAAA CCAATATTTC TCTCAAAAGG TGATGAGGGT
AAAATAGTAG ATGGAAAAGT AAATATTAGT TTAAGCAAGC CCACTTCTAG GTATACAAAG
GTTTCATTAC TCAAATGGAT GGAGTCTTCT AATTTGGGTA CAGAAGCCAC TAGAGGAAGA
ATAATCGAGA TCTTAGTAAA GAGAAAATAT TTAACTAACA ATGGGCGATA CATAATTCCT
ACAAAATTGG GATTCTATAT TGCTGAAATA TTAAACAAAT TTTTCCCAGA TATAGTTGAT
GTTAGGATGA CTGCAGATAT GGAAAGTAAA TTAGAAATGA TAAAGACTGG CAAAGTTTTG
GAAAGCAAAG TGATTAAAGA AAACATAGAA AAATTAAATA AATTCATAGA GGAATACAAG
GTAAATAAGG ATAAAGTAGG AGAGTCATTA GCTAAAGCGT TAGGTCTTAT AAAAATTGTT
AAGTGTAAGT ATTGTGATCT GGAGCAGTAT AAGGATGGAT TATGTAAATA TCACTATGAA
GCCAAAGTAA GGCTTTTAGA TGCTGTGGAA ATTTGGAAAG AAAGGACAAA ATATGATCAT
AAAAAGATTT TAAAGAGAAT TAGTAGTAGT AAATCAACGG GTAAATACGT AAAAGATATA
GTAACTTATA TGCTAAGCAG TGAATGA
 
Protein sequence
MNLCNVNNYY LIIAEKSKAA KKIAEALSEK PILCRKYNVS YWIIKDHNSS KYVIVPAAGH 
LFGLKGESGF PVYDADWKPL WEIDKNSYYT KRYYQLISSL SKYALGFINA CDYDIEGSVI
GYLIIKNLGD IKKAKRMKFS ALTKSDILSA FRNISALDYD MINAGIARHK IDWLWGINVS
RALMISLQDF AKKRVILSAG RVQSPTLVQV VNSEIERNLF IPLPKFTVSI IVKIKDYSLN
IKVNKEFEKI TEAKEFLNKL INKTVKVVEV ENRVRLLERP SPFNLTDLQI EAGRIYGISP
YNVERIAEDL YLDGLISYPR TNSQKIPSTI SIYNIIKGLE NSSYRKLVDL VRKITGGKYV
VKQGIKDDPA HPAIHPTGEA PKNLPNSKFK IYDLIARRFL GSVSADAKLS NTIYTLKVSD
FPLEFTVSYT KILERNWLDI YHFHNVKEDK PIFLSKGDEG KIVDGKVNIS LSKPTSRYTK
VSLLKWMESS NLGTEATRGR IIEILVKRKY LTNNGRYIIP TKLGFYIAEI LNKFFPDIVD
VRMTADMESK LEMIKTGKVL ESKVIKENIE KLNKFIEEYK VNKDKVGESL AKALGLIKIV
KCKYCDLEQY KDGLCKYHYE AKVRLLDAVE IWKERTKYDH KKILKRISSS KSTGKYVKDI
VTYMLSSE