Gene Ssol_1947 details

Gene Information

Locus tag: Ssol_1947
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1732610
End bp: 1734202
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: DNA topoisomerase VI, B subunit
Protein accession: ACX92158
Protein GI: 261602555
COG category:
COG ID:
TIGRFAM ID:
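
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this), and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1732610
end_bp = 1734202
gene_length_bp = 1593
protein_length_aa = 530

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final codon is assumed to be a stop codon
# that does not contribute an amino acid.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp, codons, expected_protein_length)
```

Under these assumptions the span and the protein length agree with the values reported in the record.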


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.333338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGCTA AAGAAAAGTT TGCAAGCTTA TCCCCAGCAG AATTCTTTAA AAGGAATCCG 
GAGCTTGCCG GATTCCCTAA TCCAGCTAGG GCTCTTTATC AAACTGTTAG AGAATTAATA
GAAAACTCGT TGGACGCAAC TGATGTTCAT GGGATACTGC CTAATATTAA GATTACGATT
GACTTAATTG ATGAATCTAG GCAAATATAT AAAGTTAATG TAGTTGATAA TGGAATAGGC
ATCCCCCCAC AAGAGGTTCC CAATGCGTTT GGTAGAGTAT TATACAGTTC AAAATACGTG
AATAGGCAAA CTAGGGGTAT GTATGGTCTG GGTGTAAAAG CTGCTGTTCT CTATAGTCAA
ATGCATCAAG ATAAGCCTAT AGAAATAGAG ACTTCACCAG TAAATTCTAA AAGATTATAT
ACTTTTAAAT TGAAAATTGA TATAAATAAG AACGAGCCAA TAATTGTAGA AAGGGGATCT
GTTGAAAACA ATACAGGTTT TCATGGGACG TCAGTAGCAA TATCTATACC GGGGGACTGG
CCCAAAGCTA AATCAAGAAT TTATGAATAT ATCAAAAGGA CTTACATTAT TACCCCATAT
GCAGAATTTA TCTTTAAGGA CCCTGAAGGA AATGTAACAT ATTATCCGAG ACTAACAAAT
AAGATTCCTA AGCCACCACA AGAGGTTAAG CCTCATCCTT ATGGAGTAGA TAGAGAAGAA
ATCAAAATAA TGATAAATAA TCTAAAGAGA GATTATACTA TAAAGGAATT TTTAATGAGT
GAATTCCAAA GTATAGGAGA TACTACTGCA GATAAGATTT TAGAATTAGT TGGATTAAGG
CCCAATAAGA AGGTTAAGAA TTTAACAGAA GAGGAAATCA CTAGGCTAGT TGAGACTTTT
AAGAAATATG AGGATTTTAG ATCTCCTTCA GCAGACTCAC TTTCTGTAAT AGGGGAAGAT
TTGATTGAAT TAGGTTTAAA AAAGATCTTT AATCCGGATT TTACAGCATC CATAACCAGG
AAACCTAAGG CTTATCAAGG ACACCCATTT ATAGTGGAAG CCGGTATCGC ATTCGGTGGT
AGCATACCCG TTGGGGAAGA GCCCATAGTT TTAAGATACG CTAATAAGAT TCCGTTAATT
TATGACGAGA AATCGGATGT CATATGGAAG GTTGTTGAAG AGTTAGATTG GAAAAGGTAT
GGTATTGAGT CAGATCAGTA TCAAATGGTA GTGATGGTTC ATCTGTGCAG TACTAAGATA
CCTTATAAGA GTGCTGGTAA GGAAAGCATC GCTGAAGTAG AAGATATAGA GAAGGAAATA
AAGAACGCGC TGATGGAGGT TGCAAGAAAA CTGAAATTGT ATTTAAGTGA GAAGAGAAAG
GAGCAAGAGG CTAAGAAGAA ATTACTCGCA TATTTAAAAT ATGTACCAGA AGTCAGTAGG
TCTTTGGCAA TCTTTTTAGC ATCGGGTAAT AAAGAGTTAG TGCCAAAATA TCAAGGTGAG
ATTGTGGAAG GTTTATTTAA ACTTATTTCT AAGAAATTAG ATTTGATTAA TATTGAAGAG
TATAGAAAGG TATATAAGGT GGATAGTGAA TGA
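
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any calculation such as GC content. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any tool associated with this record, and only the first line of the listing is used as a demonstration, so the printed value describes that fragment and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, used only as a short demonstration.
first_line = "ATGTCTGCTA AAGAAAAGTT TGCAAGCTTA TCCCCAGCAG AATTCTTTAA AAGGAATCCG"
fragment = clean_sequence(first_line)

# Prints the fragment length and its GC fraction.
print(len(fragment), round(gc_fraction(fragment), 3))
```

Passing the full listing through `clean_sequence` first would give a figure that can be compared with the GC content field of the record.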
 
Protein sequence
MSAKEKFASL SPAEFFKRNP ELAGFPNPAR ALYQTVRELI ENSLDATDVH GILPNIKITI 
DLIDESRQIY KVNVVDNGIG IPPQEVPNAF GRVLYSSKYV NRQTRGMYGL GVKAAVLYSQ
MHQDKPIEIE TSPVNSKRLY TFKLKIDINK NEPIIVERGS VENNTGFHGT SVAISIPGDW
PKAKSRIYEY IKRTYIITPY AEFIFKDPEG NVTYYPRLTN KIPKPPQEVK PHPYGVDREE
IKIMINNLKR DYTIKEFLMS EFQSIGDTTA DKILELVGLR PNKKVKNLTE EEITRLVETF
KKYEDFRSPS ADSLSVIGED LIELGLKKIF NPDFTASITR KPKAYQGHPF IVEAGIAFGG
SIPVGEEPIV LRYANKIPLI YDEKSDVIWK VVEELDWKRY GIESDQYQMV VMVHLCSTKI
PYKSAGKESI AEVEDIEKEI KNALMEVARK LKLYLSEKRK EQEAKKKLLA YLKYVPEVSR
SLAIFLASGN KELVPKYQGE IVEGLFKLIS KKLDLINIEE YRKVYKVDSE