Gene Ssol_1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1948 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1734199 
End bp1735368 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content37% 
IMG OID 
ProductDNA topoisomerase (ATP-hydrolyzing) 
Protein accessionACX92159 
Protein GI261602556 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.86374 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCTG AATTTATATC AAAGGTTGAT AAAGAAGCGA GAAGGAAAGC TGCTAGTATA 
TTGCGCGATA AGTTCCTTAA TTTAGTTGAA CAACTTAAGA AAGGCGAACC ATTAGTAATG
GAAATCCCAA TGAGAACTTT ATCTAATGCG ATCTATGATG AAAAGAGAAA GCTACTACTC
TTGGGAGAAA AGAAACTTAA AAGGAATTTT CTAGATATGA ACGAAGCAAA ACGATTTATG
CAGACCGTAT TGATGGCATC AATAATTTAT GACGCGCTAG TTAGCGATGA GTACCCAACT
ATACGTGATC TTTACTACAG AGGAAAGCAC TCACTTTTGT TAAAGTCAAT TGATGGCAAT
AAGATAGTGT CTGAAGAGAA TACATGGGAT GAACAAAAGG AGTCAGATAG TGTTATAGTT
GATATCGAAG TATTTACGTC TCTCCTTAGA GAAGAAATGC TGATTCTCAG TAAGGAAAAG
GGTAAAGTAG TAGGTAATTT AAGGATAAGG AGCGGAAATG ATACAATAGA TCTGAGTAAA
ACTGGTCATG GAGCCTACGC GATTGAACCT ACTCCCGATT TGATAGATTT CATTGATGTT
GATGCAGAAT TTGTACTAGT AGTGGAGAAA GATGCAGTAT TCCAACAGTT GCATAGAGCT
GGTTTTTGGA AACAGTATAA GTCCATTTTA ATAACTAGTG CGGGTCAACC AGATAGGGCA
ACTAGGAGAT TTGTCAGAAG ACTTAATGAG GAGCTAAAAT TGCCAGTTTA TATCTTAACT
GATGCTGATC CCTATGGATG GTATATATTC AGCGTATTCA GAATAGGCTC AATATCTTTA
TCTTACGAGA GTGAGAGGCT AGCTACTCCA GACGCCAAAT TTTTGGGCGT ATCAATGAGT
GATATCTTCG GTAATTCCAG AAAGAAACCC TATTTAAGTG AAGCCGAGAG AAAGAATTAT
ATAATTAAGG CCAAAGAGGC AGATATAAAG AGAGCTGAGG AAATTAAAAA CTATGAGTGG
TTTAAGACTA AAGCATGGGA AGAAGAGATA AACACTTTCC TACATAGGAA AGCTAAATTG
GAAATAGAAG CTATGGCAAG CAAGGGTCTT AAGTTTCTCG CTTTCCAGTA CATTCCAGAG
AAGATAACTA ATAAGGATTA CATTGCCTAA
 
Protein sequence
MSSEFISKVD KEARRKAASI LRDKFLNLVE QLKKGEPLVM EIPMRTLSNA IYDEKRKLLL 
LGEKKLKRNF LDMNEAKRFM QTVLMASIIY DALVSDEYPT IRDLYYRGKH SLLLKSIDGN
KIVSEENTWD EQKESDSVIV DIEVFTSLLR EEMLILSKEK GKVVGNLRIR SGNDTIDLSK
TGHGAYAIEP TPDLIDFIDV DAEFVLVVEK DAVFQQLHRA GFWKQYKSIL ITSAGQPDRA
TRRFVRRLNE ELKLPVYILT DADPYGWYIF SVFRIGSISL SYESERLATP DAKFLGVSMS
DIFGNSRKKP YLSEAERKNY IIKAKEADIK RAEEIKNYEW FKTKAWEEEI NTFLHRKAKL
EIEAMASKGL KFLAFQYIPE KITNKDYIA