Gene Ssol_1851 details


Gene Information

Locus tag: Ssol_1851
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1642641
End bp: 1644320
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: thermosome
Protein accession: ACX92063
Protein GI: 261602460
COG category:
COG ID:
TIGRFAM ID:
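
The coordinates and lengths listed above agree with each other. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1642641
end_bp = 1644320

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1680
print(protein_length)  # 559
```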


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0805773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGCTC CAGTCTTATT GCTTAAAGAG GGAACAAGCA GAACTACTGG AAGAGATGCG 
CTAAGGAATA ATATACTTGC TGCGAAGACA TTAGCTGAAA TGTTAAGGAG TAGTTTAGGT
CCTAAAGGTC TTGATAAAAT GCTAATTGAT AGCTTCGGTG ACGTAACCAT AACTAATGAT
GGTGCTACAA TAGTAAAGGA TATGGAGATT CAGCATCCAG CTGCGAAATT ATTAGTAGAA
GCAGCTAAAG CTCAAGATGC TGAAGTAGGT GATGGTACTA CAAGTGCTGT AGTATTGGCT
GGTGCTCTAT TAGAGAAGGC TGAAAGTTTA TTGGATCAAA ATATACATCC TACAATAATT
ATTGAGGGGT ATAAGAAGGC ATACAACAAG GCCTTAGAGT TACTTCCGCA GTTAGGAACT
AGAATTGATA TAAAGGATTT GAATTCTTCA GTAGCTAGAG ATACTCTAAG AAAGATAGCA
TTTACTACTT TAGCAAGTAA GTTTATTGCA GAAGGTGCTG AATTAAATAA AATAATTGAC
ATGGTAATAG ATGCAATAGT TAATGTAGCA GAACCTTTAC CTAATGGTGG ATATAATGTG
AGTTTAGACT TAATAAAGAT AGATAAGAAG AAAGGCGGAA GTATAGAGGA TAGTGTACTA
GTTAAAGGAC TAGTGTTAGA TAAGGAGGTA GTACATCCTG GAATGCCTAG AAGAGTCACC
AAGGCTAAGA TAGCTGTTTT GGATGCAGCA TTAGAGGTAG AGAAGCCTGA AATTTCAGCC
AAGATAAGCA TCACATCACC TGAGCAAATT AAGGCTTTCT TGGATGAGGA GTCTAAATAC
CTTAAGGATA TGGTTGATAA GTTAGCATCA ATAGGTGCTA ATGTTGTAAT ATGCCAGAAG
GGTATTGATG ATATAGCACA GCATTTCTTA GCCAAGAAAG GGATATTGGC TGTAAGAAGA
GTTAAGAGGA GCGATATAGA AAAATTAGAG AAGGCATTAG GTGCAAGAAT AATAAGTAGC
ATTAAAGATG CTACTCCCGA AGACTTAGGA TATGCTGAAT TAGTTGAGGA AAGGAGAGTT
GGGAACGATA AAATGGTATT TATAGAGGGT GCTAAGAACT TGAAAGCTGT GAATATCTTG
TTAAGAGGTT CAAATGATAT GGCATTAGAT GAGGCTGAGA GGAGTATAAA TGATGCATTG
CATGCTCTAA GGAACATATT ATTAGAGCCA GTAATATTAC CAGGCGGTGG TGCTATCGAG
TTAGAGTTAG CGATGAAATT AAGAGAGTAT GCTAGAAGTG TGGGAGGTAA GGAGCAATTA
GCTATAGAAG CATTTGCAGA CGCATTAGAG GAGATACCTT TAATTTTAGC TGAAACTGCA
GGGCTAGAGG CTATATCTTC ATTGATGGAC TTAAGAGCTA GGCACGCTAA GGGCTTGAGT
AATACTGGTG TAGATGTCAT AGGCGGGAAG ATTGTAGATG ATGTATATGC GTTAAACATT
ATCGAGCCTA TTAGAGTAAA GTCTCAAGTG TTAAAGAGTG CTACAGAGGC AGCCACAGCA
ATATTAAAGA TTGATGATCT AATAGCGGCT GCCCCACTAA AGAGTGAGAA GAAAGGAGGA
GAAGGAAGTA AAGAAGAAAG TGGTGGAGAG GGAGGATCTA CTCCATCTTT AGGAGACTAA
 
Protein sequence
MAAPVLLLKE GTSRTTGRDA LRNNILAAKT LAEMLRSSLG PKGLDKMLID SFGDVTITND 
GATIVKDMEI QHPAAKLLVE AAKAQDAEVG DGTTSAVVLA GALLEKAESL LDQNIHPTII
IEGYKKAYNK ALELLPQLGT RIDIKDLNSS VARDTLRKIA FTTLASKFIA EGAELNKIID
MVIDAIVNVA EPLPNGGYNV SLDLIKIDKK KGGSIEDSVL VKGLVLDKEV VHPGMPRRVT
KAKIAVLDAA LEVEKPEISA KISITSPEQI KAFLDEESKY LKDMVDKLAS IGANVVICQK
GIDDIAQHFL AKKGILAVRR VKRSDIEKLE KALGARIISS IKDATPEDLG YAELVEERRV
GNDKMVFIEG AKNLKAVNIL LRGSNDMALD EAERSINDAL HALRNILLEP VILPGGGAIE
LELAMKLREY ARSVGGKEQL AIEAFADALE EIPLILAETA GLEAISSLMD LRARHAKGLS
NTGVDVIGGK IVDDVYALNI IEPIRVKSQV LKSATEAATA ILKIDDLIAA APLKSEKKGG
EGSKEESGGE GGSTPSLGD
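
The gene and protein sequences can be cross-checked by translating codons. The sketch below is not the tool that produced this record; it is a small illustration with hypothetical helper names (`clean`, `gc_fraction`, `translate`). It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted). The sample input is the first line of the gene sequence above, plus the last nine bases.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino-acid assignments are
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as shown in this record.
first_line = "ATGGCAGCTC CAGTCTTATT GCTTAAAGAG GGAACAAGCA GAACTACTGG AAGAGATGCG"
# Last nine bases of the gene sequence (ends in the stop codon TAA).
last_bases = "GGAGACTAA"

print(translate(first_line))  # MAAPVLLLKEGTSRTTGRDA
print(translate(last_bases))  # GD
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final two residues. Passing the full gene sequence to `gc_fraction` would give the value that the record reports as a rounded GC content.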