Gene Ssol_1158 details

Gene Information

Locus tag: Ssol_1158
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1077179
End bp: 1078234
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 34%
IMG OID:
Product: flap structure-specific endonuclease
Protein accession: ACX91396
Protein GI: 261601793
COG category:
COG ID:
TIGRFAM ID:
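
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1077179
end_bp = 1078234

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1056 351
```

Under these assumptions the computed values agree with the listed Gene Length (1056 bp) and Protein Length (351 aa).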


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.146438
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATAGGAGTAG ATTTAGCAGA TTTAGTAAAA GATGTAAAAA GAGAGCTTTC GTTTTCTGAA 
CTTAAAGGAA AGAGGGTAAG TATCGATGGA TATAATGCGT TATATCAGTT TTTAGCTGCA
ATAAGACAAC CGGATGGAAC TCCCTTAATG GATTCACAGG GGAGAGTAAC TAGTCATCTA
AGTGGATTAT TTTATAGAAC GATAAATATC CTAGAAGAAG GTGTCATACC AATCTACGTC
TTTGATGGTA AACCGCCAGA GCAAAAAAGT GAAGAGTTAG AAAGAAGGAG AAAGGCAAAA
GAAGAAGCTG AGAGAAAGTT AGAAAGAGCC AAAAGCGAAG GGAAGATAGA GGAGCTAAGG
AAATATTCAC AAGCTATACT TAGATTATCA AATATCATGG TAGAAGAAAG TAAAAAATTG
TTGAGAGCTA TGGGAATACC TATAGTTCAA GCACCTTCTG AAGGAGAGGC TGAGGCTGCA
TATCTGAATA AATTAGGATT AAGCTGGGCA GCAGCAAGTC AAGATTATGA TGCGATACTT
TTCGGAGCTA AAAGGCTAGT AAGAAATTTG ACAATTACAG GGAAAAGAAA GCTTCCTAAC
AAGGACGTAT ATGTGGAGAT AAAACCAGAA TTAATAGAAA CGGAAATCTT ACTTAAAAAA
TTAGGAATAA CAAGGGAACA ACTAATTGAT ATAGGCATAT TAATTGGAAC AGACTATAAT
CCTGATGGAA TAAGGGGCAT AGGACCAGAA AGAGCACTAA AGATAATAAA AAAATACGGC
AAGATAGAAA AAGCAATGGA ATATGGTGAA ATTTCAAAAA AGGATATTAA TTTCAATATC
GATGAAATTA GAGGTTTATT TTTGAATCCA CAAGTAGTTA AACCAGAGGA AGCTTTAGAT
TTGAATGAAC CTAACGGAGA AGATATAATT AATATCTTAG TATATGAACA TAACTTCAGT
GAAGAAAGAG TGAAGAATGG AATTGAAAGA TTAACTAAAG CAATAAAAGA AGCTAAAGGA
GCGTCTAGAC AAACAGGATT GGACAGATGG TTTTAA
 
Protein sequence
MGVDLADLVK DVKRELSFSE LKGKRVSIDG YNALYQFLAA IRQPDGTPLM DSQGRVTSHL 
SGLFYRTINI LEEGVIPIYV FDGKPPEQKS EELERRRKAK EEAERKLERA KSEGKIEELR
KYSQAILRLS NIMVEESKKL LRAMGIPIVQ APSEGEAEAA YLNKLGLSWA AASQDYDAIL
FGAKRLVRNL TITGKRKLPN KDVYVEIKPE LIETEILLKK LGITREQLID IGILIGTDYN
PDGIRGIGPE RALKIIKKYG KIEKAMEYGE ISKKDINFNI DEIRGLFLNP QVVKPEEALD
LNEPNGEDII NILVYEHNFS EERVKNGIER LTKAIKEAKG ASRQTGLDRW F
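
Both sequences above are displayed in blocks of 10 characters, 60 per line, so the spacing must be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, using only the Python standard library. The function names strip_layout and gc_fraction are hypothetical helpers, not part of any database tool, and the example uses only the first line of the gene sequence, so its result is not expected to match the GC content listed for the whole gene.

```python
def strip_layout(block: str) -> str:
    """Remove the spaces and line breaks used only for display grouping."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed (blocks of 10).
first_line = "ATAGGAGTAG ATTTAGCAGA TTTAGTAAAA GATGTAAAAA GAGAGCTTTC GTTTTCTGAA"

seq = strip_layout(first_line)
gc_count = seq.count("G") + seq.count("C")
print(len(seq), gc_count, round(gc_fraction(seq), 2))  # prints: 60 18 0.3
```

Passing the full gene sequence through strip_layout and gc_fraction in the same way gives a value that can be compared with the GC content field.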