Gene Ssol_1936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1936 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1719822 
End bp1721477 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content34% 
IMG OID 
Producttype III restriction protein res subunit 
Protein accessionACX92147 
Protein GI261602544 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0298009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTCGA GGACTTTCTA TATCAAACAA TGGTTAGATG AGGATGACTT CAAGAGGTTA 
CTATTATTCT CGAGATATTT AGGAAGAGAT AGTAGTGGAT CTCAGTTTAT GATAGATTTG
GAAAGAGCTA AAAGAAATGG AGTAAAACCG AATGAGATAG TAGAAATCTT AGAGGATTAT
GACGTGCGTC TTTCTCAAGA AGATATTGCA ACATTAAGGG AGAAACTCCT TGATTGTTCC
TTTGATATAG AGGCTGGTAA AATAATCATG AAACCATATA CTTATCTTGC TGACGTTCTA
GAGAAAATTA ATGGTAAAAA TGAGAATTCA GAGCACAGAT TCAAAATCAG ATATGATAAA
CAAAATAGAA GATTTATAGT GCGTCCAATG GATTATTTTA ATCTATTGCA AAAATTGAGA
GAAAATGGAT TAGAGGTCAA GGAATTAAAC CTTTCATTCA AGGAGTTTGA ATTCGAATTT
AATGGGCAAT TAAGGGAATA TCAAAGAGAA GCTATAGAAA ATTGGATTAA GAATGGAAAC
AAGGGGGTTA TAGCACTACC GACAGGCGCT GGAAAGACAG TAGTTGGAAT AAAAGGTATA
GATATAATCA GAAAGCCCAC TCTCATTGTA ACCTTTACCA AAGAGCAGAT GTTACAATGG
AAAGAAGCTA TACTAAAGTT TACCTCAAAA AGACCTGACA TTGGTTTATA TTACTCAGAA
GAAAAGAGAA TAAGACCAAT AACCATAACT ACCTATCATA CAGCATATAG ACATCTTCCA
CAACTATTTG ATAAATTTTA CCTGCTAATA GTTGATGAAG TTCATCACCT ACCTGCAGAT
AAGTTTAAAT TAATAGCAGA AGGTTTAATA GCTCCTTATA GGATGGGTCT TTCAGCGACT
CCTCATAGAG AAGATAATAA ACACAATGAG TTATTTAGCC TAATAGGGGG AATAATATAT
TATAAGTCAG TTACCGAACT AGCCAAGTTA GGTTATCTCG CCTCCTATGA GATAATTCAG
AAAAAAGTAA GGCTAACCTT AGAAGAGAGG AAGCGATACA ATGAGCTATT GAATAGGTTT
AAGGCATTAT CTAAGGGCAG GAAGGTAAGT GAATTGATAG AGTTAGTAAA AAAAGGTGAT
GAAAGTGCTA TTGAAGCCAT GAGAGTTTAT AATGAAATGA GAAAAATAGT GAACTTCGCT
TCAGAGAAAA TGAAGGCTTT AGACGAAATA CTTAAGTCTG AAAAGGGAAA GATTCTCATA
TTTACACAGT ATATTGATCA AGCTGAAGAA ATTGCTAAGA GATACAATTC TTTACTTTTA
ACCGGCAAGA TGTCTAAAGA GGAAAGGAAG AGAGTTTTAG TGACGTTTAA AACTATGAAT
GCAGGGGTTC TTGTTTTAAC CACTGTTGGA GATGAGGGAA TAGACATTCC GGATGCTAAT
GTAGGGATTA TTGTGACTGG GACCAGTTCT AGAAGGCAAT TTATTCAAAG ATTAGGTAGG
ATTATGAGAC CCTATAATGG CAAGCAAGCA AAACTTTATG AAATAGTTGT TAGTGGTACT
CCAGAGGAAT ATCAAGCAAA AAAGAGAAAA GAAACTGATA TTCTCTCATT TGAGGGTATG
TCTTATTCAT CCTCCGAAAA TATCGACAGA GATTAA
 
Protein sequence
MSSRTFYIKQ WLDEDDFKRL LLFSRYLGRD SSGSQFMIDL ERAKRNGVKP NEIVEILEDY 
DVRLSQEDIA TLREKLLDCS FDIEAGKIIM KPYTYLADVL EKINGKNENS EHRFKIRYDK
QNRRFIVRPM DYFNLLQKLR ENGLEVKELN LSFKEFEFEF NGQLREYQRE AIENWIKNGN
KGVIALPTGA GKTVVGIKGI DIIRKPTLIV TFTKEQMLQW KEAILKFTSK RPDIGLYYSE
EKRIRPITIT TYHTAYRHLP QLFDKFYLLI VDEVHHLPAD KFKLIAEGLI APYRMGLSAT
PHREDNKHNE LFSLIGGIIY YKSVTELAKL GYLASYEIIQ KKVRLTLEER KRYNELLNRF
KALSKGRKVS ELIELVKKGD ESAIEAMRVY NEMRKIVNFA SEKMKALDEI LKSEKGKILI
FTQYIDQAEE IAKRYNSLLL TGKMSKEERK RVLVTFKTMN AGVLVLTTVG DEGIDIPDAN
VGIIVTGTSS RRQFIQRLGR IMRPYNGKQA KLYEIVVSGT PEEYQAKKRK ETDILSFEGM
SYSSSENIDR D