Gene Information |
Locus tag | Ssol_1936 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1719822 |
End bp | 1721477 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | ACX92147 |
Protein GI | 261602544 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
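The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Ssol_1936.
length_bp = gene_length(1719822, 1721477)
print(length_bp)                  # 1656, matching "Gene Length"
print(protein_length(length_bp))  # 551, matching "Protein Length"
```

Under these assumptions the listed gene length (1656 bp) and protein length (551 aa) are consistent with the start and end coordinates.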
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0298009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTCGA GGACTTTCTA TATCAAACAA TGGTTAGATG AGGATGACTT CAAGAGGTTA CTATTATTCT CGAGATATTT AGGAAGAGAT AGTAGTGGAT CTCAGTTTAT GATAGATTTG GAAAGAGCTA AAAGAAATGG AGTAAAACCG AATGAGATAG TAGAAATCTT AGAGGATTAT GACGTGCGTC TTTCTCAAGA AGATATTGCA ACATTAAGGG AGAAACTCCT TGATTGTTCC TTTGATATAG AGGCTGGTAA AATAATCATG AAACCATATA CTTATCTTGC TGACGTTCTA GAGAAAATTA ATGGTAAAAA TGAGAATTCA GAGCACAGAT TCAAAATCAG ATATGATAAA CAAAATAGAA GATTTATAGT GCGTCCAATG GATTATTTTA ATCTATTGCA AAAATTGAGA GAAAATGGAT TAGAGGTCAA GGAATTAAAC CTTTCATTCA AGGAGTTTGA ATTCGAATTT AATGGGCAAT TAAGGGAATA TCAAAGAGAA GCTATAGAAA ATTGGATTAA GAATGGAAAC AAGGGGGTTA TAGCACTACC GACAGGCGCT GGAAAGACAG TAGTTGGAAT AAAAGGTATA GATATAATCA GAAAGCCCAC TCTCATTGTA ACCTTTACCA AAGAGCAGAT GTTACAATGG AAAGAAGCTA TACTAAAGTT TACCTCAAAA AGACCTGACA TTGGTTTATA TTACTCAGAA GAAAAGAGAA TAAGACCAAT AACCATAACT ACCTATCATA CAGCATATAG ACATCTTCCA CAACTATTTG ATAAATTTTA CCTGCTAATA GTTGATGAAG TTCATCACCT ACCTGCAGAT AAGTTTAAAT TAATAGCAGA AGGTTTAATA GCTCCTTATA GGATGGGTCT TTCAGCGACT CCTCATAGAG AAGATAATAA ACACAATGAG TTATTTAGCC TAATAGGGGG AATAATATAT TATAAGTCAG TTACCGAACT AGCCAAGTTA GGTTATCTCG CCTCCTATGA GATAATTCAG AAAAAAGTAA GGCTAACCTT AGAAGAGAGG AAGCGATACA ATGAGCTATT GAATAGGTTT AAGGCATTAT CTAAGGGCAG GAAGGTAAGT GAATTGATAG AGTTAGTAAA AAAAGGTGAT GAAAGTGCTA TTGAAGCCAT GAGAGTTTAT AATGAAATGA GAAAAATAGT GAACTTCGCT TCAGAGAAAA TGAAGGCTTT AGACGAAATA CTTAAGTCTG AAAAGGGAAA GATTCTCATA TTTACACAGT ATATTGATCA AGCTGAAGAA ATTGCTAAGA GATACAATTC TTTACTTTTA ACCGGCAAGA TGTCTAAAGA GGAAAGGAAG AGAGTTTTAG TGACGTTTAA AACTATGAAT GCAGGGGTTC TTGTTTTAAC CACTGTTGGA GATGAGGGAA TAGACATTCC GGATGCTAAT GTAGGGATTA TTGTGACTGG GACCAGTTCT AGAAGGCAAT TTATTCAAAG ATTAGGTAGG ATTATGAGAC CCTATAATGG CAAGCAAGCA AAACTTTATG AAATAGTTGT TAGTGGTACT CCAGAGGAAT ATCAAGCAAA AAAGAGAAAA GAAACTGATA TTCTCTCATT TGAGGGTATG TCTTATTCAT CCTCCGAAAA TATCGACAGA GATTAA
|
Protein sequence | MSSRTFYIKQ WLDEDDFKRL LLFSRYLGRD SSGSQFMIDL ERAKRNGVKP NEIVEILEDY DVRLSQEDIA TLREKLLDCS FDIEAGKIIM KPYTYLADVL EKINGKNENS EHRFKIRYDK QNRRFIVRPM DYFNLLQKLR ENGLEVKELN LSFKEFEFEF NGQLREYQRE AIENWIKNGN KGVIALPTGA GKTVVGIKGI DIIRKPTLIV TFTKEQMLQW KEAILKFTSK RPDIGLYYSE EKRIRPITIT TYHTAYRHLP QLFDKFYLLI VDEVHHLPAD KFKLIAEGLI APYRMGLSAT PHREDNKHNE LFSLIGGIIY YKSVTELAKL GYLASYEIIQ KKVRLTLEER KRYNELLNRF KALSKGRKVS ELIELVKKGD ESAIEAMRVY NEMRKIVNFA SEKMKALDEI LKSEKGKILI FTQYIDQAEE IAKRYNSLLL TGKMSKEERK RVLVTFKTMN AGVLVLTTVG DEGIDIPDAN VGIIVTGTSS RRQFIQRLGR IMRPYNGKQA KLYEIVVSGT PEEYQAKKRK ETDILSFEGM SYSSSENIDR D
|
| |
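The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two could be compared, the code below strips the spacing, translates the nucleotide sequence, and computes a GC fraction. It assumes the standard amino-acid assignments, which NCBI translation table 11 shares with the standard code; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Amino-acid assignments with codon positions ordered T, C, A, G.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGTCCTCGA GGACTTTCTA TATCAAACAA"))  # MSSRTFYIKQ
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same kind of check against the complete 551-residue protein and the listed GC content of 34%.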