Gene Information |
Locus tag | Ssol_1204 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 1120477 |
End bp | 1123851 |
Gene Length | 3375 bp |
Protein Length | 1124 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | DNA-directed RNA polymerase subunit B |
Protein accession | ACX91442 |
Protein GI | 261601839 |
COG category | |
COG ID | |
TIGRFAM ID | |
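The coordinate and length fields above are mutually consistent, and the relationship can be sketched in a few lines. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 1120477, End bp 1123851.
length = gene_length_bp(1120477, 1123851)
print(length)                     # 3375
print(protein_length_aa(length))  # 1124
```

The strand field does not affect either length; it only determines which strand of the replicon is read.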
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.074104 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCATCTA ATTTAACCAT TGATGAAAGA TGGAGAGTCA TCGAAGCGTA CTTTAAATCC AAAGGCTTAG TTAGACAGCA TCTGGATTCA TACAATGACT TTGTTAGAAA TAAGCTTCAA GAAATTATTG ACGAACAAGG AGAAATACCC ACAGAAATAC CAGGTTTGAA AGTGAGATTA GGCAAGATAA GGATAGGAAA ACCAAGAGTC CGGGAATCGG ATAGAGGAGA AAGAGAAATC AGTCCCATGG AAGCTCGATT AAGGAACTTA ACTTATGCCG CTCCACTATG GCTTACAATG ATCCCAGTTG AAAATAATAT TGAAGCAGAA CCAGAAGAGG TTTATATAGG CGACCTGCCT ATAATGCTTA AATCAGCCAT AGACCCAATA TCACAGTATA CTCTAGATAA GCTAATTGAG ATAGGTGAAG ATCCTAAAGA CCCAGGAGGG TATTTCATAG TAAATGGATC TGAAAGAGTT ATAGTAACTC AAGAAGATTT GGCTCCAAAT AGAGTTCTTG TAGATACTGG AAAGACAGGA TCAAACATTA CGCATACAGC GAAAATTATC TCGAGTACTG CGGGCTATAG AGTGCCTGTG ACAATAGAAA GATTAAAAGA TGGTACATTT CATGTATCTT TTCCAGCAGT TCCCGGTAAG ATTCCGTTTG TTATTCTAAT GAGGGCACTG GGTATATTAA CCGATAGAGA TATAGTTTAT GCGGTATCAT TAGATCCTGA GGTTCAGAAT GAATTATTTC CTTCTCTAGA GCAAGCAAGT TCGATAGCCA ACGTTGATGA TGCACTAGAT TTTATAGGTA GTAGAGTAGC TATAGGCCAA AAGAGAGAAA ACAGAATAGA AAAAGCACAG CAGATAATTG ATAAATATTT CCTACCCCAT TTAGGCACTT CAGCAGAAGA TAGAAAGAAG AAAGCGTATT ATTTAGCTTA CGCTATATCA AAAGTAATTG AATTATATCT TGGTAGAAGG GAACCCGACG ATAAAGATCA TTACGCTAAC AAAAGATTAA GATTAGCTGG AGATTTGTTT GCATCATTAT TTAGAGTAGC TTTCAAAGCT TTCGTAAAAG ATTTAACATA TCAATTAGAG AAATCTAAGG TAAGAGGTAG GAAACTCGCT TTAAAGGCAT TAGTTAGACC AGATATTGTT ACAGAAAGAA TAAGGCATGC ATTAGCTACT GGGAACTGGG TTGGTGGAAG AACTGGAGTT AGCCAATTAC TTGATAGGAC CAACTGGCTT TCTATGTTAA GCCATCTGAG GAGAGTAATA TCCTCACTAG CAAGGGGTCA ACCTAATTTC GAAGCCAGAG ATTTACATGG TACGCAATGG GGTAGGATGT GTCCCTTTGA AACACCAGAA GGTCCAAATA GTGGACTAGT TAAGAATCTA GCGTTAATGG CTCAAATTGC TGTAGGAATA AATGAGAGGA TTGTAGAAAA AACACTTTAT GAAATGGGAG TAGTTCCAGT GGAGGAGGTC ATAAGAAGAG TAACGGAAGG CGGAGAGGAT CAGAATGAGT ATCTGAAATG GTCTAAGGTT ATACTCAATG GAAGATTAAT AGGCTATTAT CAAGATGGTG GAGAATTAGC TAATAAGATA AGAGAAAGAA GGAGAAAAGG AGAAATTAGT GATGAAGTAA ACGTAGGCCA TATAGTGACA GATTTTATTA ATGAGGTTCA TGTTAATTGT GATTCTGGAA GAGTTAGAAG ACCACTTATA ATTGTTTCTA ACGGTAACCC GTTGGTAACT ATTGAAGACA TTGAAAAGTT AGAATCAGGT 
GCTATTACAT TTGACGATCT TGTTAGACAA GGAAAGATAG AGTATCTAGA TGCAGAAGAA GAGGAGAACG CTTATGTTGC TTTAGAACCT AATGACTTAA CTCCAGATCA TACTCATTTA GAAATATGGT CTCCAGCTAT TTTAGGCATA ACAGCGTCTA TAATACCATA TCCAGAGCAT AATCAATCAC CTAGAAATAC ATACCAATCA GCTATGGCGA AACAAGCTCT AGGTCTATAT GCAGCAAATT ATCAATTACG TACGGACACG AGAGCACATT TACTTCATTA TCCACAAAGA CCTCTAGTTC AAACTAGGGC ATTAGATATT ATAGGATATA CAAATAGGCC GGCCGGAAAT AATGCTATAT TAGCCGTAAT GTCATTTACT GGCTACAATA TGGAAGATTC AATAATTATG AATAGATCCT CCGTGGAGAG GGGAATGTAT AGATCTACAT TTTTTAGGCT TTACTCAACG GAAGAGGTAA AATACCCTGG AGGTCAAGAA GATAAAATAG TAATGCCAGA AGCTGGTGTT AGAGGATATA AGGGCAAAGA ATATTACAGA CTTCTAGAGG ATAACGGAGT AGTCTCTCCA GAGGTCGAAG TGAAGGGAGG AGATGTTTTA ATAGGTAAAG TTAGCCCTCC AAGATTCTTA CAAGAATTTA AAGAATTATC TCCAGAGCAA GCTAAGCGTG ACACCTCAAT AGTTACAAGA CATGGTGAAA TGGGTATAGT GGATTTAGTT CTAATTACCG AAACTGCTGA GGGTAATAAG CTAGTTAAGG TAAGAGTAAG AGATCTTAGG ATACCAACAA TTGGCGATAA ATTCGCCAGT AGACATGGAC AAAAAGGCGT TATAGGTATG CTCATACCAC AAGTTGACAT GCCATATACC GTTAAAGGCG TTGTGCCAGA TATAATATTA AATCCTCATG CATTGCCATC TAGAATGACG TTAGGACAAA TTATGGAAGG AATAGCTGGT AAATATGCAG CATTATCCGG AAATATTGTA GATGCTACAC CTTTCTACAA GACACCTATA GAACAATTAC AAAATGAGAT TTTGAGATAC GGTTATCTAC CAGATGCTAC TGAAGTAGTG TATGATGGAC GTACTGGACA GAAAATTAAA TCTAGAATAT ACTTTGGAGT AGTCTATTAT CAGAAATTGC ATCACATGGT AGCAGATAAG CTTCATGCTA GAGCTAGGGG TCCAGTCCAA ATTTTAACTA GACAACCAAC AGAAGGAAGA GCTAGAGAAG GTGGTTTAAG ATTTGGAGAA ATGGAGAGAG ATTGCTTAAT TGGTTTTGGT ACTGCAATGC TTCTTAAAGA CAGGTTATTG GATAACTCTG ATAGGACAAT GATTTACGTT TGTGATCAGT GTGGTTATAT AGGCTGGTAC GATAAGAATA AGAATAAATA TGTATGCCCA ATACATGGTG ATAAGAGTAA CTTGTTCCCA GTTACTGTAT CTTACGCATT TAAGCTTTTA ATTCAAGAAC TAATGAGTAT GATTATCTCA CCTAGGTTAG TTTTGGAGGA TAAAGTTGGA TTAAGTGGAG GTTAA
|
Protein sequence | MASNLTIDER WRVIEAYFKS KGLVRQHLDS YNDFVRNKLQ EIIDEQGEIP TEIPGLKVRL GKIRIGKPRV RESDRGEREI SPMEARLRNL TYAAPLWLTM IPVENNIEAE PEEVYIGDLP IMLKSAIDPI SQYTLDKLIE IGEDPKDPGG YFIVNGSERV IVTQEDLAPN RVLVDTGKTG SNITHTAKII SSTAGYRVPV TIERLKDGTF HVSFPAVPGK IPFVILMRAL GILTDRDIVY AVSLDPEVQN ELFPSLEQAS SIANVDDALD FIGSRVAIGQ KRENRIEKAQ QIIDKYFLPH LGTSAEDRKK KAYYLAYAIS KVIELYLGRR EPDDKDHYAN KRLRLAGDLF ASLFRVAFKA FVKDLTYQLE KSKVRGRKLA LKALVRPDIV TERIRHALAT GNWVGGRTGV SQLLDRTNWL SMLSHLRRVI SSLARGQPNF EARDLHGTQW GRMCPFETPE GPNSGLVKNL ALMAQIAVGI NERIVEKTLY EMGVVPVEEV IRRVTEGGED QNEYLKWSKV ILNGRLIGYY QDGGELANKI RERRRKGEIS DEVNVGHIVT DFINEVHVNC DSGRVRRPLI IVSNGNPLVT IEDIEKLESG AITFDDLVRQ GKIEYLDAEE EENAYVALEP NDLTPDHTHL EIWSPAILGI TASIIPYPEH NQSPRNTYQS AMAKQALGLY AANYQLRTDT RAHLLHYPQR PLVQTRALDI IGYTNRPAGN NAILAVMSFT GYNMEDSIIM NRSSVERGMY RSTFFRLYST EEVKYPGGQE DKIVMPEAGV RGYKGKEYYR LLEDNGVVSP EVEVKGGDVL IGKVSPPRFL QEFKELSPEQ AKRDTSIVTR HGEMGIVDLV LITETAEGNK LVKVRVRDLR IPTIGDKFAS RHGQKGVIGM LIPQVDMPYT VKGVVPDIIL NPHALPSRMT LGQIMEGIAG KYAALSGNIV DATPFYKTPI EQLQNEILRY GYLPDATEVV YDGRTGQKIK SRIYFGVVYY QKLHHMVADK LHARARGPVQ ILTRQPTEGR AREGGLRFGE MERDCLIGFG TAMLLKDRLL DNSDRTMIYV CDQCGYIGWY DKNKNKYVCP IHGDKSNLFP VTVSYAFKLL IQELMSMIIS PRLVLEDKVG LSGG
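The sequences in this record are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The sketch below shows one way to strip the grouping and compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full 3375 bp.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown in the record.
prefix = clean_sequence("TTGGCATCTA ATTTAACCAT")
print(len(prefix))         # 20
print(gc_percent(prefix))  # 30.0
```

Applied to the complete gene sequence, the same function should give a value comparable to the GC content field of the record, assuming that field was computed over the gene itself.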