Gene Ssol_1204 details


Gene Information

Locus tag: Ssol_1204
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1120477
End bp: 1123851
Gene Length: 3375 bp
Protein Length: 1124 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: DNA-directed RNA polymerase subunit B
Protein accession: ACX91442
Protein GI: 261601839
COG category:
COG ID:
TIGRFAM ID:
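
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the record above.
START_BP = 1120477
END_BP = 1123851
REPORTED_GENE_LENGTH = 3375      # bp
REPORTED_PROTEIN_LENGTH = 1124   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)                      # True
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)   # True
```

Under these assumptions, 1123851 - 1120477 + 1 gives 3375 bp, and 3375 / 3 - 1 gives 1124 aa, matching the reported values.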


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.074104
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCATCTA ATTTAACCAT TGATGAAAGA TGGAGAGTCA TCGAAGCGTA CTTTAAATCC 
AAAGGCTTAG TTAGACAGCA TCTGGATTCA TACAATGACT TTGTTAGAAA TAAGCTTCAA
GAAATTATTG ACGAACAAGG AGAAATACCC ACAGAAATAC CAGGTTTGAA AGTGAGATTA
GGCAAGATAA GGATAGGAAA ACCAAGAGTC CGGGAATCGG ATAGAGGAGA AAGAGAAATC
AGTCCCATGG AAGCTCGATT AAGGAACTTA ACTTATGCCG CTCCACTATG GCTTACAATG
ATCCCAGTTG AAAATAATAT TGAAGCAGAA CCAGAAGAGG TTTATATAGG CGACCTGCCT
ATAATGCTTA AATCAGCCAT AGACCCAATA TCACAGTATA CTCTAGATAA GCTAATTGAG
ATAGGTGAAG ATCCTAAAGA CCCAGGAGGG TATTTCATAG TAAATGGATC TGAAAGAGTT
ATAGTAACTC AAGAAGATTT GGCTCCAAAT AGAGTTCTTG TAGATACTGG AAAGACAGGA
TCAAACATTA CGCATACAGC GAAAATTATC TCGAGTACTG CGGGCTATAG AGTGCCTGTG
ACAATAGAAA GATTAAAAGA TGGTACATTT CATGTATCTT TTCCAGCAGT TCCCGGTAAG
ATTCCGTTTG TTATTCTAAT GAGGGCACTG GGTATATTAA CCGATAGAGA TATAGTTTAT
GCGGTATCAT TAGATCCTGA GGTTCAGAAT GAATTATTTC CTTCTCTAGA GCAAGCAAGT
TCGATAGCCA ACGTTGATGA TGCACTAGAT TTTATAGGTA GTAGAGTAGC TATAGGCCAA
AAGAGAGAAA ACAGAATAGA AAAAGCACAG CAGATAATTG ATAAATATTT CCTACCCCAT
TTAGGCACTT CAGCAGAAGA TAGAAAGAAG AAAGCGTATT ATTTAGCTTA CGCTATATCA
AAAGTAATTG AATTATATCT TGGTAGAAGG GAACCCGACG ATAAAGATCA TTACGCTAAC
AAAAGATTAA GATTAGCTGG AGATTTGTTT GCATCATTAT TTAGAGTAGC TTTCAAAGCT
TTCGTAAAAG ATTTAACATA TCAATTAGAG AAATCTAAGG TAAGAGGTAG GAAACTCGCT
TTAAAGGCAT TAGTTAGACC AGATATTGTT ACAGAAAGAA TAAGGCATGC ATTAGCTACT
GGGAACTGGG TTGGTGGAAG AACTGGAGTT AGCCAATTAC TTGATAGGAC CAACTGGCTT
TCTATGTTAA GCCATCTGAG GAGAGTAATA TCCTCACTAG CAAGGGGTCA ACCTAATTTC
GAAGCCAGAG ATTTACATGG TACGCAATGG GGTAGGATGT GTCCCTTTGA AACACCAGAA
GGTCCAAATA GTGGACTAGT TAAGAATCTA GCGTTAATGG CTCAAATTGC TGTAGGAATA
AATGAGAGGA TTGTAGAAAA AACACTTTAT GAAATGGGAG TAGTTCCAGT GGAGGAGGTC
ATAAGAAGAG TAACGGAAGG CGGAGAGGAT CAGAATGAGT ATCTGAAATG GTCTAAGGTT
ATACTCAATG GAAGATTAAT AGGCTATTAT CAAGATGGTG GAGAATTAGC TAATAAGATA
AGAGAAAGAA GGAGAAAAGG AGAAATTAGT GATGAAGTAA ACGTAGGCCA TATAGTGACA
GATTTTATTA ATGAGGTTCA TGTTAATTGT GATTCTGGAA GAGTTAGAAG ACCACTTATA
ATTGTTTCTA ACGGTAACCC GTTGGTAACT ATTGAAGACA TTGAAAAGTT AGAATCAGGT
GCTATTACAT TTGACGATCT TGTTAGACAA GGAAAGATAG AGTATCTAGA TGCAGAAGAA
GAGGAGAACG CTTATGTTGC TTTAGAACCT AATGACTTAA CTCCAGATCA TACTCATTTA
GAAATATGGT CTCCAGCTAT TTTAGGCATA ACAGCGTCTA TAATACCATA TCCAGAGCAT
AATCAATCAC CTAGAAATAC ATACCAATCA GCTATGGCGA AACAAGCTCT AGGTCTATAT
GCAGCAAATT ATCAATTACG TACGGACACG AGAGCACATT TACTTCATTA TCCACAAAGA
CCTCTAGTTC AAACTAGGGC ATTAGATATT ATAGGATATA CAAATAGGCC GGCCGGAAAT
AATGCTATAT TAGCCGTAAT GTCATTTACT GGCTACAATA TGGAAGATTC AATAATTATG
AATAGATCCT CCGTGGAGAG GGGAATGTAT AGATCTACAT TTTTTAGGCT TTACTCAACG
GAAGAGGTAA AATACCCTGG AGGTCAAGAA GATAAAATAG TAATGCCAGA AGCTGGTGTT
AGAGGATATA AGGGCAAAGA ATATTACAGA CTTCTAGAGG ATAACGGAGT AGTCTCTCCA
GAGGTCGAAG TGAAGGGAGG AGATGTTTTA ATAGGTAAAG TTAGCCCTCC AAGATTCTTA
CAAGAATTTA AAGAATTATC TCCAGAGCAA GCTAAGCGTG ACACCTCAAT AGTTACAAGA
CATGGTGAAA TGGGTATAGT GGATTTAGTT CTAATTACCG AAACTGCTGA GGGTAATAAG
CTAGTTAAGG TAAGAGTAAG AGATCTTAGG ATACCAACAA TTGGCGATAA ATTCGCCAGT
AGACATGGAC AAAAAGGCGT TATAGGTATG CTCATACCAC AAGTTGACAT GCCATATACC
GTTAAAGGCG TTGTGCCAGA TATAATATTA AATCCTCATG CATTGCCATC TAGAATGACG
TTAGGACAAA TTATGGAAGG AATAGCTGGT AAATATGCAG CATTATCCGG AAATATTGTA
GATGCTACAC CTTTCTACAA GACACCTATA GAACAATTAC AAAATGAGAT TTTGAGATAC
GGTTATCTAC CAGATGCTAC TGAAGTAGTG TATGATGGAC GTACTGGACA GAAAATTAAA
TCTAGAATAT ACTTTGGAGT AGTCTATTAT CAGAAATTGC ATCACATGGT AGCAGATAAG
CTTCATGCTA GAGCTAGGGG TCCAGTCCAA ATTTTAACTA GACAACCAAC AGAAGGAAGA
GCTAGAGAAG GTGGTTTAAG ATTTGGAGAA ATGGAGAGAG ATTGCTTAAT TGGTTTTGGT
ACTGCAATGC TTCTTAAAGA CAGGTTATTG GATAACTCTG ATAGGACAAT GATTTACGTT
TGTGATCAGT GTGGTTATAT AGGCTGGTAC GATAAGAATA AGAATAAATA TGTATGCCCA
ATACATGGTG ATAAGAGTAA CTTGTTCCCA GTTACTGTAT CTTACGCATT TAAGCTTTTA
ATTCAAGAAC TAATGAGTAT GATTATCTCA CCTAGGTTAG TTTTGGAGGA TAAAGTTGGA
TTAAGTGGAG GTTAA
 
Protein sequence
MASNLTIDER WRVIEAYFKS KGLVRQHLDS YNDFVRNKLQ EIIDEQGEIP TEIPGLKVRL 
GKIRIGKPRV RESDRGEREI SPMEARLRNL TYAAPLWLTM IPVENNIEAE PEEVYIGDLP
IMLKSAIDPI SQYTLDKLIE IGEDPKDPGG YFIVNGSERV IVTQEDLAPN RVLVDTGKTG
SNITHTAKII SSTAGYRVPV TIERLKDGTF HVSFPAVPGK IPFVILMRAL GILTDRDIVY
AVSLDPEVQN ELFPSLEQAS SIANVDDALD FIGSRVAIGQ KRENRIEKAQ QIIDKYFLPH
LGTSAEDRKK KAYYLAYAIS KVIELYLGRR EPDDKDHYAN KRLRLAGDLF ASLFRVAFKA
FVKDLTYQLE KSKVRGRKLA LKALVRPDIV TERIRHALAT GNWVGGRTGV SQLLDRTNWL
SMLSHLRRVI SSLARGQPNF EARDLHGTQW GRMCPFETPE GPNSGLVKNL ALMAQIAVGI
NERIVEKTLY EMGVVPVEEV IRRVTEGGED QNEYLKWSKV ILNGRLIGYY QDGGELANKI
RERRRKGEIS DEVNVGHIVT DFINEVHVNC DSGRVRRPLI IVSNGNPLVT IEDIEKLESG
AITFDDLVRQ GKIEYLDAEE EENAYVALEP NDLTPDHTHL EIWSPAILGI TASIIPYPEH
NQSPRNTYQS AMAKQALGLY AANYQLRTDT RAHLLHYPQR PLVQTRALDI IGYTNRPAGN
NAILAVMSFT GYNMEDSIIM NRSSVERGMY RSTFFRLYST EEVKYPGGQE DKIVMPEAGV
RGYKGKEYYR LLEDNGVVSP EVEVKGGDVL IGKVSPPRFL QEFKELSPEQ AKRDTSIVTR
HGEMGIVDLV LITETAEGNK LVKVRVRDLR IPTIGDKFAS RHGQKGVIGM LIPQVDMPYT
VKGVVPDIIL NPHALPSRMT LGQIMEGIAG KYAALSGNIV DATPFYKTPI EQLQNEILRY
GYLPDATEVV YDGRTGQKIK SRIYFGVVYY QKLHHMVADK LHARARGPVQ ILTRQPTEGR
AREGGLRFGE MERDCLIGFG TAMLLKDRLL DNSDRTMIYV CDQCGYIGWY DKNKNKYVCP
IHGDKSNLFP VTVSYAFKLL IQELMSMIIS PRLVLEDKVG LSGG
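
The gene and protein sequences above are related through translation table 11, in which alternative start codons such as TTG are read as methionine when they initiate translation. The following is a minimal sketch of that relationship, not the database's own pipeline: the helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, the codon table is built from the standard NCBI amino-acid string for table 11, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS (or an in-frame fragment) up to the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon always yields Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon: end of the protein
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
FIRST_LINE = "TTGGCATCTA ATTTAACCAT TGATGAAAGA TGGAGAGTCA TCGAAGCGTA CTTTAAATCC"
LAST_LINE = "TTAAGTGGAG GTTAA"

if __name__ == "__main__":
    print(translate_cds(FIRST_LINE))  # MASNLTIDERWRVIEAYFKS
    print(translate_cds(LAST_LINE))   # LSGG
```

The two printed fragments match the first 20 residues and the last 4 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content of 37%.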