Gene Ssol_1202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1202 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1116654 
End bp1117832 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content35% 
IMG OID 
ProductDNA-directed RNA polymerase, subunit A'' 
Protein accessionACX91440 
Protein GI261601837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.551072 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGATG AGAAAGATAA GCCCTATTTA GAGGAGAAAG TGAAGCAAGC TTCCAATATC 
CTCCCTCAAA AAATTGTAGA CGATTTGAAA AATTTGATAT TAAACAAGGA AATAATAGTG
ACGAGAGATG AGATTGATAA AATCTTCGAT TTGGCTATTA AAGAGTATAG TGAAGGGTTA
ATAGCTCCAG GAGAGGCTAT TGGAATTGTA GCCGCACAGT CAGTAGGTGA GCCCGGTACC
CAAATGACAT TAAGGACTTT CCATTTTGCG GGTATAAGAG AGTTAAATGT AACTTTAGGA
CTTCCAAGGC TAATAGAAAT TGTGGATGCG AAGAAAGTTC CATCTACTCC AATGATGACT
ATTTATTTAA CTGATGAATA CAAGCGTGAT AGGGATAAAG CGTTAGAAGT CGCCAGAAAA
TTAGAATATA CGAAAATAGA AAATGTAGTG AGTTCAACTA GTATCGATAT AGCCTCAATG
TCCATTATTC TCCAACTCGA TAATGAAATG TTAAAAGATA AAGGCGTTAC TGTAGATGAT
GTTAAAAAAG CTATAGGTAG ATTGAAATTA GGAGATTTTA TGATAGAAGA ATCTGAGGAT
AGTACATTAA ACATAAATTT CGCTAATATA GATAGTATAG CTGCGCTATT TAAACTAAGG
GATAAGATAC TTAATACCAA AATAAAGGGA ATAAAGGGTA TAAAACGTGC TATAGTCCAG
AAAAAGGGCG ATGAGTATAT CATTTTAACC GATGGTTCAA ATTTATCTGG TGTTCTTAGT
GTAAAAGGAG TTGACGTAGC TAAAGTAGAG ACTAATAATA TCCGTGAGAT TGAGGAAGTA
TTTGGAATAG AAGCGGCAAG GGAAATTATA ATTAGGGAGA TTAGTAAAGT ATTAGCAGAA
CAAGGATTGG ATGTTGATAT AAGGCATATA TTATTAATTG CGGACGTGAT GACGAGAACG
GGTATTGTAA GGCAGATAGG TAGACATGGT GTAACTGGAG AGAAGAATAG TGTATTAGCA
AGAGCTGCAT TTGAAGTTAC TGTAAAACAT CTTTTAGATG CTGCGGCTAG AGGAGATGTA
GAAGAATTTA AAGGTGTAGT AGAAAACATT ATAATTGGTC ATCCAATTAA ACTAGGTACT
GGAATGGTTG AATTAACAAT GAGGCCGATA TTAAGGTGA
 
Protein sequence
MIDEKDKPYL EEKVKQASNI LPQKIVDDLK NLILNKEIIV TRDEIDKIFD LAIKEYSEGL 
IAPGEAIGIV AAQSVGEPGT QMTLRTFHFA GIRELNVTLG LPRLIEIVDA KKVPSTPMMT
IYLTDEYKRD RDKALEVARK LEYTKIENVV SSTSIDIASM SIILQLDNEM LKDKGVTVDD
VKKAIGRLKL GDFMIEESED STLNINFANI DSIAALFKLR DKILNTKIKG IKGIKRAIVQ
KKGDEYIILT DGSNLSGVLS VKGVDVAKVE TNNIREIEEV FGIEAAREII IREISKVLAE
QGLDVDIRHI LLIADVMTRT GIVRQIGRHG VTGEKNSVLA RAAFEVTVKH LLDAAARGDV
EEFKGVVENI IIGHPIKLGT GMVELTMRPI LR