Gene Ssol_1961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1961 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1745404 
End bp1746762 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content36% 
IMG OID 
Productpyruvate kinase 
Protein accessionACX92172 
Protein GI261602569 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGA CTAAAATAGT TGCTACTTTA GGTCCTTCCT CAGAGGAAAA AGTAAAAGAA 
CTGGCAGAAT ACGTTGATGT TTTTAGAATA AATTTTGCAC ATGGAGACGA AACATCTCAT
AGGAAGTATT TTGATCTTAT TAGAACATAT GCACCGGAAT CTAGTATTAT AGTAGATTTG
CCAGGGCCTA AGTTGAGACT AGGAGAACTC AAAGAACCAA TAGAGGTGAA GAAAGGAGAT
AAGATAGTTT TCTCTCAAAA AGATGGAATT CCAGTTGATG ATGAGTTATT TTATTCGGCT
GTAAAAGAAA ACTCGGATAT CTTAATTGCA GACGGAACAA TACGTGTGAG GGTTAAGTCA
AAAGCTAAGG ATAGAGTAGA GGGAACCGTA ATAGAGGGTG GAATTTTATT ATCGAGAAAA
GGAATAAATA TTCCTAATGT CAATCTAAAA TCTGGGATAA CGGACAACGA TTTAAAACTT
TTAAAAAGAG CTTTAGATCT GGGAGCAGAT TATATAGGAC TCTCTTTTGT AATAAGTGAG
AATGATGTAA AGAAGGTAAA GGAATTTATA GGTGATGAAG CTTGGGTTAT CGCGAAGATA
GAAAAAAGTG AGGCATTAAA GAACTTAACC AATATCGTTA ATGAATCGGA TGGAATAATG
GTAGCCAGAG GCGATTTGGG GGTTGAGACT GGCTTAGAAA ATCTGCCTTT AATTCAAAGG
AGAATAGTAA GGACTTCAAG AGTATTTGGC AAACCCGTCA TTTTAGCAAC TCAAGTATTA
ACTTCGATGA TAAACAGCCC TATACCTACC AGAGCTGAGA TTATAGATAT TTCTAACTCG
ATTATGCAGG GAGTGGACTC TATAATGTTA AGCGATGAAA CAGCCATAGG CAATTATCCA
GTTGAAAGCG TAAGAACTCT TCATAATATC ATAAGTAATG TAGAAAAGAG TGTAAAACAT
AGACCAATCG GACCACTAAA TAGTGAGAGT GATGCGATAG CTCTAGCTGC TGTAAATGCA
AGTAAAGTAT CTAAGGCAGA TGTAATAGTA GTGTATAGTA GATCAGGTAA TTCAATATTG
CGCGTATCGA GACTGAGACC TGAACGTAAC ATAATAGGAG TCTCTCCTGA TCCTAGACTA
GCTAAAAAGT TTAAGCTTTG TTATGGTGTA ATACCCATTA GTATAAACAA AAAGATGCAG
TCCATAGACG AGATAATAGA CGTCTCAGCC AAGCTAATGC AGGAAAAAAT AAAGGACTTA
AAATTTAAAA AAATCGTTAT AGTAGGAGGG GATCCTAAAC AAGAAGCGGG GAAGACTAAC
TTCGTTATAG TTAAGACACT AGAACAACAA AAGAAATGA
 
Protein sequence
MRKTKIVATL GPSSEEKVKE LAEYVDVFRI NFAHGDETSH RKYFDLIRTY APESSIIVDL 
PGPKLRLGEL KEPIEVKKGD KIVFSQKDGI PVDDELFYSA VKENSDILIA DGTIRVRVKS
KAKDRVEGTV IEGGILLSRK GINIPNVNLK SGITDNDLKL LKRALDLGAD YIGLSFVISE
NDVKKVKEFI GDEAWVIAKI EKSEALKNLT NIVNESDGIM VARGDLGVET GLENLPLIQR
RIVRTSRVFG KPVILATQVL TSMINSPIPT RAEIIDISNS IMQGVDSIML SDETAIGNYP
VESVRTLHNI ISNVEKSVKH RPIGPLNSES DAIALAAVNA SKVSKADVIV VYSRSGNSIL
RVSRLRPERN IIGVSPDPRL AKKFKLCYGV IPISINKKMQ SIDEIIDVSA KLMQEKIKDL
KFKKIVIVGG DPKQEAGKTN FVIVKTLEQQ KK