Gene Ssol_0200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0200 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp172710 
End bp174044 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content35% 
IMG OID 
Productphosphomethylpyrimidine kinase 
Protein accessionACX90496 
Protein GI261600893 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.619111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAC CTGTTGGTAT GAGCATAGCT GGATTGGATA CTGGAAATGG GGCTGGTGGA 
GAGACTGACT TAAGAGTTTT TGAAGTATTA GGAATCCACG GAGTTTTTGC TATCACTGCG
ATAACAGCTC AAAGTACAAA AGGCATAAAA GATATCACTG TTGTTAATCC TGAGTTTCTT
AAAAAGCAGA TAGAAACTTT ACTCGAGGAT TTCAAAGTGG AAGCTGTAAA GATAGGAATG
ATATATACTA AAGAACAGTT TCAGGTTGTA AATGAATTAT TAAATGATTA CTTTTTAGTA
GCAGATCCGG TACTTTATGC GAAAGATGGA ACTCCGCTGA TTAGAGACAT TGAAGAATAT
AAGAAGATAA TTTTGCCAAA AGTTAAAGTA ATAACGCCAA ATATAATAGA AGCATCTGCA
ATTAGCGGTG TCAAAATAGA AAAGGAGAGC GATGTAGTAA TTGTATGTAA GAAATTAAGG
GATAGTTATA ATATTCCCTA CGTGATAATC AAGGGAGGCC ATAGTAAAGG AGATTATAGT
TTCGATTATA TGTGTAATGA TGAAGGACTA TACAAGATAG GATATAAAAG GTTGCAGGCA
AAGGATACGC ATGGTACTGG AAGCGTATTT GCAACTGCAC TTACTGCAGA ATACATAAAG
ATAAGGGATT TAAAATTGGC CTTTAGGAAG GCTAGAGATT TCGTTCAAAC CTCAATAGAA
TATGGACTTA ACATAGGAAA AGGTATAGGA CCAGTTAACG TAAGTGTTGA AATTATGAAG
AAGTCTATGA AATATGAAGC AGTTGAGGAA ATGAGGAGAT TTGCGGATTT TGCCGAAAAT
AACGATAGAT TCTGGATTTT AATCCCTGAA GTACAATCAA ATCTAGCACA TAGTATAAAA
CCAGAATACG TTAGGGATCT AAATGATATT GCTACATTTA GGGGTAGAAT AATAAGGAGA
TGGGATAAAA AGGTAATTGT TGGACATCCA GTGGTGTTTG GAAACCCTAC CCATACAGCT
CGAATGCTGT TGTCGCTCAT TCTCAAGGGA AAGGATAGTA CTTGCTTAAT GAATATTAGA
TATGACGATA AGATAGTTGA GAGTTTTAAG AGAATTGGGT ATGAAACAAT TGAGATTAAT
AGGGAACTAG AACCTGCCCA TGGAGAAGGA AAGACAATGC AATGGATTAT TGAGTATGTA
AGTAGCGAAT ATGGTGGCAT ACCGAATGTA ATTTATGATA AGGGAACTAA GGGAAAGGAG
GCAATGATTA GGTTTTGGAC AAAAAACATG GAAGAGATGA TAGAAGCTTT AGATAATTTA
TTAAAAATGC TGTAA
 
Protein sequence
MIKPVGMSIA GLDTGNGAGG ETDLRVFEVL GIHGVFAITA ITAQSTKGIK DITVVNPEFL 
KKQIETLLED FKVEAVKIGM IYTKEQFQVV NELLNDYFLV ADPVLYAKDG TPLIRDIEEY
KKIILPKVKV ITPNIIEASA ISGVKIEKES DVVIVCKKLR DSYNIPYVII KGGHSKGDYS
FDYMCNDEGL YKIGYKRLQA KDTHGTGSVF ATALTAEYIK IRDLKLAFRK ARDFVQTSIE
YGLNIGKGIG PVNVSVEIMK KSMKYEAVEE MRRFADFAEN NDRFWILIPE VQSNLAHSIK
PEYVRDLNDI ATFRGRIIRR WDKKVIVGHP VVFGNPTHTA RMLLSLILKG KDSTCLMNIR
YDDKIVESFK RIGYETIEIN RELEPAHGEG KTMQWIIEYV SSEYGGIPNV IYDKGTKGKE
AMIRFWTKNM EEMIEALDNL LKML