Gene Ssol_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0072 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp62341 
End bp63846 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content37% 
IMG OID 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionACX90373 
Protein GI261600770 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCGG TAAAAGAAAT GAGGGCTTTG GAAATAAATA GTTCTGCTTT AGGCGTATCT 
ACATTATTAC TCATGGAGAA TGCAGGTAGA TCCGTTAAAG ACGAAATAGT AAAGAGATTT
AACGTAAAGG ATAAGGTAGC ATATGTTTAT GTAGGACATG GTGGAAAAGG TGGTGACGGG
TTAGTAGCGG CAAGACATTT AGCCGATGAA GGTGCTAAGG TAACCGTAAT TTTATTGGGA
GAAAATAAAC ACGAGGATGC AATCCTTAAT CTTAATGTCA TAGAAGAGAT GGACTATTCA
ATAACGTTAG TTGAGATAAA GGATATGGAT GAACTAAAGC CAATCTCTGC TGATATTTTA
ATCGATGCCA TGTTAGGTAC GGGATTCTCT GGAAAGCCAA GAGAACCGTT TAGAAGTGCG
ATAAAAGCGT TCAACAATAG TAAAGGGTTC AAGGTCTCTA TAGACGTTCC CTCCGGGATA
AATGCTGATA CTGGTGAAGC ATATGAAGAC GAGTATGTTA AACCGGATCT GGTTGTCACC
TTTCACGATA TCAAACCTGG CTTATTAAAG TATAATTTCA ATACTGTGGT TACGAAAATA
GGTATTCCAG TAGAAGCCGA AATATATGTT GGGCCAGGGG ATTTAATAGT TAACGCGCGT
AGTAGACCTT ATTACTCTAA GAAGGGTGAT AGCGGAAGAG TACTAGTAAT TGGAGGAAGT
TACACTTTTA GTGGTGCTCC AACTCTAGCC GCTTTGGGTG CTTTGAGAGC TGGAGCTGAC
CTAGTTTATG TAGCATCACC AGAGGATACG GCTAGAATTA TAGCGGGATA CTCTCCAGAC
TTAATTACAA TAAAATTAAG GGGAAAGAAC ATTTCTCCAG ACAATTTTGA AGAATTGAAA
TTATGGATAG ATAGAGCTGA TGTGGTAGTT ATAGGTCCGG GAATGGGTCT AGCTGAGGAG
ACTATTGAGG CTTCTAAACT AATTGTGAAT TATCTTAAAG AGAAGAATAA GCTAGCTGTT
ATTGATGCTG ATGCACTTAA GGCAATAAGT GGGTTCGATT TGTATGAGAA TGCTGTAATA
ACACCTCATG CAGGCGAATT CAAAATATTC TTTGGAGAAG AACCAGATAA GAACATAAGA
GATAGAATAA GCCAAGTAAT TACTTATGCT AAGAAATGTA AATGTACAGT TCTACTTAAG
GGTTATGTTG ATATAATAAG TGATGGTAAA AGGTTTAAAT TAAATAAAAC TGGTAACCCA
GGTATGACTG TAGGTGGGAG CGGGGATACG TTGACTGGTA TAACAGCAAC ATTAATGGCT
CAAAAAATCG AACCATTTAT AGCCGCATAT TTAGGGGTTT TCATAAATAG CCTAGCTGGA
ACTTTAGCAT ATAATAGGCT CGGAGCTCAT TTGACACCTA CTGACATAAT AAATGAAATT
CCAAATGTGA TAAATAATCC CTTGGATTCT TTCAAAAGAA AATTGTATAA AAGAGTTTTA
AGTTGA
 
Protein sequence
MISVKEMRAL EINSSALGVS TLLLMENAGR SVKDEIVKRF NVKDKVAYVY VGHGGKGGDG 
LVAARHLADE GAKVTVILLG ENKHEDAILN LNVIEEMDYS ITLVEIKDMD ELKPISADIL
IDAMLGTGFS GKPREPFRSA IKAFNNSKGF KVSIDVPSGI NADTGEAYED EYVKPDLVVT
FHDIKPGLLK YNFNTVVTKI GIPVEAEIYV GPGDLIVNAR SRPYYSKKGD SGRVLVIGGS
YTFSGAPTLA ALGALRAGAD LVYVASPEDT ARIIAGYSPD LITIKLRGKN ISPDNFEELK
LWIDRADVVV IGPGMGLAEE TIEASKLIVN YLKEKNKLAV IDADALKAIS GFDLYENAVI
TPHAGEFKIF FGEEPDKNIR DRISQVITYA KKCKCTVLLK GYVDIISDGK RFKLNKTGNP
GMTVGGSGDT LTGITATLMA QKIEPFIAAY LGVFINSLAG TLAYNRLGAH LTPTDIINEI
PNVINNPLDS FKRKLYKRVL S