Gene Ssol_0848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0848 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp792185 
End bp793861 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content43% 
IMG OID 
Productdihydroxy-acid dehydratase 
Protein accessionACX91096 
Protein GI261601493 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0103963 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCAA AATTAAATAG TCCCTCGAGA TATCATGGCA TATATAATGC TCCACACAGA 
GCGTTTCTAA GATCTGTAGG TTTAACGGAT GAAGAGATAG GTAAACCATT GGTTGCCATA
GCCACAGCAT GGAGTGAGGC AGGCCCTTGT AACTTCCACA CATTAGCTTT AGCAAGAGTA
GCTAAGGAGG GGACAAAGGA AGCCGGTTTA TCCCCATTAG CATTCCCAAC CATGGTAGTT
AACGATAACA TTGGAATGGG ATCTGAAGGA ATGAGGTATA GTTTAGTTAG CAGGGATTTA
ATAGCAGATA TGGTAGAGGC GCAATTTAAT GCTCACGCGT TTGATGGACT AGTGGGCATA
GGAGGGTGTG ATAAGACAAC GCCTGGTATA CTAATGGCAA TGGCTAGGTT AAACGTTCCC
TCAATTTACA TTTATGGCGG ATCAGCAGAG CCAGGGTACT TTATGGGTAA AAGACTAACA
ATAGAGGATG TACATGAGGC AATAGGCGCA TATTTAGCGA AGAGAATAAC AGAAAATGAG
CTATACGAAA TAGAGAAAAG GGCACATCCG ACATTGGGAA CTTGCTCTGG ATTATTTACA
GCCAATACTA TGGGCTCAAT GTCGGAGGCA TTAGGAATGG CATTGCCAGG TAGTGCATCA
CCAACCGCTA CCTCATCTAG GAGAGTAATG TATGTGAAGG AAACTGGAAA AGCTTTAGGT
AGTTTAATTG AAAACGGAAT AAAGTCAAGG GAAATATTAA CTTTTGAGGC CTTTGAGAAC
GCAATAACAA CGCTAATGGC TATGGGTGGT TCAACAAACG CGGTATTACA CCTATTGGCA
ATAGCCTATG AAGCAGGAGT GAAATTAACC TTAGATGATT TCAATAGGAT ATCTAAAAGA
ACACCATATA TTGCAAGTAT GAAACCTGGT GGAGATTACG TGATGGCTGA TTTGGACGAA
GTTGGAGGTG TCCCAGTAGT CTTAAAGAAG CTACTAGACG CTGGTCTACT TCATGGTGAC
GTTTTAACAG TTACTGGAAA GACTATGAAG CAAAACCTTG AGCAATACAA GTATCCTAAT
GTACCTCACA GTCATATAGT TAGGGATGTT AAGAATCCGA TTAAGCCTAG AGGAGGAATA
GTTATATTGA AGGGATCGTT AGCCCCAGAA GGTGCCGTAA TTAAGGTAGC CGCAACTAAC
GTTGTTAAAT TTGAAGGAAA GGCTAAGGTG TATAATTCCG AGGACGACGC TTTTAAGGGA
GTTCAAAGTG GCGAAGTTAG TGAGGGTGAA GTTGTAATTA TAAGATATGA AGGACCTAAG
GGAGCTCCAG GTATGCCAGA AATGCTGAGA GTTACAGCGG CAATCATGGG TGCGGGTTTG
AATAACGTTG CCTTAGTTAC TGATGGTAGA TTTTCCGGAG CCACTAGGGG ACCTATGGTA
GGCCATGTAG CTCCAGAGGC AATGGTAGGT GGTCCTATCG CAATAGTTGA AGATGGAGAT
ACTATAGTGA TTGATGTTGA GAGCGAAAGA CTTGACTTGA AGCTATCCGA GGAGGAGATA
AAGAATAGAC TGAAAAGATG GTCTCCTCCA TCTCCAAGAT ACAAGTCTGG TCTATTGGCT
AAATACGCCT CCTTGGTCTC CCAGGCGTCA ATGGGGGCTG TGACTAGACC AGCTTAA
 
Protein sequence
MPAKLNSPSR YHGIYNAPHR AFLRSVGLTD EEIGKPLVAI ATAWSEAGPC NFHTLALARV 
AKEGTKEAGL SPLAFPTMVV NDNIGMGSEG MRYSLVSRDL IADMVEAQFN AHAFDGLVGI
GGCDKTTPGI LMAMARLNVP SIYIYGGSAE PGYFMGKRLT IEDVHEAIGA YLAKRITENE
LYEIEKRAHP TLGTCSGLFT ANTMGSMSEA LGMALPGSAS PTATSSRRVM YVKETGKALG
SLIENGIKSR EILTFEAFEN AITTLMAMGG STNAVLHLLA IAYEAGVKLT LDDFNRISKR
TPYIASMKPG GDYVMADLDE VGGVPVVLKK LLDAGLLHGD VLTVTGKTMK QNLEQYKYPN
VPHSHIVRDV KNPIKPRGGI VILKGSLAPE GAVIKVAATN VVKFEGKAKV YNSEDDAFKG
VQSGEVSEGE VVIIRYEGPK GAPGMPEMLR VTAAIMGAGL NNVALVTDGR FSGATRGPMV
GHVAPEAMVG GPIAIVEDGD TIVIDVESER LDLKLSEEEI KNRLKRWSPP SPRYKSGLLA
KYASLVSQAS MGAVTRPA