Gene Ssol_0793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0793 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp735379 
End bp737481 
Gene Length2103 bp 
Protein Length700 aa 
Translation table11 
GC content39% 
IMG OID 
ProductAlpha-glucosidase 
Protein accessionACX91047 
Protein GI261601444 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACAA TAAAAATATA CGAGAACAAA GGCGTTTACA AAGTAGTTAT AGGAGAACCA 
TTTCCCCCCA TAGAATTCCC ACTTGAGCAA AAGATATCAT CGAATAAATC TTTATCAGAG
TTGGGTTTAA CAATAGTTCA ACAAGGTAAC AAGGTTATTG TCGAGAAATC ATTGGATTTG
AAAGAGCACA TTATAGGATT GGGAGAGAAG GCGTTTGAGT TGGATAGAAA GAGGAAAAGG
TATGTGATGT ATAACGTTGA CGCTGGGGCT TATAAGAAAT ATCAAGATCC ACTTTACGTT
AGTATACCCT TATTTATATC AGTGAAAGAC GGCGTTGCAA CTGGTTACTT CTTCAACTCA
GCTTCTAAAG TGATCTTCGA CGTGGGACTT GAGGAATACG ATAAAGTAAT TGTTACAATT
CCAGAGGACT CAGTAGAGTT TTACGTGATT GAAGGGCCAA GAATTGAGGA CGTTCTAGAG
AAATACACGG AGCTTACCGG AAAACCTTTC CTACCTCCCA TGTGGGCTTT CGGTTACATG
ATATCACGCT ACTCTTACTA CCCCCAGGAT AAGGTTGTTG AGTTAGTAGA TATAATGCAA
AAGGAGGGTT TTAGAGTAGC TGGAGTATTC TTAGATATAC ACTACATGGA CTCCTATAAG
TTATTTACAT GGCATCCTTA TAGGTTCCCA GAACCTAAAA AGCTAATTGA CGAATTACAC
AAGAGAAACG TTAAGCTAAT TACAATAGTT GACCACGGAA TAAGGGTTGA TCAGAATTAT
TCACCATTTC TTTCCGGAAT GGGAAAATTC TGTGAGATTG AAAGTGGTGA ACTATTCGTA
GGTAAAATGT GGCCTGGTAC TACTGTCTAT CCAGACTTCT TCAGGGAGGA TACTAGAGAA
TGGTGGGCTG GGTTAATCTC CGAATGGCTT TCACAAGGAG TTGATGGTAT TTGGCTAGAC
ATGAATGAAC CAACTGACTT CTCTAGGGCT ATTGAGATCA GAGACGTTTT ATCTTCGTTA
CCCGTACAGT TCAGAGATGA TAGACTTGTT ACCACTTTTC CAGATAACGT AGTTCACTAC
TTGAGGGGAA AGAGGGTTAA ACACGAAAAA GTTAGAAATG CTTATCCTTT ATATGAGGCT
ATGGCAACGT TTAAGGGGTT TAGGACAAGC CATAGGAATG AAATATTTAT CTTGAGTAGA
GCCGGTTATG CCGGAATACA AAGATACGCA TTCATCTGGA CTGGTGATAA TACCCCTTCA
TGGGATGATT TGAAGCTTCA ACTACAATTG GTTCTCGGCT TATCGATTTC TGGTGTACCA
TTTGTAGGTT GTGATATAGG TGGATTTCAA GGCAGGAACT TCGCGGAAAT TGACAACTCT
ATGGATTTAT TAGTCAAATA TTATGCTTTA GCCTTGTTCT TCCCCTTCTA TAGGTCACAC
AAGGCAACTG ATGGTATAGA TACGGAACCA GTTTTCCTGC CAGATTACTA TAAGGAGAAA
GTAAAGGAAA TCGTGGAGTT GAGGTATAAG TTCTTACCCT ATATTTATTC CTTAGCTTTA
GAGGCTAGTG AGAAGGGACA TCCGGTAATT AGACCTCTAT TTTACGAATT CCAGGATGAT
GACGACATGT ATAGAATAGA AGACGAGTAT ATGGTTGGTA AGTATTTGCT TTACGCTCCA
ATTGTAAGTA AAGAGGAGAG TAGGTTAGTA ACATTACCTA GAGGTAAGTG GTACAATTAC
TGGAATGGCG AGATAATAAA CGGTAAGAGT GTTGTTAAGT CTACTCATGA GTTGCCAATT
TACTTGAGAG AAGGATCAAT AATCCCGTTG GAGGGTGACG AGTTAATAGT TTACGGTGAG
ACCTCGTTCA AGCGTTACGA TAATGCTGAA ATTACCTCCT CAAGTAATGA AATTAAGTTT
TCAAGGGAGA TTTATGTATC TAAGCTAACT ATCACATCAG AGAAACCAGT GAGCAAGATA
ATAGTTGACG ATAGTAAGGA AATTCAAGTA GAGAAGACAA TGCAAAACAC TTACGTTGCT
AAGATTAATC AAAAAATTAG GGGAAAGATT AACCTAGAGG GGAGTGTTCT CAAACAGTCA
TGA
 
Protein sequence
MQTIKIYENK GVYKVVIGEP FPPIEFPLEQ KISSNKSLSE LGLTIVQQGN KVIVEKSLDL 
KEHIIGLGEK AFELDRKRKR YVMYNVDAGA YKKYQDPLYV SIPLFISVKD GVATGYFFNS
ASKVIFDVGL EEYDKVIVTI PEDSVEFYVI EGPRIEDVLE KYTELTGKPF LPPMWAFGYM
ISRYSYYPQD KVVELVDIMQ KEGFRVAGVF LDIHYMDSYK LFTWHPYRFP EPKKLIDELH
KRNVKLITIV DHGIRVDQNY SPFLSGMGKF CEIESGELFV GKMWPGTTVY PDFFREDTRE
WWAGLISEWL SQGVDGIWLD MNEPTDFSRA IEIRDVLSSL PVQFRDDRLV TTFPDNVVHY
LRGKRVKHEK VRNAYPLYEA MATFKGFRTS HRNEIFILSR AGYAGIQRYA FIWTGDNTPS
WDDLKLQLQL VLGLSISGVP FVGCDIGGFQ GRNFAEIDNS MDLLVKYYAL ALFFPFYRSH
KATDGIDTEP VFLPDYYKEK VKEIVELRYK FLPYIYSLAL EASEKGHPVI RPLFYEFQDD
DDMYRIEDEY MVGKYLLYAP IVSKEESRLV TLPRGKWYNY WNGEIINGKS VVKSTHELPI
YLREGSIIPL EGDELIVYGE TSFKRYDNAE ITSSSNEIKF SREIYVSKLT ITSEKPVSKI
IVDDSKEIQV EKTMQNTYVA KINQKIRGKI NLEGSVLKQS