Gene Ssol_2811 details


Gene Information

Locus tag: Ssol_2811
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 2575799
End bp: 2578825
Gene Length: 3027 bp
Protein Length: 1008 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: peptidase S41
Protein accession: ACX92892
Protein GI: 261603289
COG category:
COG ID:
TIGRFAM ID:
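
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2575799
end_bp = 2578825

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which does not contribute a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the reported Gene Length (3027 bp) and Protein Length (1008 aa).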


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.16663
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCAT ACTATATGTA CCCTGATATT AGGGGAGATT TAATATCATT TACTTCAGAT 
GATGACGTTT GGCTTCTTTC TTTAAAAGAT ATGAAACCGC TTAGGATAAC AAGTGGTTTA
GGAGTTTCCA CTAGGCCTAA AATAAGTCCA AGTGGTAGAA AAGTGGCGTT TTCTGTTATT
TGGCTTAAGA GCGGTAAGCA AGGTGGAGAT ATCTACGTTG TTGAAGACGG GCAAGCTAGG
AGGGTTACCT ATTTTGGTAG TAGGAATAGT AGGGTTGCTG GTTGGATTTC TGAGGACGAG
ATTATTGTAA TAACTGACTT TCACACTCCT TTTATTCAAT GGACTGAGGC GTATAAGGTA
AATGTAAATA ACGGAAAGAC AGAGAAATTG CCTTTCGGCA TGTTATCCAA TATTGTAATA
AAGGATGATA TAATAGTAAT TGCAAGGGGT TATCAAGACT TACCAAACTG GAAAGGGTAT
AAGGGTGGAA CTAAGGGTGA ATTATGGATT TCCAGCGATG GTGGTAAAAC CTTTGAGAAG
TTTGTTAGTT TAGATGGTAA CGTTAGCTGG CCTATGATAG TTAGAGAAAG GGTTTACTTT
TTATCTGATC ACGAGGGAGT TGGTAATCTT TATTCAGTTG ATTTAAAAGG TAAGGATTTA
AGGAGACATA CTAATTTCAC TGATTATTAT TGTAGGAATG CCAGTAGTGA TGGTAAGAGA
ATTGTTTTTC AAAACGCTGG AGACATATAT TTGTACGATC CAGAAAAGGA CAGCTTAACT
AAACTGGATA TTAACTTACC TACCGATAGG AAGAAGAAGC AACCAAAATT CGTTAATGTA
ATGGAGTACA TGAACGAAGC TGTTGTAAAT GGTAACTATA TAGCATTAGT AAGTAGGGGC
AAGGTATTTT TAATGAGACC ATGGGATGGT CCTTCAGTTC AATTGGGTAA GAAACAAGGT
GTAAAATATA GGCAGATTCA AGTTTTGCCT AATGGTGACG TGATAGGAGT AAACGATGAG
GACAAATTGG TAATCTTAGG TAAGGACGGT AGTGAGAAGG TTATAAACAA GGATTTTAGT
AGAATAGAGA GAGTTAAGGT TTCTCCAGAT GGTAAGAAAG TATTATTATC TAACAATAAA
CTTGAATTAT GGGTTTACGA GATTGATAAT GATAACGCTA GATTAATAGA TAAGAGCGAG
TACGACTTAA TTTTAGAGTT TGATTGGCAT CCAAATGCTG AGTGGTTTGC TTACGCTTTT
CCAGAAGGCT ATTATACTCA ATCAATAAAG CTTGCCCACA TTGATGGGAA GGTTGTTAGG
ATAACGACTC CCTATGGATA TGACTTTTCA CCATCATTTG ACCCAGATGG TAGATATTTA
TACTTCTTGG CTGCTAGGCA TTTGGACCCA ACTAACGATA AGGTAATATT TAATTTAAGT
TTCCAGAGGG TTGTTAAGCC ATACCTAGTA GTTTTAGGAA ATTATTATTC CCCATTTAAC
CAACCATTAG ATGAGGCTAA TAGCAACGAC AAAAACGTCA TAATTGAGGG AATCGAAGAT
AGGGTAGTTC CATTCCCGAT TGAAGAGGAA AATTACGTGC AAATAGCTGG AGCTAAGAAC
AACAAGATCT TCCTATTTTC CTATCCAATA AGGGGGCTTA GATCACAAAC TGGAGATGTG
TTTGGTAGGT TAGAGGTTTA TGATCTAGAG AATAAGGCGA AGGAGTTATA TGCAGATAAC
GTTTCAAGCT TCTCTTTGTC TAGCGATAAA AGTAAAATAC TTTTAATACT TAAGGATAGT
CTAAGGCTAT TTGATGTTAA TGTAAAACCA GATTTTAACT CAACTGGAAG AAAAGGTGGG
GTAATAGATT TATCTAGAGT TAAGGTTTAT GTTGAGCCGG AGAAGGAATG GAGGCAAATG
CTTAGGGAAA CGTGGAAGTT GATGAAGCAG AATTATTGGA ATGAGGAGAG ATTAAAGAAT
TGGGACTCTA TCTTACCCAA ATACGAGAGA CTTTTAGATA GGATAAGTAC TAGATTTGAG
CTTTCTGATG TAATTCAAGA GATGCAAGGC GAGACTAGGA CTTCTCATTC CTACGAAACA
GCTTACGATT ACGATACTCC GGAGCCGTTG TCAGTTGGTG GTTTAGGTGC TGAGTTTGAG
TATGATGAGA GCAATAAATG TTACAAAATT ACAAAGATTT ATGTTGGGGA TTCTACCAAT
GAGAATGAGA GAAGTCCATT ACGGGATCCT GGTGTTCAAT TGAATGTTGG AGATTGTATA
AAAAATATTG ACGGGGAAGA TGCAAATGGT AACATTTACT CTCATCTAAT AAATAAGGAT
CAAGTTATTC TTGACGTAAT AACTGCTGAC GGTAAGAATA AACGCGTCAC GGTTAAAGTA
TTAAAAGATG AAAGGTTCTT AATATATAGG TATTGGGTTG AGAAGAATAG GGAATATGTT
CACGAGAAGA GCAAGGGTAG ATTAGGATAT ATTCACATAC CAGATATGAT GTATCAAGGA
TTCGCTGAGT TTTACAGACT TTTCATGTCT GAATTCCACA GAGAAGGGCT AGTAGTTGAC
GTTAGGTTTA ATAGGGGTGG CTTTGTCTCA GGTTTACTCT TAGAGAAGCT ACTCTTGAAA
AGAGTTGGCT ATGATTATCC TAGAAATGGA AAACCAATAC CTATGCCTTA TTTCTCTTCT
CCTAAGGTTT TAGTGGGAAT AACTAATGAG CATGCTGGGT CCGATGGCGA TATCTTTTCA
TTCTTGTTCA AGAAGTACAA GCTAGGAGTA CTTATTGGTA GAAGAACGTG GGGAGGTGTT
GTCGGTATAA GACCTAGATA TAGATTAGTG GATAAAACTT ATATTAGTCA ACCAGAGTTT
GCTGTTAACT TCGAGGATGT AGGTTTTGGT ATTGAGAATT ACGGAGTAGA CCCAGATATA
GTTGTTGAGA TTAAGCCAGA TGATTATGTA AATAATAGGG ATACTCAATT AGATACGGCA
ATAGAGTTGG CATTAAAACA ACTTTAA
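
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any statistic such as GC content can be computed. The following is a minimal sketch of that step. The helper names `parse_blocks` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input, so the result is not expected to match the 37% reported for the full gene.

```python
def parse_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAAAGCAT ACTATATGTA CCCTGATATT AGGGGAGATT TAATATCATT TACTTCAGAT"

seq = parse_blocks(first_line)
print(len(seq), gc_fraction(seq))
```

Passing the complete gene sequence to the same two functions would give the gene-wide GC fraction for comparison with the GC content field.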
 
Protein sequence
MKAYYMYPDI RGDLISFTSD DDVWLLSLKD MKPLRITSGL GVSTRPKISP SGRKVAFSVI 
WLKSGKQGGD IYVVEDGQAR RVTYFGSRNS RVAGWISEDE IIVITDFHTP FIQWTEAYKV
NVNNGKTEKL PFGMLSNIVI KDDIIVIARG YQDLPNWKGY KGGTKGELWI SSDGGKTFEK
FVSLDGNVSW PMIVRERVYF LSDHEGVGNL YSVDLKGKDL RRHTNFTDYY CRNASSDGKR
IVFQNAGDIY LYDPEKDSLT KLDINLPTDR KKKQPKFVNV MEYMNEAVVN GNYIALVSRG
KVFLMRPWDG PSVQLGKKQG VKYRQIQVLP NGDVIGVNDE DKLVILGKDG SEKVINKDFS
RIERVKVSPD GKKVLLSNNK LELWVYEIDN DNARLIDKSE YDLILEFDWH PNAEWFAYAF
PEGYYTQSIK LAHIDGKVVR ITTPYGYDFS PSFDPDGRYL YFLAARHLDP TNDKVIFNLS
FQRVVKPYLV VLGNYYSPFN QPLDEANSND KNVIIEGIED RVVPFPIEEE NYVQIAGAKN
NKIFLFSYPI RGLRSQTGDV FGRLEVYDLE NKAKELYADN VSSFSLSSDK SKILLILKDS
LRLFDVNVKP DFNSTGRKGG VIDLSRVKVY VEPEKEWRQM LRETWKLMKQ NYWNEERLKN
WDSILPKYER LLDRISTRFE LSDVIQEMQG ETRTSHSYET AYDYDTPEPL SVGGLGAEFE
YDESNKCYKI TKIYVGDSTN ENERSPLRDP GVQLNVGDCI KNIDGEDANG NIYSHLINKD
QVILDVITAD GKNKRVTVKV LKDERFLIYR YWVEKNREYV HEKSKGRLGY IHIPDMMYQG
FAEFYRLFMS EFHREGLVVD VRFNRGGFVS GLLLEKLLLK RVGYDYPRNG KPIPMPYFSS
PKVLVGITNE HAGSDGDIFS FLFKKYKLGV LIGRRTWGGV VGIRPRYRLV DKTYISQPEF
AVNFEDVGFG IENYGVDPDI VVEIKPDDYV NNRDTQLDTA IELALKQL
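
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own procedure. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Assumption: these amino acid assignments apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace allowed) codon by codon, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied as displayed.
first_line = "ATGAAAGCAT ACTATATGTA CCCTGATATT AGGGGAGATT TAATATCATT TACTTCAGAT"
last_line = "ATAGAGTTGG CATTAAAACA ACTTTAA"

print(translate(first_line))
print(translate(last_line))
```

The first line yields the opening 20 residues of the protein sequence, and the last line yields its final 8 residues followed by the TAA stop codon, which is dropped.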