Gene Ssol_1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1993 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1787629 
End bp1788804 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content37% 
IMG OID 
Productformate hydrogenlyase subunit 5 (HycE) 
Protein accessionACX92203 
Protein GI261602600 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATACT ATAAATGGAC TCAAAAAGGC GAGGGAAGGA AAATAGGCAA AATAGGGGAT 
TATTGTCTCT ACGAAAAAAC GATAACAGAA GAGAAATGTG AGGAAAACAA ACCAAATATA
ACACAAACGT ATGGATCATT CAAGTTCATT TACGGACCCT CAGCTGGAGG ACTACTCGAA
ACAATAAAAT TCATTATTAC AACTAATGGT GAAAAAATTC TAGGAATAGA CGCTGAGGTA
TACAAGAACA GAGAAATAGT AATAAGCGGT TTAACTGTGG ACGATGCCTT ACTCAGAGTA
GAGAGAATAA ACGCTCCATT TAGTGCTTCC CACACAATAT CCTTTTTACT CGCTGTAGAA
GATTCGTTAG AATTAGAACA AGACTATCCA ACCCAACTAA AGAGAATAGC CGAAATAGAA
TTGGAAAGAA TAAGAAATCA CTTATTCGTA ATATCGAGAT TAACTGAAAC CACATCACTA
AACGTACCTA CATACCATCT CTTGCACCTC GTTGAAAAAG TCAACAGATT AATAGGCAAA
ATGTGTGGTC ACAGGTATTT CTTTGGCGTT AATGCAATTA ACGGGGTTAA CTGCGATTTC
GGAAATTTAT TAAGAATAAT AGATATTACC AAGGAATTTA AACAAATCTT CGATGGGCTA
CTTGAAAGTA GAATCTTCAT AGATAGACTC CAAGAAAACG GAAAAATAAT AGATGAAAAC
AGTATAGGAC CAGCTGCAAG AGCTGCTGGA CTCGCTTACG ATGCGAGAAA GGACTTTAAA
GCCTTACCTT ATGAAGACTT AGGTTTTAGA ACAGTTATCA CACAAGAGGC AGACTCATTC
GGAAGGTTCC TAGTTAGGGG AATGGAGATA ATCGAGTCGG CCAAAATTTT AGTAGAGTTA
CACGATGAAA TAAAGAACAG CAATAACGAG AGAGGGAAAA ATCACAAACA AGGAGGGGGA
GAGGGACTAG CCAGAGTCGA GAGTCCATCT GGTGATCTAG CCTATTACGT CAAGTTAAAT
AACGGGATTA TCGACTCAGT ATCACTTCTC ACTCCTTCAC AAGTCAATCT CAACCTATTT
TTGAAAAGCG TGATAAACAC AATATTTACC GATTTTCAAT TCAATTGGGA AAGTTTTGGA
ATTTGGGTAA GTGAAATAGG AGTGATGTTA AAGTGA
 
Protein sequence
MKYYKWTQKG EGRKIGKIGD YCLYEKTITE EKCEENKPNI TQTYGSFKFI YGPSAGGLLE 
TIKFIITTNG EKILGIDAEV YKNREIVISG LTVDDALLRV ERINAPFSAS HTISFLLAVE
DSLELEQDYP TQLKRIAEIE LERIRNHLFV ISRLTETTSL NVPTYHLLHL VEKVNRLIGK
MCGHRYFFGV NAINGVNCDF GNLLRIIDIT KEFKQIFDGL LESRIFIDRL QENGKIIDEN
SIGPAARAAG LAYDARKDFK ALPYEDLGFR TVITQEADSF GRFLVRGMEI IESAKILVEL
HDEIKNSNNE RGKNHKQGGG EGLARVESPS GDLAYYVKLN NGIIDSVSLL TPSQVNLNLF
LKSVINTIFT DFQFNWESFG IWVSEIGVML K