Gene Ssol_0100 details

Gene Information

Locus tag: Ssol_0100
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 84820
End bp: 86145
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: CoA-disulfide reductase
Protein accession: ACX90401
Protein GI: 261600798
COG category:
COG ID:
TIGRFAM ID:
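
The length fields above are consistent with the coordinate fields. The following minimal Python sketch shows the arithmetic, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Sketch: relate the coordinate fields of the record to its length fields.
# Assumption: coordinates are 1-based and inclusive (not stated in the record).

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues in a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(84820, 86145)
print(length)                     # 1326
print(protein_length_aa(length))  # 441
```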


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAGGC TTATTATTAT AGGTGGTGGA GCTGCTGGAA TGACAGCCGC CTCATGGGTT 
AGGAGGCTTA AGCCAAACAT GCATGTAACA GTGTTCGAAT CCACTAAGAT GGTGAGTCAT
GCACCTTGTG GTATTCCCTA TTTTACTGAA GGTCTCTTCG ATGACGAAAA CCTATTCATG
ACCTACACTC CAGAATATTT TATTGAAAAG AGAAAGATAA ATGTTAAGAC AAATTCTAAA
GTGGAAGAGG TAGATTTAAG GTCGAGAATT GTCGTGGTAA GAGAAAATCA AGAGAAAAGG
AAATACGAAT TCGACTATCT ATTATTTTCC ACAGGTGCTA GACCTAAGAA GATAAATGCA
GAGGGAGATA GAATTTTCTA TGTTCATCAT CCTGCAGAAG CCTCTTATAT AAGGCAAAAA
TTATGGAGTT TTAATAGGAT TGCAATAATT GGTGGGGGTA TATTAGGCAT AGAAATGGCT
GAAGCATTAA GAGCTAGAGG AAAGAAAATA GTTTTGATTC ATAGAGGGAG ATATTTGCTC
AATAAAATGC TAGATGAAGA TATGGGTAAG ATCGTAACAG ATAAGGTTGA AAGCGAAATA
GAGCTAAAGT TGAATGAGAG TCTCGTAAGT GTAATAGAGG AAGGAAGAGT CGTAATAACA
GATAAAGGAA AATATGATGT AGACGCCACT GTAGTTGCTA TTGGAGTTGA GCCAAATATT
GATCTAGTTA AAGATCAATT AAAAATAGGG CAGACTGGTG CCATATGGGC TGATAACCAT
ATGAGAACTA GTGTCGAAAA TGTTTATGCA GCTGGGGATT CTACAGAATC GATAAATATC
ATTACTAAAA AACCCGATTG GGTTCCTTTT GCACCAGTTG CGAATAAAAT GGGTTTCGTC
GCTGGAAATA ACATAGGTGG TAAAGACGTG ACTTTCCCTG GGGTAGTAGG AACAATGATA
ACTAAATTTG AAGAATTTAT AATTGGTAAA ACTGGTGTTA CTGAGAACGA GGCTAAACGA
TATAACATTA AAACAGTTTC AGCAATAGTT CATCATAAGA CTAGGGCTAG ATACTACCCT
GGATCTAAAG ACATTATAGT GAAGTTAATA GCTGAGGCTG ACACTATGCG AATAATCGGC
GCTCAGATTG TAGGAGAAGA AGAGGTTCTA GGAAGGCTAA ACATGATGGC TGCTGTTATC
CAAAAAGGTT TTACGGCAGA AGATCTATTC TTCGTAGAAA CTGGGTATGT TCCACCCGTA
AATAGGGTAT GGGACGTGGT TACTTTAGCA GCGAGAAAAT TATTTTCCGG TATTAGTGGA
GAATAG
 
Protein sequence
MERLIIIGGG AAGMTAASWV RRLKPNMHVT VFESTKMVSH APCGIPYFTE GLFDDENLFM 
TYTPEYFIEK RKINVKTNSK VEEVDLRSRI VVVRENQEKR KYEFDYLLFS TGARPKKINA
EGDRIFYVHH PAEASYIRQK LWSFNRIAII GGGILGIEMA EALRARGKKI VLIHRGRYLL
NKMLDEDMGK IVTDKVESEI ELKLNESLVS VIEEGRVVIT DKGKYDVDAT VVAIGVEPNI
DLVKDQLKIG QTGAIWADNH MRTSVENVYA AGDSTESINI ITKKPDWVPF APVANKMGFV
AGNNIGGKDV TFPGVVGTMI TKFEEFIIGK TGVTENEAKR YNIKTVSAIV HHKTRARYYP
GSKDIIVKLI AEADTMRIIG AQIVGEEEVL GRLNMMAAVI QKGFTAEDLF FVETGYVPPV
NRVWDVVTLA ARKLFSGISG E
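
The sequences above are printed in space-separated blocks of ten characters. The Python sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes the standard codon assignments, which translation table 11 shares for amino acids. It does not handle the alternative start codons that table 11 allows, which is harmless here because this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq_block: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq_block)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_block: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_block)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGAAAGGC TTATTATTAT AGGTGGTGGA"))  # MERLIIIGGG
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) is a quick consistency check on a record like this one.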