Gene Ssol_2200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2200 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1980794 
End bp1981924 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content37% 
IMG OID 
Productmethane/phenol/toluene hydroxylase 
Protein accessionACX92392 
Protein GI261602789 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0929517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAGAT TAGAGGATTT GGAATGGTAT AAAAGGTATA AACAAATGTT CGGAGCATTC 
AAAAGCGGTC CAGAAGGGGA TCCGTTTTTC AGAGATTATG AGTATAGGGG ATACAAGAAA
GTTTGGTCAA CGTGGCCAAT GTTGGAGAAA AAGTTAGGAA GAAAAAAACC ATCAGAATAC
CAAGTAGTTA CTTATGCATT ATCATACTGG GCCGATCCAA GATCCCCTAC TTATATCTAC
GATAAAGGGC CATTTGAACT AGGGACAGAA CACATAACCC AAAAATGGTA TAAACACTTT
AGAGACAACT CACCGTTCAT TAAACCATTA TTCGAAAGAG GTGAATGGCA TGATTATGAA
GACCCATACA AATTGACATA TTGGACTTAT AACTCCATGG CGGACGATAA TGAAACTTTC
TTAGATAAGA TTTACGAGGA GATTGTAAAT ACTAAATATG ATTGGAACCT AAATGAAGAG
GTACTGGAAT TATACAAGAA CGTTTATGAC CCATTAAGAT ACGTATTCCA TATTATGCAA
ATGGAGTCAA TGTATCTAGC TACTATGGCT CCTACAAGTT CTATAGCTAA CGTATTTATT
TTCATGGGAA TGGACCATTT AAGGAGAGTC CAAAGAATTT CACAAAGGGT AAAGATGCTC
GATATTGTAT ATCCAAGTCT AGGATTTGGA AAGGAAACAA GGAAGGTATT TGAGGAGAGT
CCAATATTTC AGCCAACAAG AGAAGTATTG GAGAAAATGT TAGTTACGTA TGATGTAGGA
GAAGCCTTAG TAGCGTTTAA CTTAGCAGTC AAATTCGTAT TAGATGAACT GATACTCCAA
CATCTAACTC AACCATTCAG TAAGTTAGGA GACGAGATGA TAAAGCACAT TCACTTATCG
TTCTATAACG ATACGTTAAG ACATAGACAT CAAGCTCAAG AGCTGTTCAA ATACGCATTC
AGTAAGGAGC CAAGTTTAAA GGATGTTATT AAACCTTGGG TGAAAGATTG GCAAGAGATG
GGATTTAAGG CTACAGAAGG GTTTAGAGAT GTGCTTAAAG GAGAATATGA TAATGCTATA
AGGCAGATTA GGAAGGCTCA TAGTGAATAT CTTGGAGGAA TAGGACTATG A
 
Protein sequence
MTRLEDLEWY KRYKQMFGAF KSGPEGDPFF RDYEYRGYKK VWSTWPMLEK KLGRKKPSEY 
QVVTYALSYW ADPRSPTYIY DKGPFELGTE HITQKWYKHF RDNSPFIKPL FERGEWHDYE
DPYKLTYWTY NSMADDNETF LDKIYEEIVN TKYDWNLNEE VLELYKNVYD PLRYVFHIMQ
MESMYLATMA PTSSIANVFI FMGMDHLRRV QRISQRVKML DIVYPSLGFG KETRKVFEES
PIFQPTREVL EKMLVTYDVG EALVAFNLAV KFVLDELILQ HLTQPFSKLG DEMIKHIHLS
FYNDTLRHRH QAQELFKYAF SKEPSLKDVI KPWVKDWQEM GFKATEGFRD VLKGEYDNAI
RQIRKAHSEY LGGIGL