Gene Ssol_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2100 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1878045 
End bp1879358 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content39% 
IMG OID 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionACX92306 
Protein GI261602703 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAGTC AAGATGAGAA ACAAATACAA AGGGAAGTTC GAGAAGCCTT CCCAATGTCA 
GATGATGTTG ACTGGAATGA GGTTTATCAG AGAATAATTT ATAGGTATAG CACACCTCAT
GGACTAGAAC ATGTTAAAGA GGAAATGTAT AAATTAGAAG ATAAGGGCGA AATTATAATA
CACCATATTA AGCCCTACAA TAACCCAGTA GAAGCCCAAA CACTAAACGG ATCTCCAAAG
AAAATACCCA CAACAAAATT ATGGCATCAT AAGAGTTGTG GACAGTGTGG CCACATACCC
GGTTATCCAA CCTCTGTTTT CTGGGTAATG AATAAGTTAG AAATAGATTA CCTAGATGAA
CCGCATCAAA CATCGTGTAC TGGATGGAAT TATCACGCGT CTGGTGCCTC CAACCCCGTA
GCCTTGGCAG GAGTATATGT AAGGAACATG TGGAGAGCTT ATGAAACGGG TTACTTCCCA
TTAATACATT GCGGAACATC ATTTGGTCAT TATAAAGAAG TTAGAAACAT GATAATATTA
CACAAAGAGA TAAGAGACAA ACTTAGACCA ATCATGAGAA AACTGGATAT GGACATTGTA
ATACCAGAAG AGGTAGTTCA TTATTCTGAA TGGTTATATG TAATGAGCAA GAAAGCTGCA
CAGCAGAAGA AATACAATCT AGATAATATT AAGGCAGCTG TACATACTCC TTGTCATGTT
TATAAGTTAG TTCCAGAGGA TACTGTTTAC GATCCCGAAG TATTCCAAGG TAGAAGACCA
GCAGCCCCAT CTGGAACTGT ACAGAATTTC GGCGCTAAAC TAGTGGATTA CTCAACCTGG
TGGGATTGCT GTGGATTCGG ATTTAGACAT ATCCTAACAG AGAGGGAATT CAGTAGAAGT
TTCGCACTAT TTAAGAAGGT TATACCAGCA GTTGAGGAAG GGAATGCTGA TATCTTTGTA
ACCTCAGATA CTGGGTGTGT TACTACTTTA GATAAGAGTC AGTGGGCTGG AAAGGCTCAT
GGTTTCAATT ATAACTTACC AGTATTAGCT GATGCGCAAT TTGCAGCTTT AGCAATGGGC
GCTGATCCAT ATATAATTGC TCAAATTCAC TGGCACGCGA CAGATGTAGA AGGTTTCTTA
AGAAAGATAG GTGTTCCGGT TGATGATTAT AAAGAGAAGT TCGTACAATA TCTACAAGAT
CTAAGAGAAG GTAAGACGGA ACCACAATAC TTATATCCAA AGCATAGAAA GATTGACTTC
TACTTATCAC TTCCAGATAG AGTAAAATGG TACAAGAAGG AGGTTCCAAA GTAA
 
Protein sequence
MLSQDEKQIQ REVREAFPMS DDVDWNEVYQ RIIYRYSTPH GLEHVKEEMY KLEDKGEIII 
HHIKPYNNPV EAQTLNGSPK KIPTTKLWHH KSCGQCGHIP GYPTSVFWVM NKLEIDYLDE
PHQTSCTGWN YHASGASNPV ALAGVYVRNM WRAYETGYFP LIHCGTSFGH YKEVRNMIIL
HKEIRDKLRP IMRKLDMDIV IPEEVVHYSE WLYVMSKKAA QQKKYNLDNI KAAVHTPCHV
YKLVPEDTVY DPEVFQGRRP AAPSGTVQNF GAKLVDYSTW WDCCGFGFRH ILTEREFSRS
FALFKKVIPA VEEGNADIFV TSDTGCVTTL DKSQWAGKAH GFNYNLPVLA DAQFAALAMG
ADPYIIAQIH WHATDVEGFL RKIGVPVDDY KEKFVQYLQD LREGKTEPQY LYPKHRKIDF
YLSLPDRVKW YKKEVPK