Gene Ssol_0018 details


Gene Information

Locus tag: Ssol_0018
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 16455
End bp: 18068
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: protein of unknown function DUF87
Protein accession: ACX90324
Protein GI: 261600721
COG category:
COG ID:
TIGRFAM ID:
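
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(16455, 18068)
print(length_bp)                  # 1614
print(protein_length(length_bp))  # 537
```

Both values agree with the Gene Length and Protein Length fields of this record.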


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.989354
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGATGG AAAGGGGAGA GATTATAGGA ATAGTACTGC AGAAGAGCGA AGCAAATGAA 
ATGCAAGGAC TAATTAGAGC TGATGAGGAA ATAAGCGTAG GACAATTGTT ATTAGTTGAT
GATTCCGAGA AGCTTTCACT AGTCAGGGTT GAAAATTACG AATTTCTGAA TGAGTTCTTT
GACGAAAAGG GGGAGATAGC TAAGTCAATA TTAAAAGAAC CTTCGATATA TGAAATTCTC
GATATGAATA CAATAATAAA AGCAACTTTG CACTTAATAA AAAAATATGA CCATAATACA
ACCCCTAAAC CTGGCTCTTT CGTAAGAAGA TTACCAGAAA TAAAGAGTGA AAAAGAGCTA
CTTTCCTTTT ATGGAATAAA AAACAAAAAA GGATTAATTG AGTACGGAGC CTTAGCAGGT
TCTGAAATAC CATTATTGTT AGACCTTAAT GCAATAACAA TGCACATGGG AGTTTTTGGA
GAGACTGGAA GTGGAAAAAG CTACAACATG AGATACTTAA TTAAGCTTTT ATCTAACATA
AAAATAGGAG ACAAGATCAC GGCCTTACCG ATGATTGTAA TTGATGCGAA TGGGGATTAC
ATAGATTTAG CTTCGACAAA TCTAGACATA GTATCTAAAG GAAGAGGCTG GATAAAGAGG
TATATATTGA AGGATCCTAA GGAGCAAAAC GACATTAAGC TTACAATAGA TTTGTCCATA
TTCACGCCTA GAGAATTATC AGAATTTATT ATGTCTCTAA AATACGGTGA GGCTTCATAT
AACACACTGC AACTAAATTT CCTAGAGCAA GTTTTAGCAA ACCATGAGAG CAAGGAATAT
AATACCCTTT TAGGTAGTGC AATAGGAATT GAGACCCTAA GGAATGAGAT TCTTACGATG
GCTCAAAATA AGGATATAGG AATCACAACG GGTACTGCTA GGGGAATTGC TAGCGCATTA
GAAATATTCA AAAACAAAGT AATTAGCAGA CTACAACTTG TCAACTCTTC TGCATCTTTG
ACTGAGAACA CATTAGAAGT GATTTGGAGG AATAGAGGTT TAGCAATAAT AGACTTCTCG
GCTGATGGTT CACCAGGAGT AGATGTCCTT ACGAAACAAC TCATCGTAAG CTACATAACT
AGATTAATAT TTAATTATCT TACGAGATCC AAATACAACG GTAATCAAAG GTTTTTGGGA
TTTGTGATAG AAGAGGCACA GAACTACATA CCTTCTATTG ATTATCCAGT AAACGCTAAC
TTGACAAAAG ACGTATTGGT AACACTAGCT ACTCAAGGAA GAAAATTTGG GGCATCTCTA
ATTCTGGTAA GTCAAAGACC AGCATTTATA GATAAATACG TATTATCCAT GATTAACACC
TTTTTCTTTC ATAGAATATA TCATGAAGAC GTAAGATATG TTATGTCCGC TTCAGGTGGT
TTACCCGAAT CATTAACTAA GAATTTGACA TCATTAGATA CTGGATACGT AATAGTAAGT
GGACTTATGT CAATAATGAA AAGTCCGGCA TTGGTAAGAA TCCCATGGGA TCCTAGGTTA
GGATCATACG CTGGAAACGT GGAAAGAATT GATTTAATTT TAAGCGAAGG GTGA
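
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. A minimal Python sketch of one way to do that and to compute a GC fraction follows. The helper names are hypothetical, and the example uses only the first line of the listing rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGGTGATGG AAAGGGGAGA GATTATAGGA ATAGTACTGC AGAAGAGCGA AGCAAATGAA"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(gc_fraction("ATGGTGATGG"))   # 0.5
```

Applying `gc_fraction` to the full joined sequence should give a value comparable to the GC content field of the record, which is reported as a rounded percentage.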
 
Protein sequence
MVMERGEIIG IVLQKSEANE MQGLIRADEE ISVGQLLLVD DSEKLSLVRV ENYEFLNEFF 
DEKGEIAKSI LKEPSIYEIL DMNTIIKATL HLIKKYDHNT TPKPGSFVRR LPEIKSEKEL
LSFYGIKNKK GLIEYGALAG SEIPLLLDLN AITMHMGVFG ETGSGKSYNM RYLIKLLSNI
KIGDKITALP MIVIDANGDY IDLASTNLDI VSKGRGWIKR YILKDPKEQN DIKLTIDLSI
FTPRELSEFI MSLKYGEASY NTLQLNFLEQ VLANHESKEY NTLLGSAIGI ETLRNEILTM
AQNKDIGITT GTARGIASAL EIFKNKVISR LQLVNSSASL TENTLEVIWR NRGLAIIDFS
ADGSPGVDVL TKQLIVSYIT RLIFNYLTRS KYNGNQRFLG FVIEEAQNYI PSIDYPVNAN
LTKDVLVTLA TQGRKFGASL ILVSQRPAFI DKYVLSMINT FFFHRIYHED VRYVMSASGG
LPESLTKNLT SLDTGYVIVS GLMSIMKSPA LVRIPWDPRL GSYAGNVERI DLILSEG
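
The protein sequence can be checked against the gene sequence by translating the latter. The sketch below is a simplified Python illustration: it builds the standard codon assignments, which translation table 11 shares with the standard code, and stops at the first stop codon. Table 11 also permits alternative start codons that are read as methionine at the first position; the sketch does not handle that case, which does not arise here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence, copied from the record.
print(translate("ATGGTGATGG AAAGGGGAGA G"))  # MVMERGE
print(translate("GAAGGGTGA"))                # EG
```

The two fragments reproduce the first seven and last two residues of the protein sequence listed above.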