Gene Ssol_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2067 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1855469 
End bp1856743 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content33% 
IMG OID 
Productprotein of unknown function DUF402 
Protein accessionACX92273 
Protein GI261602670 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGAA GAGTTAGAAT AAGGGGAATC TACGCTACAG CATTAACGTC AATTTTTTCC 
TCTCTTTCAT ACGAAATTGT ACAACAATCT GTGGAAATAT CAGAACGATT TATGCAAGAA
ATTAATAACT TATCAGCTGA TATCACAATA AAGGATTTTG AGGATGATAG AGGCAAGATC
ATCGTAATGG GAAATGGAAT TATAGAAGAT GACTTACGTA ACGTTTTTAA ATACTCATTC
CATTGGAGAA GCCCAGTTAA ACTATACTCG GTAATAGAAA TAGACGAAAG TTGCACTTAC
GCTAACTTTA AAGTAGAACC TTGCTTGAGA GAGGGAATCG TTATAAAACC ACCTTATGAC
GGAAAAATAA TACTAAGTGA AACTAAGGCC GTAAGTAAAT ACGCTATTGT ATGGAGAGGG
AAGGGAATAA CTACTTTTTC AGAGCACATC GTTGATGAGG AAGAAAAAAT GAGGCTATTA
ACCTTGAGTT TACCTCTTAA TAGAAAAGGA TATAATGTAA AGTGGAGAAG TAATGCAAAG
TATGTCGCAT TAAATGAATT GAAAGAGGAT CTAGAAAGGT TAATATTAAG GTATGAAAAT
AGGGAGTTCA GAGATCAAGG AGAGGATTTT TATTTAATAA CTCTTTCATT ACCGGATAAA
CTGTATTTAG ATGAGGTTAG AAAGAATGTA GTTGATACTG TTAAGTATCA TCATATGTTA
AAGTTAAGCT ATAATAGGGA AGTTGATTCT TTGGAAAAGG ATAAGAAAGG TTCTCTCGGT
AAATTATTGG AAGGGCTAAT CTCAGATTTC TTGAAAATTG AACACATTAA GGCTGATGGA
AAGGTAATTT ATTTGAGAGG TGGAAAGGTA ATTGAAAAGG AAGTTAACGA TAACGGATAT
AGAATAGTCC TTAGGCGTGA GTTTGAAGGT AACGGGATTC TAGATGGTAT AGGTAAGAAG
ATAGAGGAGG GTGATTACGA TATTGTAGAA TATAATTCTG ATAAGTGGTA TCAGATACAT
AAGTATTATA GTGGTATAGA TAACTCACTA AAGGGAGTCT ACATTAATAT ATCAACACCA
CCGGAATTAC TTAGAGGAAA AATAAGGTAT TTGGATCTAG AAATAGATAT TGCAATTAGA
GATTCAGAAA TAGCATTATT AGATGAAGAT GAACTAAATA AAAAGAGTAT TTACATGCCC
TCTTCGCTAG TAAATAAAGC TAAGGAAGTT GTAAATTATC TAATAAATCG AATTCAACAA
AATAAGTTGA GTTGA
 
Protein sequence
MKGRVRIRGI YATALTSIFS SLSYEIVQQS VEISERFMQE INNLSADITI KDFEDDRGKI 
IVMGNGIIED DLRNVFKYSF HWRSPVKLYS VIEIDESCTY ANFKVEPCLR EGIVIKPPYD
GKIILSETKA VSKYAIVWRG KGITTFSEHI VDEEEKMRLL TLSLPLNRKG YNVKWRSNAK
YVALNELKED LERLILRYEN REFRDQGEDF YLITLSLPDK LYLDEVRKNV VDTVKYHHML
KLSYNREVDS LEKDKKGSLG KLLEGLISDF LKIEHIKADG KVIYLRGGKV IEKEVNDNGY
RIVLRREFEG NGILDGIGKK IEEGDYDIVE YNSDKWYQIH KYYSGIDNSL KGVYINISTP
PELLRGKIRY LDLEIDIAIR DSEIALLDED ELNKKSIYMP SSLVNKAKEV VNYLINRIQQ
NKLS