Gene Ssol_1456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1456 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1339549 
End bp1341006 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content33% 
IMG OID 
Productprotein of unknown function DUF790 
Protein accessionACX91688 
Protein GI261602085 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.471447 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACCT CTGACCTAGC AAGGTTTAAG ATAGAAAACC AGAGGATAAT TCCCCTATTT 
GCTACAGATT CTGACATCGA CGTAGCAAAG GAAGTTATTA ACATGTTTAA AATTGGAGCA
AAAGTTGGCG ATGTTCTGGA AGATATAAAA TATCTATCTA AGATATATGA TTATAAGCTA
GTTAAAGGAT TGTGGAAAAC TTACTTAAGA TACTGTACAG TTGAAAGTGA AACTAAGATC
GATTATGTAG AATTGCGTAG ACAGCTGTTT AGTAGGGGTC CAGTATTAGA GGAGCAAGAT
AAGGAGAGAG CATTAAAAGA GGTCAGGGAT CTTTTTCACG TTGATCCCAT AAAGGTGATG
TATGAGGATC TAGATATAGA GAAGAAAATA GTGGGACTCC CCAAATTTTC TCCTGAAGAT
CTATTAAAAA TCTACAATCT ATCCCTTTTA CAAACCATTA TTTTCAACGC ATATAGGGTT
ACAGTTACCG TTAGCGATGG TTGGAAAGAA ATAGCGAGAA GAATAAAGAT GCTGGGTTTA
ATGTATTTAG CTTATGAAAA CCCACTTAGA ATCGAAATTT TCGGTCCATT ATCTCTAGTA
AAGATGACGG AAAAATATGG GAGAAACTTA GCTGCATTAG TCCCGTTTTT AGTTTCTAGG
AATAAGTGGA CTATTATCGC TGATATAGTT TTAGGTAAAA GGAAAAGAAG AACTTATAGG
CTGGAACTCT CGAGCAACTA TTCTAAATTC TTCAAATATA TTAACGAGGA AGAGATAGAG
AAGAGATTTG ATAGCTCAAT TGAGGAAAAG TTCTATGACG AGTTTAGAAG GATAATAAGG
GATTGGAATA TAGTAAGAGA ACCAGAACCC CTTGTAGTGG ATAAAAGGCT CTATTTTCCA
GATTTTGTAC TGAGTAAAGG TAATACTAAA GTATATGTAG AGATAATGGG ATTTTGGACT
AAAGAATATG TAAACTCTAA GGTGGAGAAA CTGAGAAAGT TCAAATATCC AATTTTAGTT
CTTTTGAATG AAGAGCTTTC ATACGAAAAT TACCTACCAG ATACTTTAAA CGTGATCAAA
TTTAAGAAAA AAGTTGATAT AGGTAAACTA TATTCGATTT TAAGAAAATT TCAAGAGAAT
GCTAATGAGG ATATCGACTT AAGTGATATT AATGACGACA TTATCTTAAT TAAGGAATTG
TCAGCTAAAT ATAATGTAGA CGAGAAGATA GTTAGGAGCA AGTTAATGCA AAGACCAGAT
TATATAGTAC TGAAAAATTA CGCTATAAAG AGAACGTTTA TCGAGGAGTT AAAAAAGGAA
GATTTTTCGA ATACACAACT TTCTGCTCTA GTTAAAAAAT ATGGGAATTA CATAGTTGAT
GTCATAGATT ATTTAGGTTA TAAAATAATC TGGAAAAATA TTGCTGATGC AATAGTAGAA
AAGGTCAGAG AGGTTTAA
 
Protein sequence
MLTSDLARFK IENQRIIPLF ATDSDIDVAK EVINMFKIGA KVGDVLEDIK YLSKIYDYKL 
VKGLWKTYLR YCTVESETKI DYVELRRQLF SRGPVLEEQD KERALKEVRD LFHVDPIKVM
YEDLDIEKKI VGLPKFSPED LLKIYNLSLL QTIIFNAYRV TVTVSDGWKE IARRIKMLGL
MYLAYENPLR IEIFGPLSLV KMTEKYGRNL AALVPFLVSR NKWTIIADIV LGKRKRRTYR
LELSSNYSKF FKYINEEEIE KRFDSSIEEK FYDEFRRIIR DWNIVREPEP LVVDKRLYFP
DFVLSKGNTK VYVEIMGFWT KEYVNSKVEK LRKFKYPILV LLNEELSYEN YLPDTLNVIK
FKKKVDIGKL YSILRKFQEN ANEDIDLSDI NDDIILIKEL SAKYNVDEKI VRSKLMQRPD
YIVLKNYAIK RTFIEELKKE DFSNTQLSAL VKKYGNYIVD VIDYLGYKII WKNIADAIVE
KVREV