Gene Ssol_0587 details


Gene Information

Locus tag: Ssol_0587
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 535142
End bp: 536737
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: thiamine pyrophosphate protein domain protein TPP-binding protein
Protein accession: ACX90863
Protein GI: 261601260
COG category:
COG ID:
TIGRFAM ID:
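
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of gene_len_bp bases.

    Assumes the CDS is a whole number of codons and that the final
    codon is a stop codon, which contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(535142, 536737)
print(length_bp)                  # 1596
print(protein_length(length_bp))  # 531
```

Both values agree with the Gene Length and Protein Length fields of this record.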


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0776726
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCAAAA CTACGTCCGA GCTTCTAATT GATACGATTT CTTCTCAAGT TACCGACGTA 
TTCGGGATAC CTGGAACCCA TGGTTTATCA TTATACGAGG AGCTCAGAAA AAGGGTAAGT
AGAGGGGAAA TTAGATACTA TATGCCTAGA TTAGAATACG GAGGGGCAAT AATGGCAGAC
TATTATGCTA GATTAAAGGG AAATGTGGGA GTTTTCTTAT CAGTAAATGG TCCTGGCTTT
ACTAATTCTT TAACTGCTCT GGTCGGCGCT TATTCTGAAG GTTCTCCTCT TGTCCTTATC
TCCCTCAATA AGGAATTTAA ATATAGACAT AGGAGACAAC TTCACGATTC TGGCTATTAC
GACTTACAGT TAGAAATGGC CAGACAAGCA ACTAAGGCAT CATTTAGAAT TTACTCTCCA
GAAGATGTGC CAATTATAAT GGAAAGGGCT TTTAAAATAG CTCTCGAAGA TAAGATGGGA
CCGGTTTACA TTGAGGTTCC GGTCGATGTA TTGGAAGAGA AAGGTGATTT TGAGAATTAT
AAGATTAAGA AGGTTAATAG AACATTAATT TATCCTACGA AAGAGGAAGT AAGGGAAGCG
TTAAATTTCT TGAGTGAGTG TTCTAAACCA ATTCTACTAT TAGGTTATGG TGCGTCTAGA
TCGAACATCT TGAACTACAT TGAAAAATTA GGGATTCCCG TATTCACCAC AATAAGGGGT
AAGGGAAGCA TTCCGGAGAA TCATCCCTTA TATGCTGGAA CAATATTTAA CCTCAAGGAG
ATACCAGGGG ATTGCCTCAT AGCACTAGGG ACATCATTTA ACGATCTCGA AACTAGTAGA
TGGAGCATTA AATTGCCGGA TAGGATACTT CACGTGGATC CGGACGTTAA CGTATTTAAC
ACCTCAATAA ATGCAGAAGT TACTATAAAA GCAAGTGCCG AAGCTTTTCT AGAGGAGATC
GTTGAGAAGG TTAATTTGCC TAAATGGAGT TATAAAGTGG AGGAAAAGAA CAGCGATATA
GTTGATAACA CAAGTGAAAT AACTCATGAT TACTTAGCTA AAGTTTTAGA TGAGACGTTA
AGTGAAGATA GGGTTATCAT CTCTGATGCA GGGACAAATC AAGTTATGGC AATGGATATA
AAAGTGTATA AACCGAACTC ATACTTTAAT TCGCTTATCT TTAACGCAAT GGGATCTGCT
ATTCCAGCTA GCATAGGGGG TAAAATTGCA TCTCCAGAGA GGCAAATAGT GAGTATTATA
GGAGATCTAG GATTTCAAGG ATGTTTTAAT GAACTAATTA CTGCAGCACA GTATAAGATC
AACTTCTTAA CAGTTTTAGT AGAGGATGGT GTACAGCACT TCCTAAGGTT AAATCAGAAA
ATGAGATATG GAAATACTTT TACAACTGAT GTATTTCAAA TAGATTACAC TAAGGTTGTG
GAAGGGATTG GGGTTAACGT AATTGAGGTT AAGGATAGGA AAGACCTTAA GAAAAGTGTA
GAAGAGGCCG TTGGATTATC TCTCAAGAGT CCAACAGTTC TAAGAGTTCA CGTTAGCCCT
AATAGTATAC CTTCTAGATT GTTAATGAAA AGATAG
 
Protein sequence
MGKTTSELLI DTISSQVTDV FGIPGTHGLS LYEELRKRVS RGEIRYYMPR LEYGGAIMAD 
YYARLKGNVG VFLSVNGPGF TNSLTALVGA YSEGSPLVLI SLNKEFKYRH RRQLHDSGYY
DLQLEMARQA TKASFRIYSP EDVPIIMERA FKIALEDKMG PVYIEVPVDV LEEKGDFENY
KIKKVNRTLI YPTKEEVREA LNFLSECSKP ILLLGYGASR SNILNYIEKL GIPVFTTIRG
KGSIPENHPL YAGTIFNLKE IPGDCLIALG TSFNDLETSR WSIKLPDRIL HVDPDVNVFN
TSINAEVTIK ASAEAFLEEI VEKVNLPKWS YKVEEKNSDI VDNTSEITHD YLAKVLDETL
SEDRVIISDA GTNQVMAMDI KVYKPNSYFN SLIFNAMGSA IPASIGGKIA SPERQIVSII
GDLGFQGCFN ELITAAQYKI NFLTVLVEDG VQHFLRLNQK MRYGNTFTTD VFQIDYTKVV
EGIGVNVIEV KDRKDLKKSV EEAVGLSLKS PTVLRVHVSP NSIPSRLLMK R