Gene Ssol_0947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0947 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp889955 
End bp891586 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content38% 
IMG OID 
Productthiamine pyrophosphate protein TPP binding domain protein 
Protein accessionACX91192 
Protein GI261601589 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAT ATTTAGTTAT GAATGCCGGA AGGCTATTTT TATCTCTTCT TAAAGAAAGC 
GGGGTAAATA AAATATTTAT AGTATCTGGT ACGGATTACG CTTCATTAAT AGAGGCTAAG
GTCGAAGATT CCAGTTTACC AGAATTTGAA ATAGTACCTC ATGAGATTAC TGCTATATCA
ACTGCAATAG GTTACGCACT AGGCAATAAG CTAAGTGCTG TTGCAGTTCA CACTACACCT
GGAACGGCAA ACGCATTGGG AGGTATTATG AGTGCGTTCA CTTCCAGAAT ACCACTTTTA
GTTATTGCGG GGAGGAGTCC ATATACTGAG AAAGGCAATA CTGCAAGCAG GAACTTGAGA
ATCCACTGGA CACAAGAGGC TAGGGATCAA GGAGAATTGG TTAGGCAATA TGTCAAGTAC
GATTTTGAAA TTAGGATGGC TGATCAGTTA CCAGCAGTAG TATCTAGAGC AATTCAAATA
ATGATGAGTG AACCAAGAGG TCCAGTATAT ATTGTTTTGC CAAGAGAAGT TAGCATTCAA
GAAGTTAATG AAGCTAGGAG AATACCAATG GATTATTATG AACCCGCACC TTCTCCGGAT
AAAATTAACA AGGCTAAGGA AATGCTGGAA AAGTCAGAAA GACCACTAAT TATAACATGG
AGGGCTGGAA GGAGAAAAGA ATGGTTTGAA TCCCTAAGGA GATTTGCAGA TAATTATAAT
ATTCCAGTTC TGAACTACGC TGGCGAGGTT TTGAATTATC CCAGCAGTGG ACCAATGGCT
TTGGATAGGT TTGATTTACG AAATAGTGAT TTGTTATTAG TAGTGGAGGC AGAAGTGCCG
TATTTTCCGA AGAAAATTGA CCTAGATATT CCAATAGTTA AGATTGATGT TGATCCATCT
TATTCTTACA TTCCATATTA TGGCTTTAGA TGTGATCTAT GTATACAATC CACACCCAGC
AACTTCTTTG ATTATATTTC GATTAGACCG AAGAGTTATG ACGAGATTAA GGAGCTAAGG
GCTAAGCAAG AAGAATATAA GAAGCAAGAG ATTGAGAGAT TAAAGGATAA GAAACCAATT
CATCCGAAAT ATTTATCATA TGAAATTGGA ATTGTTGCGT CTGAGTACAA TTTGGTAATA
TTTAATGAGT ATCAATTTAA TCCTAGATAC GCTAGGCTAA ATGAGTTTGG TTCTTACTTT
GCTGACTTGT CCGTAGGATA TCTAGGCTTC GCATTAGGTG CTGGCGTTGG ATATAAGATA
GCTACTAACA AGGACGTACT TATCACTACT GGTGATGGCT CATTTATATT TGGCGTTCCG
GAAGCCTTCT ATTATGTAGC ATCGAAATAT CCGACGATGG TTGTGATTTA TGATAATGGT
GGCTGGTTGG CATCAGCTGA GGCAGTAGAT GAGGTTTTTC CTGAGGGTTT AGCAAAAAGT
AAAAAGTATT ATCCTGGCGC AGATTTTGAT AGAAGGTTCG AGATTGGGAA GACTGTTGAA
ACTTTTCATG GCTACTATGA ACTAGTTGAG GATCCTTGGG AGATTAAGCC TGCCTTAATA
AGGGGCTTAG AGAAAATGAG AAGAGAGAAT AAAATAGCTG TGATTCAAGT TATAGTAGAC
AAGGTGAGAT AA
 
Protein sequence
MIKYLVMNAG RLFLSLLKES GVNKIFIVSG TDYASLIEAK VEDSSLPEFE IVPHEITAIS 
TAIGYALGNK LSAVAVHTTP GTANALGGIM SAFTSRIPLL VIAGRSPYTE KGNTASRNLR
IHWTQEARDQ GELVRQYVKY DFEIRMADQL PAVVSRAIQI MMSEPRGPVY IVLPREVSIQ
EVNEARRIPM DYYEPAPSPD KINKAKEMLE KSERPLIITW RAGRRKEWFE SLRRFADNYN
IPVLNYAGEV LNYPSSGPMA LDRFDLRNSD LLLVVEAEVP YFPKKIDLDI PIVKIDVDPS
YSYIPYYGFR CDLCIQSTPS NFFDYISIRP KSYDEIKELR AKQEEYKKQE IERLKDKKPI
HPKYLSYEIG IVASEYNLVI FNEYQFNPRY ARLNEFGSYF ADLSVGYLGF ALGAGVGYKI
ATNKDVLITT GDGSFIFGVP EAFYYVASKY PTMVVIYDNG GWLASAEAVD EVFPEGLAKS
KKYYPGADFD RRFEIGKTVE TFHGYYELVE DPWEIKPALI RGLEKMRREN KIAVIQVIVD
KVR