Gene Ssol_1986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1986 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1774418 
End bp1776067 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content35% 
IMG OID 
Productthiamine pyrophosphate protein TPP binding domain protein 
Protein accessionACX92197 
Protein GI261602594 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCAG TTGCAGAAGT AATAATAAGA GTATTAGAAG ATAATGGAAT TCAAAGAATA 
TATGGAATTC CTGGAGATTC CATTGACCCT TTAGTTGACG CGATAAGAAA ATCTAAAGTA
AAATACGTAC AAGTAAGACA TGAAGAAGGT GCAGCTTTAG CTGCCTCGGT CGAAGCGAAA
ATAACTGGTA AGCCTTCAGC ATGTATGGGT ACTTCTGGTC CTGGATCAAT CCATTTATTA
AATGGATTAT ACGATGCAAA AATGGATCAT GCTCCAGTAA TAGCGCTAAC TGGACAAGTA
GAGTCAGATA TGATAGGTCA CGATTATTTT CAAGAAGTTA ACCTAACTAA GTTATTTGAT
GATGTGGCAG TATATAATCA AATTTTAATT AACCCAGAAA ACGCGGAATA TATAATAAGG
AGGGCTATAA GAGAGGCTAT TTCCAAAAGG GGAGTAGCTC ACATAAATTT ACCAGTAGAT
ATTCTCAGAA AGTCCTCAGA ATATAAGGGT AGCAAGAATA CTGAAGTAGG TAAAGTTAAA
TATTCGATAG ATTTTTCTAG AGCTAAGGAA TTAATCAAAG AAAGTGAGAA ACCAGTTTTA
CTAATTGGAG GAGGGACTAG AGGCCTAGGT AAAGAGATAA ATAGGTTTGC TGAAAAAATA
GGAGCACCAA TAATATATAC ATTAAATGGT AAGGGGATTT TACCAGATTT AGATCCTAAA
GTTATGGGCG GAATAGGTCT TTTAGGAACT AAGCCTTCCA TAGAGGCGAT GGATAAGGCT
GATTTATTAA TAATGTTAGG CGCATCATTT CCTTACGTTA ATTTTCTAAA TAAGAGTGCC
AAAGTGATAC AGGTTGATAT AGATAATTCT AATATAGGTA AGAGGTTAGA TGTTAATCTC
TCTTATCCGA TTCCAGTTGC TGAGTTCCTA AATATAGATA TCGAAGAGAA ATCAGATAAA
TACTATGAAG AGTTAAAAGG AAAGAAGGAA GATTGGCTAG ATTCTATAAG TAAGCAGGAG
AATAGTTTAG ATAAACCAAT GAAACCTCAG AGAGTAGCTT ATATAGTTTC CCAGAAGTGC
AAGAAAGACG CAGTAATAGT TACTGATACT GGAAATGTAA CTATGTGGAC TGCTAGACAC
TTTAGAGCTT CAGGAGAGCA AACCTTTATA TTTTCTGCTT GGCTAGGTTC AATGGGCATT
GGAGTCCCAG GAAGTGTAGG AGCTTCTTTT GCTGTAGAAA ATAAAAGACA AGTTATTTCT
TTTGTAGGAG ATGGAGGTTT TACTATGACT ATGATGGAAA TGATAACTGC TAAGAAATAT
GATCTTCCAG TTAAAATAAT CGTTTATAAT AATTCTAAAT TAGGAATGAT AAAATTTGAA
CAAGAAGTAA TGGGGTACCC AGAATGGGGA GTCGATTTAT ATAACCCAGA TTTCACAAAG
ATAGCTGAAT CTATTGGATT TAAAGGATTT AGATTAGAAG AGCCAAAAGA GGCTGAGGAA
ATAATAGAAG ATTTTCTAAA CACTAAAGGA CAGGCACTTT TAGATGCAAT AGTAGATCCA
AATGAGAGAC CAATGCCACC TAAACTAACT TTTAAGCAAG CTGGAGAATA CGTTCTTTCA
ATATTTAGAG AGAAATTAGA GGGTATTTAA
 
Protein sequence
MPSVAEVIIR VLEDNGIQRI YGIPGDSIDP LVDAIRKSKV KYVQVRHEEG AALAASVEAK 
ITGKPSACMG TSGPGSIHLL NGLYDAKMDH APVIALTGQV ESDMIGHDYF QEVNLTKLFD
DVAVYNQILI NPENAEYIIR RAIREAISKR GVAHINLPVD ILRKSSEYKG SKNTEVGKVK
YSIDFSRAKE LIKESEKPVL LIGGGTRGLG KEINRFAEKI GAPIIYTLNG KGILPDLDPK
VMGGIGLLGT KPSIEAMDKA DLLIMLGASF PYVNFLNKSA KVIQVDIDNS NIGKRLDVNL
SYPIPVAEFL NIDIEEKSDK YYEELKGKKE DWLDSISKQE NSLDKPMKPQ RVAYIVSQKC
KKDAVIVTDT GNVTMWTARH FRASGEQTFI FSAWLGSMGI GVPGSVGASF AVENKRQVIS
FVGDGGFTMT MMEMITAKKY DLPVKIIVYN NSKLGMIKFE QEVMGYPEWG VDLYNPDFTK
IAESIGFKGF RLEEPKEAEE IIEDFLNTKG QALLDAIVDP NERPMPPKLT FKQAGEYVLS
IFREKLEGI