Gene Ssol_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1147 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1068065 
End bp1069696 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content35% 
IMG OID 
Productthiamine pyrophosphate protein central region 
Protein accessionACX91385 
Protein GI261601782 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGC CCAAAAGAAA AGAGGAGACT GTAGGCGTAG AAATGAAAGG CGAAGAAGCC 
CTTGCATACG TTTTAAATGA TATTGGCTTA ACTAAGGTAT TTACAACTTA CTCCCTACCA
AATATTGTTA AGGAAATGTT AAAGAAGTAC AATATTGAAG TTGATTTTTC TATTTCAGTA
AAAGACGCAA GCTGGCTAGC TTATGCTTAT GCAATGGAAA ATAATTCAGT AGGGACTATA
ATTCAGATAC CGGGCTCGAA GTTAACTGAT GCAGTAGATG TTATAGCTCA AGCCTATATG
GAGTCAGTTC CCCTTCTTAT AATATCTAGC GTAAGATCAC ATAAAGACAC AGGAAGGGCC
AGAATTGGTG AATTTAGAAC GATCGATGAT TTATCAAACA TTTTATCTCC AATAATTAAG
ACTAAAGAAA GAGTTATCAG TATAGAAGAA ATAACTGTTA CAATAGAGAA AGCCTATAAG
GAAGCAGTTA GTAATAGACC AAGACCTTCT TATGTTGAAA TATCTGAAGA CTTATTCAGG
GCAAAAGCTT ATCCGTTATC CACTGCGGGG CAAAAGCCAG AGAAGAGGAC TCCAGATAAG
AATAGTGTAG CCAAAGTAGC TGAACTTTTA ACTAATGCTA AATTACCAGT TATAATTGCG
GGATATGGAG TAGTTTTAAG CGACGCAGAA GACATGTTAG TCGAACTGGC TGAATTAATA
GATTCACCAG TAGTTACTAC ATTTAAGGCT AAAGGTTCAA TACCATCTAA CCATAAATTA
TTTGCTGGAG AAGGATTAGG TGCCTTTAGC ACTAGTGCCG CAAACTATCT AATTGAGAAT
GCTGATGTAA TACTAGCGTT AGGCACTAGG TTTACTCAGT TAAGTACAGC TGGCTGGTCA
TTAAAGTATA AAGGCATCCT AGTGCATAAC AATGTTGATG GTGAGGATAT AGGCAAAGTT
TTCATGCCTC ATGTTCCAAT AGTAGCTGAT ACCGGGTTAT TCTTAAGAGA ACTATTAACT
CAATTAAAGG CTAAGATAAA GGAGAAGATA AATAGAGGGG CTAGTGATAT TATTTATAAA
ACTCAACGTC AAGCATACCC AATTACATCT CATAATGATA TATGGCCTAT AGATGTGGTG
AAAATGCTAA GCAGTATAGG TGGCTTTGAG AAAGTCTATG TGGATATTTC GGCTACAACA
ATAGATTTAG TTAGATTACC GATAAATGCT AAGAAGACGT GGTATACTGC AGAATCTTTA
CTAGAGAGAG GTATCGCGGT AGGTGGTATT ATAGCTTCTA AATACGTTGC ATATGGAATT
ACTGATATAG AGGGAATATT ACCTCATTTA TCATTACTAA AATATAAGAT GGATAAGATT
AAAGGAAAGT TAATTATATT AAATGATGGG GGCGCAAATT ACATTGAAGT TTCTAACTCT
GATTTACCTA CTATTGCTAG ATCACAGACT AGTTTCAATG CCAATTTTGA TGAGATTGCA
GAAAAAGCCT TAGGTGGTGT AACTGTTAAT ACATTAACTG AGTTAGAAGA GGCTTTGAAA
TCTGTCGATA AGAAAATCAT AAATGTAAAG ATTGATCCTA ACTTCGAGTC GGTAATACTT
TCAAGAATTT AA
 
Protein sequence
MSQPKRKEET VGVEMKGEEA LAYVLNDIGL TKVFTTYSLP NIVKEMLKKY NIEVDFSISV 
KDASWLAYAY AMENNSVGTI IQIPGSKLTD AVDVIAQAYM ESVPLLIISS VRSHKDTGRA
RIGEFRTIDD LSNILSPIIK TKERVISIEE ITVTIEKAYK EAVSNRPRPS YVEISEDLFR
AKAYPLSTAG QKPEKRTPDK NSVAKVAELL TNAKLPVIIA GYGVVLSDAE DMLVELAELI
DSPVVTTFKA KGSIPSNHKL FAGEGLGAFS TSAANYLIEN ADVILALGTR FTQLSTAGWS
LKYKGILVHN NVDGEDIGKV FMPHVPIVAD TGLFLRELLT QLKAKIKEKI NRGASDIIYK
TQRQAYPITS HNDIWPIDVV KMLSSIGGFE KVYVDISATT IDLVRLPINA KKTWYTAESL
LERGIAVGGI IASKYVAYGI TDIEGILPHL SLLKYKMDKI KGKLIILNDG GANYIEVSNS
DLPTIARSQT SFNANFDEIA EKALGGVTVN TLTELEEALK SVDKKIINVK IDPNFESVIL
SRI