Gene Ssol_0972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0972 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp916175 
End bp918307 
Gene Length2133 bp 
Protein Length710 aa 
Translation table11 
GC content38% 
IMG OID 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding protein 
Protein accessionACX91216 
Protein GI261601613 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTACGTTG GCAAACCTAT TAAGCGAATC GAAGATCCTA AATTCTTAAC TGGAGGCTCA 
ACATACGTTG ATGATATAGA ACTTCCAGGT ACATTATTTG TGGCATTTCT CAGATCGGTT
AAACCACACG CTAAAATAAA GATAAAGAAG AATGGTAATA ACGTATTTAC TGGTTATGAC
ATCAATCCAG GTAAGGATTT TCCAATACCC ATAGAGGAGA CAACTTACGT TGGTCAACCA
TTAGCTATTG TAGTAGGAAG GGATAGATAT GAAGCTTATG ACTTGTTAGA AAGTATTGAA
GTAGAATATG AGGAGTTACC TTACGTTATT GATCCTCAAG ATGCTTTAAA AAACGATGTA
AAAGTTTACA GTAAGAAGGA GTCTAATATT TACGAATATA AGAAATGGGA AGGAGGAAAT
ATTGAACAGA GTCTTAAAGA GGCTGATGTG GTAATTAACG GAGAGTTATA TAATCAAAGA
GTAATAGCAA ACCCCTTAGA AACTAGGGGA ATATTAGCTT ACTTCGATGG TAATAGGTTA
AATGTATGGT CATCTACTCA ATCAGCTCAT TATCTTAGAA GAAATCTCAT GAACTTTCTT
GGAATTGATA ACATAAGGGT AATTCAGCCA GATGTAGGAG GAGCGTTTGG GAGTAAAATT
ATAGCTCATC CGGAAGAATA TGCTATAGCT AAGCTAGCCT TAAAAATGAA GAGACCTTTA
AAGTGGATAC CAACCAGGTC TGAAGAGATG CAGAGTGCAG GTCATGGAAG AGATAAGAGA
TTAAAGTTTA AGGTTGCAGT AAAGAGAGAT GGTACAATAC TAGGTATAGA TGGTACTTTA
ATTGCCGATT TGGGAGCGCC TTATCCAGAC GCAAACGATG ATGAGATAGG CAATGTTCAC
AGTACAGTTA GAATGCTATT GGGACCTTAT AGGATTCAAA ACGTGAGAAT AGAGGAATAC
GCTGTAAACA CCAACAAGGC ACCTACTCAA TCATATAGGG GAGCGGGTAG ACCAGAGGCA
ACATATTTCA TAGAGAGAAT TATTAATATA ATCTCTTTGG AGCTAGGAAA GGATGAATTT
GATATAAGAG AGAAGAATTT GATTAGAGAA CTACCATACA AGAATGCCTT GGGTATAACC
TATGATACTG GAGATTATAT TGGTCTATTA AATAAAGCAA GAGAATACTA TGAGACGTTG
AAAAAAGAAG CTAGCACGGA TGAATGTATA GGTTCAAGTA TGTACGTCGA AATAACAGCA
TTTGGCCCTT GGGAGACCGC AAGAGTTCTA GCTAAAAGTG ACGGAAAAAT CATGATAATA
ACTGGCAGTG GACCTCATGG GCAAGGTGAT GGTACTGCTT TTGCCCAAAT AGTAGCCGAT
GTTCTAGAGA TTCCAATAGA GAATATTGAA GTTAGATGGG GAGACACTGA TATAATTTCA
GATGGGATTG GAACTTGGGG AAGTAGAACA GTAACAATTG GTGGGTCAGC GATGTATAAG
GCTGCAGAAG AGTTAAGGAG AAGATTGATT GAAGTAAGTG CAAAAATGTT AAACGCTGAC
GTGGAAGAGG TAGAGTATAA GAATGGGATA TTTTCGCATA AGAAGAGCAG TAAGAGTTTT
ACTATAAAGG AAGTAATACA AAACGCTTAT TCAATGGGAT ACTCCTTAGA TGTAACCTAT
GTCTATAACG TAACTAAACC AGGTTACACT GTACCTTATG GAGTTCATTT AGCGTTAGTT
AAGGTAGATA AAGAGACTGG AAGTATAAGA GTGAGGAAAT ACATTGCCTT AGACGATGTT
GGGAGAGTTA TAAATCCACT ACTTGCAGAA GGTCAGATAA TTGGTGGGGC TTTACAAGGG
ATAGGACAAG CTATATATGA AGGAACAATA TATAGTAAAG AGGGTTATTT GCTAAATTCA
AACTTAACCG ACTATGGGTT TCCTACTGCA GTGGAAGCAC CAAGGATTGA GTGGCATTAC
ATTGAAAAGG GATTATCAGG CCATCCTACT AATTCTAAAG GAATAGGCGA GGCTGGAGCT
ATTGCCTCAA CTCCTGCTGT AGTAAATGCA GTAGAAAAGT GCATTAGGAA GAAAATAGTT
AACATGCCGA TAAGACCAGA AGAGGTTATT TAA
 
Protein sequence
MYVGKPIKRI EDPKFLTGGS TYVDDIELPG TLFVAFLRSV KPHAKIKIKK NGNNVFTGYD 
INPGKDFPIP IEETTYVGQP LAIVVGRDRY EAYDLLESIE VEYEELPYVI DPQDALKNDV
KVYSKKESNI YEYKKWEGGN IEQSLKEADV VINGELYNQR VIANPLETRG ILAYFDGNRL
NVWSSTQSAH YLRRNLMNFL GIDNIRVIQP DVGGAFGSKI IAHPEEYAIA KLALKMKRPL
KWIPTRSEEM QSAGHGRDKR LKFKVAVKRD GTILGIDGTL IADLGAPYPD ANDDEIGNVH
STVRMLLGPY RIQNVRIEEY AVNTNKAPTQ SYRGAGRPEA TYFIERIINI ISLELGKDEF
DIREKNLIRE LPYKNALGIT YDTGDYIGLL NKAREYYETL KKEASTDECI GSSMYVEITA
FGPWETARVL AKSDGKIMII TGSGPHGQGD GTAFAQIVAD VLEIPIENIE VRWGDTDIIS
DGIGTWGSRT VTIGGSAMYK AAEELRRRLI EVSAKMLNAD VEEVEYKNGI FSHKKSSKSF
TIKEVIQNAY SMGYSLDVTY VYNVTKPGYT VPYGVHLALV KVDKETGSIR VRKYIALDDV
GRVINPLLAE GQIIGGALQG IGQAIYEGTI YSKEGYLLNS NLTDYGFPTA VEAPRIEWHY
IEKGLSGHPT NSKGIGEAGA IASTPAVVNA VEKCIRKKIV NMPIRPEEVI