Gene Ssol_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1801 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1604258 
End bp1605469 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content35% 
IMG OID 
ProductNucleotidyl transferase 
Protein accessionACX92017 
Protein GI261602414 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0230412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTAGCAG CGGGTAAAGG AGAGAGATTA GAACCAATAA CCCATACAAG ACCTAAACCT 
TTTGTTCCTG TTCTTGAAAC TCCCTTAATT TTAAGGCATA TTCGGATATT AAAGAAATAT
ATAAATAAGA TTATAATTGT AATAAACTCT AACCATAAGG ATTATTTCAA AACTATTGAA
GGAGTTAGTC TAGTTGAACA GACTGAAGGA AAAGGTACCG CTGCAGCTTT ACGAGCAGCC
GAGAAATATC TTGAGGGAGA TGAAGAATTT TTAGTAATTT ACGGAGACCT TCTTTTTGAA
GAAGATGCAT TGGATAAAAT AGTAAATACT GAGGGGGAGG CAATTCTAGC TAGAGAGTCT
GAAGATCCTA GAAAATTTGG AGTTATAGTG AAAGACTCAG AAAATAGATT AGTGAGAATA
GTCGAAAAGC CTGAGAATCC TCCGTCAAAT ATAATTAACG CAGGTATCTA TAAGTTTACC
TATGATATTT TCTCATATAT TGATAAAATA AGTTTATCAA GTAGAGGTGA ATTTGAGCTT
ACAGATGCTG TAAATCTTAT CGGAAATAAG GTTAAGGTAG TTACGTACAA TGGAATATGG
TTAGATATAG GAAGGCCTTG GGATTTAATA GAAGCTAATA AGGTACTTTT AGATAAGGAG
AAGGATCGAA ATCTAGGTGT AATCGAAGAA AATGTTAAAA TCAAAGGTAA AGTAGTTATT
GAAGATGGGG TTATAATAAA ATCTGGTACC TATATAGAGG GTCCCGTTTA TATAGGTAAA
AATTCTGTTA TCGGACCTAA TGCGTATATA AGGCCATATA GTGTGATAGG AAGTAACGTT
AAGGTTGGTG CGTTTAACGA GATAAAGGAA AGCGTGATAA TGGAAAACGC AAAGATTCCG
CATTTAAGTT ATGTCGGAGA CAGTGTAATC TGTGAGGGTG TAAATTTTGG AGCTGGAACT
ATAACCGCGA ATTTGCGGTT TGATGAAGAG GAAGTTAAGG TTAATATAAA AAACGAAAGG
GTAAGCAGTG GTAGAAAGAA ATTAGGTGCA ATAGTAGGTG CCCATGTAAG AACTGGGATT
AATGTATCAA TATTGCCTGG GGTAAAGATT GGTGCATATG CTTGGATTTA TCCAGGAGCT
GTTGTTGATA GAGATGTTGA GAAAGGAGAG AAATATGTTC CATATTACCT AAGAAGGTCT
AGCGGTACTT GA
 
Protein sequence
MLAAGKGERL EPITHTRPKP FVPVLETPLI LRHIRILKKY INKIIIVINS NHKDYFKTIE 
GVSLVEQTEG KGTAAALRAA EKYLEGDEEF LVIYGDLLFE EDALDKIVNT EGEAILARES
EDPRKFGVIV KDSENRLVRI VEKPENPPSN IINAGIYKFT YDIFSYIDKI SLSSRGEFEL
TDAVNLIGNK VKVVTYNGIW LDIGRPWDLI EANKVLLDKE KDRNLGVIEE NVKIKGKVVI
EDGVIIKSGT YIEGPVYIGK NSVIGPNAYI RPYSVIGSNV KVGAFNEIKE SVIMENAKIP
HLSYVGDSVI CEGVNFGAGT ITANLRFDEE EVKVNIKNER VSSGRKKLGA IVGAHVRTGI
NVSILPGVKI GAYAWIYPGA VVDRDVEKGE KYVPYYLRRS SGT