Gene Ssol_1251 details

Gene Information

Locus tag: Ssol_1251
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1161341
End bp: 1162852
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 32%
IMG OID:
Product: tRNA-guanine transglycosylase, various specificities
Protein accession: ACX91487
Protein GI: 261601884
COG category:
COG ID:
TIGRFAM ID:
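
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another. The following minimal Python sketch shows the arithmetic, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1161341
END_BP = 1162852


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1512 503"
```

Both results match the Gene Length (1512 bp) and Protein Length (503 aa) values listed in the record.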


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGTAT TTGAAGTAAA ATATGAGGAC TTAGCAGGGA GAATAGGAAC CTTAAGAACA 
AGAAGTGGTA CCTTAGAAAC TCCAGCATTC TTTCCAGTAA TTAACGTATT AAAAAAAGAT
GAAATATCGA TAGATGAGAT AAGAAATATA GGATTTAAAA ACTTTATCAC AAATTCTTAC
ATATTATACA AAAATAACTA TATAAAGGAT GATATCCATA AGGAGTTACG CTCTGAAGAA
ATGATCATAA TGACAGATTC AGGGGCATAT CAAATTCTAG AGTATGGAGA AATAGGAATA
ACCAATCTCC AGATCGTGAA TTATCAGCTT AAAATAAAAC CAGATATCGG AGTAATATTA
GATTTACCTA CCGGGAATAT AAATGATTAT GATAACGCTA AAAAGACAGT ATATGAGACA
TTAAAAAGAG CGGAAGAAGC TTCAGAAATC ATAGTAAAAA ATCAAGATAA CAATATCATT
TGGGTATATC CAATACAGGG AGGAAGATAT CTTGATCTAG TTAAGACTTC TGCTGAAGGT
CTATCTAAAT TTGAACATAT ATACAATATG GCCGCTCTTG GTAGCCCAAC AGTTCTCTTA
GAGAAGTACA TGTATGATAC TGTAATTGAC ATGATTTATA CTGCTAAATC TAACATAAAA
AGAGGAATCC CGTTTCATCT ATTTGGAGGA GGGTTACCTC ATATCATTCC ATTTGCAGTA
GCGTTAGGAG TTGACAGTTT TGACTCTGCT TCATATATAA TATATGCCAG AGACAATAGA
TATATTACTA GGACACGCGT ATACAAATTA GAGGATTTAG AATATTTTCC ATGTTCTTGT
CCAATATGCT CTAAATACAC ACCTAAGGAT TTACTTGAAA TGAATGAGAA AGAAAGAACA
AAAGCATTGG CTATTCATAA CCTTTATACT ATTTTAGAAG AATTTAAAGC AACTAAACAG
GCGATTAAGG AAGGAAGATT ATTTGAATAT CTCCAAGAAA AAGCTTACTC TCATCCAGCA
GTATATTCTG CATTCAAACG ATTGATGAAA TATAAGGATT ATCTAGAGAA ATTTGACCCT
AGAATAAGGG GAGATCCAAA AGGTTTGTTT TTATTTGACG GTAACTCTTT ACATAGGCCA
GAAATTATAC GTCACTCGAG ATTTCTAGAA AGATACATAC AAAAGAAAGA TAAAATATCC
ATATATTGCT ATGATAAAGC AATAAGTGAT ACTGCTTATG ATTTCAAGGA AAAAATAAGG
GAAAAAATAG CTGATCGTAA TGAGAGCGAC GTATTTATAG CAGTACCGTT TTTTGGTTTA
ATACCGTTAG AGATCTCAGA TTCTTATCCT CTATCTCAAT TCGAGATACC AAATGAAATA
GATGAAGATG TAATAGACGA TATGAAAACT AAAATCATTT CGTTCTTAAG ACGTAATAAT
TACCAAAAAG TAGAGTTAAT TAACTGTGAA AAACTAGGCT TACATATAGA CTCTATCAGC
ACTTCCTCTT GA
 
Protein sequence
MTVFEVKYED LAGRIGTLRT RSGTLETPAF FPVINVLKKD EISIDEIRNI GFKNFITNSY 
ILYKNNYIKD DIHKELRSEE MIIMTDSGAY QILEYGEIGI TNLQIVNYQL KIKPDIGVIL
DLPTGNINDY DNAKKTVYET LKRAEEASEI IVKNQDNNII WVYPIQGGRY LDLVKTSAEG
LSKFEHIYNM AALGSPTVLL EKYMYDTVID MIYTAKSNIK RGIPFHLFGG GLPHIIPFAV
ALGVDSFDSA SYIIYARDNR YITRTRVYKL EDLEYFPCSC PICSKYTPKD LLEMNEKERT
KALAIHNLYT ILEEFKATKQ AIKEGRLFEY LQEKAYSHPA VYSAFKRLMK YKDYLEKFDP
RIRGDPKGLF LFDGNSLHRP EIIRHSRFLE RYIQKKDKIS IYCYDKAISD TAYDFKEKIR
EKIADRNESD VFIAVPFFGL IPLEISDSYP LSQFEIPNEI DEDVIDDMKT KIISFLRRNN
YQKVELINCE KLGLHIDSIS TSS
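
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following minimal Python sketch joins the blocks and computes a GC fraction that can be compared against the GC content field in Gene Information. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence rather than the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
example = join_blocks("ATGACTGTAT TTGAAGTAAA")
print(len(example), gc_fraction(example))
```

Pasting the full gene sequence into `join_blocks` should yield a string whose length equals the Gene Length field, which gives a quick check that no blocks were lost when copying.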