Gene Ssol_2252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2252 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2035804 
End bp2037048 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content35% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionACX92439 
Protein GI261602836 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTATACG AATTAGAAAA CGTGCTGTAT ATGCTTGAAT ACGGTGTCTT CATATATTTT 
CCAGTGAATT TTGCAATATT CTTCTTGTTT AGACAGTTTT TATTGCGCCA TTTCTCTAAA
AGCTATAAAC CTTATATCAG TGGTTTAAGT AGAGATAAGG TAAAAGTAAG TGCAATAGTT
CCAGAGTACG GTGAGAATCT TGAGATTTTT GAAAAATGTC TTAGATCAGT TGCAAAAAAT
AAACCAGATG AGATAATAGT AGTCCACGAT GATAAGAGAA AGGAAGTAGT GGATATATCC
AAAAAGTATG GTGCTAAGGT TATAAGCCTT AGTAGAAGGG TTGGTAAACG CGGTGCATTA
ATCATAGGTT GGTTAAACGC AATTGGAGAT ATAATAGTTC AGTTAGATAG TGATACGATT
ATGGAAGATA ATACTATTAA TGAAATCGTT AAACCGTTTG CGGACCCTAA GGTAGCTGGA
GTTCAAGGGA GACCAGTATT ATTCAGAACT GATGGCAGAA TTCCCTATTT GTTTGGACAA
ATAATAGAGT ATAGTAGGGA TATTGTTGTT AGAGCGTTAA ATGGAACGTT AAATGTGATT
GATGGAAAGA TTGCCGCTTA CAGAAGAAGT TATCTACTAG AGACTATAAG GCATTTCAAT
CACGAGACTT ACGGAAAGAG AAAACTAATT GCTGCAGACG ATAAAGCACT GACTTATTAC
GCAAATATGA ATGGTTATAA GACAGTCTAT CAAGCTACGG CAGTGGCTAA ATCAGCGGCC
CAACCTACGT TTTTAAAATT CCTCAACCAG CAGTTAAGAT GGGCTAGAAG CGGTTATCTT
TACCTAATTA AGGAAATGAG GAGTGGCTTA TTTTTCAAAA TGCCCGGAAA ATATAGATTT
CATATGTTAA CATATCTATT AGCTCCATTT TCATTTGCTT TGGCATTAAT AGACACGCTC
TTAGTTCCTG GAAATCCCAC TGCATTGACT TGGAGTTATT TAGCCTATTA TGGATTTAAC
ATACCTATAA TATTATATTC GCTCCTTATC TTTATTTTTG GTCTTTACTT AAGTATGAAA
ATATCTTTTG GAATCTTGAA CCTTAAACTC CCAGATAAAA TATCCTTCGT TGATCTTATT
ACACTAGGCA TTCTCGGTTT ATTCGTAATA TTCCCCATGT TTATATATGC AGCAATCACC
CATTACGGTG TTTCCGAATG GAGGGGAAGT AGCTATTTGG GTTAG
 
Protein sequence
MLYELENVLY MLEYGVFIYF PVNFAIFFLF RQFLLRHFSK SYKPYISGLS RDKVKVSAIV 
PEYGENLEIF EKCLRSVAKN KPDEIIVVHD DKRKEVVDIS KKYGAKVISL SRRVGKRGAL
IIGWLNAIGD IIVQLDSDTI MEDNTINEIV KPFADPKVAG VQGRPVLFRT DGRIPYLFGQ
IIEYSRDIVV RALNGTLNVI DGKIAAYRRS YLLETIRHFN HETYGKRKLI AADDKALTYY
ANMNGYKTVY QATAVAKSAA QPTFLKFLNQ QLRWARSGYL YLIKEMRSGL FFKMPGKYRF
HMLTYLLAPF SFALALIDTL LVPGNPTALT WSYLAYYGFN IPIILYSLLI FIFGLYLSMK
ISFGILNLKL PDKISFVDLI TLGILGLFVI FPMFIYAAIT HYGVSEWRGS SYLG