Gene Ssol_0984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0984 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp925282 
End bp926463 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content35% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionACX91228 
Protein GI261601625 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00499556 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGATA TTCTGCTAAT AACGTTAAGT GGTATAGTAT CCTCGTGGAC TGTGTATAAT 
GGTATATTAT CAATACTCGG GGTAACGTGG AAACCGTTCG AGAGCAAAAA TCATAGTGGA
ATAACATTTA GTCTGATAGT TCCAGTCAAA AATGAGGAAA GAGTGCTTCC TAGATTATTA
GATCGGCTCG TAAATCTAGA ATATGACAAG TCTAAATATG AAATAATAGT AGTAGAAGAT
GGTTCTACTG ATAGAACCTT CCAAATTTGT AAAGAGTACG AGATAAAGTA TAACAACCTT
ATAAGATGCT ATAGTCTTCC GAGAGCTAAC GTACCGAATG GGAAAAGTAG AGCGTTAAAT
TTTGCCTTAA GGATTAGTAA GGGAGAAATT ATAGGTATAT TCGATGGAGA TACAGTACCA
AGATTAGATA TTTTGGAGTA CGTTGAACCA AAATTTGAGG ATATTACCGT TGGTGCAGTT
CAAGGAAAAT TAGTTCCAAT AAATGTGAGG GAAAGTGTAA CAAGTAGATT AGCTGCTATT
GAGGAATTAA TATATGAATA TTCAATAGCT GGAAGAGCTA AAGTTGGGCT CTTTGTACCA
ATTGAAGGAA CTTGTTCTTT CATAAGGAAA AGTATAATTA TGGAGTTAGG CGGATGGAAT
GAATATTCTC TCACTGAAGA TCTAGATATT AGCCTCAAAA TCGTTAATAA AGGCTGTAAG
ATCGTTTATT CTCCCACAAC TATTAGTTGG AGGGAAGTTC CAGTTAGCTT GAGGGTTTTA
ATTAGGCAAA GATTAAGATG GTATAGAGGG CATTTAGAAG TGCAATTAGG CAAACTTAGA
AAGATCGATT TAAGAATAAT AGACGGTATA CTAATAGTGC TTACGCCATT CTTTATGGTC
TTAAATCTGG TGAATTACTC TCTAGTATTA GTATACTCTT CTTCCCTATA CATTGTCGCA
GCAAGCTTAG TTTCTTTAGC CTCTCTGCTT TCATTATTAC TTATAATCTT AATAGCAAGG
AGACATATGA TAGAGTATTT CTATATGATT CCTTCGTTCG TTTATATGAA CTTTATTGTC
GCACTAAACT TCACTGCAAT ATTTTTAGAG TTAATAAGAG CACCTAGAGT GTGGGTAAAA
ACTGAAAGAA GTGCCAAGGT TACGGGGGAG GTCATGGGAT GA
 
Protein sequence
MLDILLITLS GIVSSWTVYN GILSILGVTW KPFESKNHSG ITFSLIVPVK NEERVLPRLL 
DRLVNLEYDK SKYEIIVVED GSTDRTFQIC KEYEIKYNNL IRCYSLPRAN VPNGKSRALN
FALRISKGEI IGIFDGDTVP RLDILEYVEP KFEDITVGAV QGKLVPINVR ESVTSRLAAI
EELIYEYSIA GRAKVGLFVP IEGTCSFIRK SIIMELGGWN EYSLTEDLDI SLKIVNKGCK
IVYSPTTISW REVPVSLRVL IRQRLRWYRG HLEVQLGKLR KIDLRIIDGI LIVLTPFFMV
LNLVNYSLVL VYSSSLYIVA ASLVSLASLL SLLLIILIAR RHMIEYFYMI PSFVYMNFIV
ALNFTAIFLE LIRAPRVWVK TERSAKVTGE VMG