Gene Ssol_0197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0197 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp170332 
End bp171612 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content31% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionACX90493 
Protein GI261600890 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.823444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAGGG AAATTTTTGA AATTTTGCTT TTTATATCAT CATTTTTTAC ATCTTTATGG 
ATACTCCTTC AAGCATTCTA CTATAAAGTT TCTAATCAAA ACATTATACA ATTTTCTACT
AAAAATGATA AAAATTCTAA CAAAAGAATA GACATAATCG TTGCAATAAA AGACGAAGAT
GAAAAGACGA TTAAAGAACT AATTAATAAC CTATCGGGAT TGGACTATAG ATTTTACAAA
GTTATTATAG TATCTGATGA TACAGAGGAG ACTTTTAAAA AAATTATAGA ATCACTAGAT
AAACTTCCGG ACAATTTCGT AATTATAAGG AGACCAGAAA ACAAGGGAAG AAAAGCTGGA
GCACTAAACT TCGCCACTAA TATTTCTGAT GCTGAAATGT TAGTGTATCT GGATGCAGAA
GCCAGGGTCG AAAAGGACTT TTTACGTAAA ATTTCTCAAC TTGACTACGA TGCGGTTGCG
TTTAGATTAA AAGTTAGAGA TGTTAATACA CAAGTTCAAA AGATATACTC ATATACCAAT
GAATTTGTAA TGAACGCATT ATTCAAGGCT AGAGACAAGT TAGGTCTAAT AATATTTGCA
AATGGTTCAG CATTCGGAAT AAAGAGAGAT ATTTTAAGGA AGATAGGTGG ATGGAAAGAA
AATAGCGTAG CAGAAGACTT AGAGCTAGGT ATTAGACTTG CTCTGAGTAA TATTAAAGTA
AAATACGTTG ATGACATCAC AGTTTATACC TTAGCTCCCT ATACCCATAC TGATTTATAT
AACCAAATTA AAAGATGGGC TTATGGTTCT GGAGAATTGA TCTCTTACAG CATGAGATTG
TTTAAATTAG GAATAAGGGG AATTGAGGGA TTTATATACT CACAACAGTG GGGAATTTAC
CCCCTATACC TACTACTATT TCTTATTATT ATCTCAATAC AGTTTATATT AAATATAAAC
TACTTTTATG TCTTTACCTC ACTAATCCCA ATACTAGTCT CGAATGGAAT TTACATAGCT
CTGATAAAAC CTAAGGGAGA TTATAAAAGT GGCATTGTAA CCCTAATTGC TTCTCTTATC
GGCTACATTC AAGGAATATT CAAAGTCAGG TTTAAATGGA AAGTTACACC CAAAAGCCTA
GTTGGGAAAG AAGAAGAAAT CTTGAGTATA AAAATATTAG GGATTATTCT TGCGATAATG
GCATATATTA ATAGTCTTTT CAATAACACA ATTTCATCTT TATTAATAAT TTTGTTTTCA
CTTATTCTTT TAACTCTATA G
 
Protein sequence
MIREIFEILL FISSFFTSLW ILLQAFYYKV SNQNIIQFST KNDKNSNKRI DIIVAIKDED 
EKTIKELINN LSGLDYRFYK VIIVSDDTEE TFKKIIESLD KLPDNFVIIR RPENKGRKAG
ALNFATNISD AEMLVYLDAE ARVEKDFLRK ISQLDYDAVA FRLKVRDVNT QVQKIYSYTN
EFVMNALFKA RDKLGLIIFA NGSAFGIKRD ILRKIGGWKE NSVAEDLELG IRLALSNIKV
KYVDDITVYT LAPYTHTDLY NQIKRWAYGS GELISYSMRL FKLGIRGIEG FIYSQQWGIY
PLYLLLFLII ISIQFILNIN YFYVFTSLIP ILVSNGIYIA LIKPKGDYKS GIVTLIASLI
GYIQGIFKVR FKWKVTPKSL VGKEEEILSI KILGIILAIM AYINSLFNNT ISSLLIILFS
LILLTL