Gene Ssol_0222 details


Gene Information

Locus tag: Ssol_0222
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 188303
End bp: 189799
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: dihydropteroate synthase-related protein
Protein accession: ACX90518
Protein GI: 261600915
COG category:
COG ID:
TIGRFAM ID:
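
The Gene Length and Protein Length values above follow from the Start bp and End bp coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon, which is consistent with the listed values but is not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues in a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(188303, 189799)
print(length, protein_length_aa(length))  # 1497 498
```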


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.70107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGTAT TAGTAGTAAC TGGAACTTTA GCTGCACCAA TACTTTCTGA AGTTGCTAAA 
AACATTAAAG ATACTAAAGT TGAGATTAAG GTACTTAATT ACCCTGTTGC CTCATTAATG
AGTACAAAGT TTATAGCAGA AAACTTGAAG CAGACCAAAT TTGATGCAGA TTACATCCTT
TTACCAGGTT TAGTCTATGG CGACGCCAAA ATTGTTGAGG AAGTTACGGG AGTGAGAACT
TTTAAAGGGA CAGAGGAAGC ATGGGATCTC CCTAGAGTAA TTGAGGCATT GAAAAATGGG
ATACAACTCT CCACAACGGA ATCTGCTGAT AAGATCATAG GTAAAATGGA TAATATTGAA
GAGAAGCTAA GAAAAATGGA GGAAGAAGCT AAGATTTCGT TTGAAATAAA TGGAATTAAG
ATTACAACTT ATCCACCGCC ATTTAGAATT TTCTTGGAAA TAGACAATAA GCAAGAATTC
GAGAAATTGG ATAGAATAAG AAAAAACGTT GACGTAGTAG TATTAGGTTT TCCAGTGGGT
CACTACGATT TGGACGAGGT TAAAAATAAG GTTAAACAAT TAGTGGACTA CGGTTACGTT
GTTGGAATAG ATGCTGAATC GCCCAGAGAA TTAAAAGAAG GTGTAAGAGC TGGAGCTTCA
TTCGTATTTA ACCTAAATGA AAATAACTTT GAGGAACTTG AGGAAATTAG GAAAGAAGCA
GCATTTGTTG TAGCCCCGTT TAACACTGAA AATAGAGGAG AAATAACTAT TGATCTCGTA
AAAAAAGCCA AACAAAAAGG ATTCGATAAA TTAATAGCAG ATCCCGTATT ATCGCCTCCC
CTAAGAGGGT TAGTAAGCAG TATAATTGGG TATAAGTACG TCAGAGAGAC GTTGCAAGAT
ATACCTATTC TAATGGGAAT TCTTAACGTG ACTGAACTCA TTGATGCAGA TAGTATAGGA
ATGAATGCAC TACTAACCGC AATTGCTGGA GAGTTGGGAA TTTCTAACTT ATTAATTATG
GAAAAGGGGA AAACGAGGTG GAGTAGTTGG GAAGTATCAC AAGCTACAAA AATGATAAGT
GTAGCTTTGA AGGAAAATAG ACTTCCCAAA GATATAGGAA TAGATTTGTT AGTACTTAAG
GATAAGAGAA GATTTAGGGA GAGTTTTAAC GCTGACGTGA TTGTTAATAG GCGTATAGAG
CCTGAAATGG ATAATACCGG ATTTGCAAAA ATTTTCGTGA GTGAAGATGG ATTTGGAGTA
GAATGGATTG GGAAAAACAA GATAACAATA AAAGGAAAAG ATGGGCTAAG CATTGGCAGA
GAGCTGATTA GGAGAGTTAA GGATATTAGC AAAGAGCATG CCGTGTATAT AGGATATGAG
TTGGCCAAGG CTGAGATTGC GTATCAACTT GATAAAAATT ATATTCAAGA CAAGCCATTG
TTCAAAAAGA TAATTAATGA TAATCTCCAT ACCGAGCACG ATAAGAAAAG AGGTTAA
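
The GC content field in this record can in principle be recomputed from the gene sequence above. A minimal sketch follows, assuming the figure is the plain fraction of G and C bases over the whole CDS; the record does not say how the value was computed or rounded. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(seq.count(base) for base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: pass the gene sequence block as one string, spaces and newlines included.
print(gc_percent("ATGC ATGC"))  # 50.0
```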
 
Protein sequence
MKVLVVTGTL AAPILSEVAK NIKDTKVEIK VLNYPVASLM STKFIAENLK QTKFDADYIL 
LPGLVYGDAK IVEEVTGVRT FKGTEEAWDL PRVIEALKNG IQLSTTESAD KIIGKMDNIE
EKLRKMEEEA KISFEINGIK ITTYPPPFRI FLEIDNKQEF EKLDRIRKNV DVVVLGFPVG
HYDLDEVKNK VKQLVDYGYV VGIDAESPRE LKEGVRAGAS FVFNLNENNF EELEEIRKEA
AFVVAPFNTE NRGEITIDLV KKAKQKGFDK LIADPVLSPP LRGLVSSIIG YKYVRETLQD
IPILMGILNV TELIDADSIG MNALLTAIAG ELGISNLLIM EKGKTRWSSW EVSQATKMIS
VALKENRLPK DIGIDLLVLK DKRRFRESFN ADVIVNRRIE PEMDNTGFAK IFVSEDGFGV
EWIGKNKITI KGKDGLSIGR ELIRRVKDIS KEHAVYIGYE LAKAEIAYQL DKNYIQDKPL
FKKIINDNLH TEHDKKRG
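
The protein sequence begins with M although the gene sequence begins with GTG, which in the standard codon table encodes valine. Under translation table 11, GTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The sketch below illustrates this with the first 60 bases of the gene. It uses the standard codon assignments (table 11 shares them and differs in its start codons) and, as a simplifying assumption, translates the first codon as M without checking that it is a valid start codon. The function name is illustrative only.

```python
from itertools import product

# Standard codon assignments in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the input starts at the initiation codon, so the first
    codon is emitted as M whatever its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for index in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


first_line = "GTGAAAGTAT TAGTAGTAAC TGGAACTTTA GCTGCACCAA TACTTTCTGA AGTTGCTAAA"
print(translate_cds(first_line))  # MKVLVVTGTLAAPILSEVAK
```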