Gene Ssol_0174 details

Gene Information

Locus tag: Ssol_0174
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 149208
End bp: 150566
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: von Willebrand factor type A
Protein accession: ACX90470
Protein GI: 261600867
COG category:
COG ID:
TIGRFAM ID:
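
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(149208, 150566)
print(length, protein_length_aa(length))  # prints "1359 452"
```

Under these assumptions the result matches the listed gene length (1359 bp) and protein length (452 aa).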


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAAG AGGAAGGTGT ATTAAGAGGA ATAGACTATA GAGATCCTCT AGTTAAATAT 
AGAGGAGAAA GAATATCATA TACTTTGAAA AAATTACTAG GAAGAGATGT ACAATTAAAT
GAAACGTTTT TAGTTGACAC CTATTACGTG CACTATTTAC CATTGCCAAT AACCAAAGGA
AAAAGTGAAA TAGAAAAGAA CCAAGAGATA GCTTATTCAT TAGTAAATTC CACTCTATCT
TCTGACATAG TTCTAAAAAA CAGGGAATAC TCAATTGTGA ACTCTGCTGT GAGCTTAGCT
TTAACCGTGA GTTACGTCCA AAATCTTATT GAGGAGTTAG AAAGAATAAA GAAGACCTCC
CAATCTATGG AGGAGAGGGA AGCGGCTGAA GAGATACTAA ACGGTTTAAT GAAAGGAAGC
TCTTCAAAAG AAGGGAAAGA ACAAAAGAAT TCCAATCAAC AATCTATGGA AAAGGTCCTC
AGGCAAGCCC ATGAGAAGGC AATGTCTAAG GCCATAGAGG ATGCCAATTC AGTGAGAAAC
ATGCAAAAGA TCGTTGGAGG GAATGGAGCA GGCACTGGAA GCGTCCTAAC GTTTGAAGGA
GAGATTCATG AAGTGTTAAG ACTCGCAAGG AATACTGAAA TTAAGAAGAT CTTGGAGTTT
TTAAGTGGTA TTCCAAAATT AGGTAGTATT ACAAAGAGGA GGACAACTAG ATTCTCGAAA
GGTGAATTAT ACGGATATGA AGAGGGAAGT GATATTGAAA GGATAGTCTA CTCCGAATTG
GCCCTACCAG ATATGCTCTT TTACTTGAAA CTAGCAGAAG GCCAGTTGTT ATTATATCAA
AAACAGATTA AAGAAACATT AGGCCCCATA TATCTATTAC TTGATAAATC GGGAAGTATG
GATGGAGAGA AAATATTATG GGCCAAAGCT GTAGCACTAG CATTATACAG TAGAGCAAAA
AGAGAAAATA GAGATTTCTA CCTCAGATTC TTCGACAATA TTCCGTATCC ATTAATTAAA
GTTCAGAAGA ATGCCAAGAG CAAAGACGTC ATAAAAATGA TAGAGTATAT AGGGAAAATT
AGAGGAGGAG GTGGTACAGA TATAAGCAGA TCAATAATAT CTGCTTGCGA AGACATAAAG
GAAGGTCATG TTAAAGGGGT AAGTGAAATA ATATTATTAA CAGATGGAGA AGATAAAATT
GCAGAAACTA CTGTGAGAAG ATCATTAAAA GAGGCCAACT CTCAACTAAT AAGTGTCATG
ATTAGGGGAG ATAATGCTGA TCTTAGAAGA GTATCCGATG AGTATTTAAT AACCTATAAA
TTAGACCACG AAGACTTGTT GAAAGTAGTG GAAAGTTAA
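
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be stripped before the sequence can be measured or analysed. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names are hypothetical, and only the first line of the sequence is used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGAGTGAAG AGGAAGGTGT ATTAAGAGGA ATAGACTATA GAGATCCTCT AGTTAAATAT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases on a full line
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```

Passing the full multi-line sequence to `clean_sequence` in the same way would allow the listed gene length and GC content to be re-derived.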
 
Protein sequence
MSEEEGVLRG IDYRDPLVKY RGERISYTLK KLLGRDVQLN ETFLVDTYYV HYLPLPITKG 
KSEIEKNQEI AYSLVNSTLS SDIVLKNREY SIVNSAVSLA LTVSYVQNLI EELERIKKTS
QSMEEREAAE EILNGLMKGS SSKEGKEQKN SNQQSMEKVL RQAHEKAMSK AIEDANSVRN
MQKIVGGNGA GTGSVLTFEG EIHEVLRLAR NTEIKKILEF LSGIPKLGSI TKRRTTRFSK
GELYGYEEGS DIERIVYSEL ALPDMLFYLK LAEGQLLLYQ KQIKETLGPI YLLLDKSGSM
DGEKILWAKA VALALYSRAK RENRDFYLRF FDNIPYPLIK VQKNAKSKDV IKMIEYIGKI
RGGGGTDISR SIISACEDIK EGHVKGVSEI ILLTDGEDKI AETTVRRSLK EANSQLISVM
IRGDNADLRR VSDEYLITYK LDHEDLLKVV ES