Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0174 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 149208 |
End bp | 150566 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | ACX90470 |
Protein GI | 261600867 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAG AGGAAGGTGT ATTAAGAGGA ATAGACTATA GAGATCCTCT AGTTAAATAT AGAGGAGAAA GAATATCATA TACTTTGAAA AAATTACTAG GAAGAGATGT ACAATTAAAT GAAACGTTTT TAGTTGACAC CTATTACGTG CACTATTTAC CATTGCCAAT AACCAAAGGA AAAAGTGAAA TAGAAAAGAA CCAAGAGATA GCTTATTCAT TAGTAAATTC CACTCTATCT TCTGACATAG TTCTAAAAAA CAGGGAATAC TCAATTGTGA ACTCTGCTGT GAGCTTAGCT TTAACCGTGA GTTACGTCCA AAATCTTATT GAGGAGTTAG AAAGAATAAA GAAGACCTCC CAATCTATGG AGGAGAGGGA AGCGGCTGAA GAGATACTAA ACGGTTTAAT GAAAGGAAGC TCTTCAAAAG AAGGGAAAGA ACAAAAGAAT TCCAATCAAC AATCTATGGA AAAGGTCCTC AGGCAAGCCC ATGAGAAGGC AATGTCTAAG GCCATAGAGG ATGCCAATTC AGTGAGAAAC ATGCAAAAGA TCGTTGGAGG GAATGGAGCA GGCACTGGAA GCGTCCTAAC GTTTGAAGGA GAGATTCATG AAGTGTTAAG ACTCGCAAGG AATACTGAAA TTAAGAAGAT CTTGGAGTTT TTAAGTGGTA TTCCAAAATT AGGTAGTATT ACAAAGAGGA GGACAACTAG ATTCTCGAAA GGTGAATTAT ACGGATATGA AGAGGGAAGT GATATTGAAA GGATAGTCTA CTCCGAATTG GCCCTACCAG ATATGCTCTT TTACTTGAAA CTAGCAGAAG GCCAGTTGTT ATTATATCAA AAACAGATTA AAGAAACATT AGGCCCCATA TATCTATTAC TTGATAAATC GGGAAGTATG GATGGAGAGA AAATATTATG GGCCAAAGCT GTAGCACTAG CATTATACAG TAGAGCAAAA AGAGAAAATA GAGATTTCTA CCTCAGATTC TTCGACAATA TTCCGTATCC ATTAATTAAA GTTCAGAAGA ATGCCAAGAG CAAAGACGTC ATAAAAATGA TAGAGTATAT AGGGAAAATT AGAGGAGGAG GTGGTACAGA TATAAGCAGA TCAATAATAT CTGCTTGCGA AGACATAAAG GAAGGTCATG TTAAAGGGGT AAGTGAAATA ATATTATTAA CAGATGGAGA AGATAAAATT GCAGAAACTA CTGTGAGAAG ATCATTAAAA GAGGCCAACT CTCAACTAAT AAGTGTCATG ATTAGGGGAG ATAATGCTGA TCTTAGAAGA GTATCCGATG AGTATTTAAT AACCTATAAA TTAGACCACG AAGACTTGTT GAAAGTAGTG GAAAGTTAA
|
Protein sequence | MSEEEGVLRG IDYRDPLVKY RGERISYTLK KLLGRDVQLN ETFLVDTYYV HYLPLPITKG KSEIEKNQEI AYSLVNSTLS SDIVLKNREY SIVNSAVSLA LTVSYVQNLI EELERIKKTS QSMEEREAAE EILNGLMKGS SSKEGKEQKN SNQQSMEKVL RQAHEKAMSK AIEDANSVRN MQKIVGGNGA GTGSVLTFEG EIHEVLRLAR NTEIKKILEF LSGIPKLGSI TKRRTTRFSK GELYGYEEGS DIERIVYSEL ALPDMLFYLK LAEGQLLLYQ KQIKETLGPI YLLLDKSGSM DGEKILWAKA VALALYSRAK RENRDFYLRF FDNIPYPLIK VQKNAKSKDV IKMIEYIGKI RGGGGTDISR SIISACEDIK EGHVKGVSEI ILLTDGEDKI AETTVRRSLK EANSQLISVM IRGDNADLRR VSDEYLITYK LDHEDLLKVV ES
|
| |