Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_1957 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1741603 |
End bp | 1742988 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | homocitrate synthase |
Protein accession | ACX92168 |
Protein GI | 261602565 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAG TAGGTATTTT AGATTCGACG TTAAGAGAAG GAGAACAAAC TCCTGGAGTA ATATTTACTG TAGACCAAAG AGTAGAGATA GCTAAGGCTC TATCCGATTT AGGAGTATCT ATGATAGAAG CCGGTCATCC GGCTGTATCT CCAGATATTT ACGAAGGGAT AAAAAGAATA GTCAAATTGA AAAAAGAGGG TATTATAACA TCAGAAATTG TAGGACACAG TAGAGCCGTA AAAAGAGATA TAGAAATTGC AGCAGAATTA GAGGTAGATA GGATAGCAAT ATTTTACGGC GTAAGTGATA TACATCTAAA GGCGAAACAT AAAGCAACAA GAGAAGAGGC TTTAAGGGTA ATAGCTGAGA CAATTAGTTA CGCTAGGAGT CACGGCGTAA AAGTCAGATT TACCGCAGAA GATGGTTCAA GGACAGACTT TGACTTCTTA GTTACAGTAT CGAGAACGGC TAGAGATGCA GGTGCGGATA GGGTTAGTAT AGCTGATACT GTAGGCATAT TATATCCATC AAAAACCAAG GAATTATTTA GCGCGTTAAT AAGGGAAGTT CCAAACTTGG AGTATGATAT TCACGCTCAC AATGACTTAG GTCTAGCAGT AGCAAATGCA TTGGCTGCAG TAGAAGGTGG AGCTACGATT GTTCATGCAA CGGTTAATGG GCTTGGAGAG AGGGTTGGTA TAGTACCTTT GCAACAAATC GTAGCAGCTA TTAAGTATCA TTTTGGTATA GAAGTAGTTA AACTAGATAA ATTACAGTAC GTTTCCAGTT TAATTGAAAA GTACAGTGGA ATTCCGATGC CACCTAATTA TCCCATAACT GGGGATTACG CTTTTTTGCA TAAGGCAGGA GTTCATGTTG CGGGTGTGTT GAGTGATCCT AGAACATATG AATTTATGCC TCCAGAGACG TTTGGTAGAA CAAGAGATTA CACTATTGAT AAATATACAG GAAAGCATGC GTTAAGAGAT AAATATGAAA AACTAGGTGT GAAAATCAGT GAGGCTGAAA TGGATCAGAT TTTAGCTAAA ATTAAGTCAA ATACGACTAT AAGATTTTAC AGAGATGTGG ATTTACTAGA GTTAGCTGAA GAAGTTACCG GAAGAGTTTT GAAGCCAAGA CCACCTGAGC AAATAGAAGC GTTAATTTCA GTTAAGTGTG ATTCTAACGT TTATACCACA TCAGTAACTC GTCGTTTATC AGTTATTAAT GGCGTTAAAG AGGTTATGGA AATTTCAGGA GATTATGACA TACTGGTCAA GGTTCAAGCT AAGGACTCTA ATGAATTAAA CCAGATAATC GAAAGTATAA GAGCAACTAA AGGTGTGAGA TCAACATTAA CATCATTAGT CCTTAAGAAA ATGTAA
|
Protein sequence | MIKVGILDST LREGEQTPGV IFTVDQRVEI AKALSDLGVS MIEAGHPAVS PDIYEGIKRI VKLKKEGIIT SEIVGHSRAV KRDIEIAAEL EVDRIAIFYG VSDIHLKAKH KATREEALRV IAETISYARS HGVKVRFTAE DGSRTDFDFL VTVSRTARDA GADRVSIADT VGILYPSKTK ELFSALIREV PNLEYDIHAH NDLGLAVANA LAAVEGGATI VHATVNGLGE RVGIVPLQQI VAAIKYHFGI EVVKLDKLQY VSSLIEKYSG IPMPPNYPIT GDYAFLHKAG VHVAGVLSDP RTYEFMPPET FGRTRDYTID KYTGKHALRD KYEKLGVKIS EAEMDQILAK IKSNTTIRFY RDVDLLELAE EVTGRVLKPR PPEQIEALIS VKCDSNVYTT SVTRRLSVIN GVKEVMEISG DYDILVKVQA KDSNELNQII ESIRATKGVR STLTSLVLKK M
|
| |