Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_1780 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 1585762 |
End bp | 1588923 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | isoleucyl-tRNA synthetase |
Protein accession | ACX91996 |
Protein GI | 261602393 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0700208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTATCTGCCG TTATCAAGCC CCTTACTGGT AATTATGATC CTAAGAGGAT TGAAGAAGAA ATAATTTCGT ATTGGGAAGA GAATAAAATT TACAATAAAC TAAAAGATAT TGTAAGCAAA AGAAGGGAAA AGTTTCTATT TATAGATGGT CCTCCATACC CTTCAAGTCC TACTCCTCAT ATAGGGACGA TTTGGAATAA GGTGATAAAG GATTGCATAC TGCGTTATCA GAGGGTTCTG GGCAAAAGGG TTCATGACCA GCCAGGATAT GATACACATG GATTACCTAT AGAAGTTGCA ACTGAAAAAC TACTTGGAAT ATCAAATAAG CAAGAGATTA TAGATAAAAT CGGAGTTGAA ACTTTCATTA ATAAATGCAA AGAGTTTGCA TTATCCAATG CAGATAAAAT GACACAGAAC TTCAAGAACG TTGGAGTCTT TATGGACTGG GAGAGGCCTT ACTATACATT AGACCCTTCT TATATAAGCT CTTCATGGAG TGTAATTAAG AAAGCTTATG AAAAAGGGAT GCTAGATAAA GGCACTGCAG TATTGCATTG GTGTCCTAGA TGTGAAACGA CTCTATCAGA TTATGAGGTG TCTGAGTATA GAGACTTAGA GGATCCCTCC ATTTATGTCA AATTTAAAAT CAAAGGTGAG AAAAACAGAT ATCTATTAAT ATGGACTACC ACACCATGGA CTATACCATC TAATGTCTTT GTAATGATAA ATAAGGACTA TGATTATGCT GATGTAGAGG TTAATGGTGA GATACTCGTA ATTGCAAAGG ATCGTGTAGA AGCTGTTATG AAAGAGGCAA GTATAACTAA TTATAAGATA CTCAGAACCT ATAAAGGTAG TGAGTTAATA GGAATAAAGT ACGAGCATCC CCTAAGAGAG TTTGTTAGTG CCCAGACTAA ATTAGATGAC TTTCATCAAG TCGTTGATGC TGGTAATATA GTCAAATTAA CTGATGGTAC CGGTCTAGTT CATAGTGCTA CTGGACATGG AGAAGAGGAT TTCACGGTAG GTCAGAAATA CGGTTTTCCA GTAGTGATGT TCGTTAACGA TAGAGGAGAA TTTACTGAAG AAGGTGGAAA ATATAAAGGA TTGAAAGTTA GGGACGCATC TAAGGCGATA ATTAGCGATT TGAAATCTAA AAATACGTTG TTCTTTGAAG GGAAAATAGT TCATCGTTAT CCGGTATGTT GGAGATGTAA GACTCCACTA ATACTTAGAG CTATTGATCA ATGGTTCATA AGAGTTACAA AAATAAAAGA TAAAATGCTC AATGAAATAG AGAAAGTAAA CTGGATTCCA GATTGGGGTA AATCCAGAAT ATCTAACATG GTTAAGGAAC TCAGAGATTG GGTTATAAGT AGACAAAGGT TCTGGGGAAC TCCTCTTCCC ATATGGATCT GTGAAAGGTG TAATAATGTT ATGGTAGTTG GGAGTAAAGA AGAATTAGAG AGTATAGCAA TAGATCCAGT CCCTAATGAC TTACATAGAC CGTGGATAGA TAATGTAAGG GTAAAATGTA ACAAGTGTGG TGGAGTTGCT AAGAGAATCC CTGATGTTGC AGATGTTTGG TTTGATAGTG GTGTGGCTTT CTTTGCTAGT TTAGGTAAAG ATTGGCAAGA GAAGTGGAAG GAGTTAGGTC CAGTAGATCT AGTTCTAGAA GGTCATGATC AGTTGAGGGG TTGGTTCTTT AGTTTGCTTA GATCTGGGCT AATATTACTA GATAGAGCTC CATATACTTC TGTATTGGTT CATGGATTTA TGCTAGACGA ACAAGGTAGA GAAATGCATA AGAGTCTAGG TAATTATGTT GAACCTTCAG TAGTAGTTGA AAAATATGGG AGAGACATAT TACGTTTATG GTTACTTAGG AATACTACAT GGGAAGATGC AAAATTTTCA TGGAAAGCGT TAGAGCTAAC TAAAAGGGAT TTACAAATAA TTTGGAACAC ATTCGTCTTC GCGTCAATGT ATATGAATTT AGATAACTTT GAACCCGATA AGTATACTCT TGATGATATT ATAAAATATG CTAAGATAGA GGATTTATGG ATATTATCAA GGTTTAACTC AATGCTAAAG AAAGTAAATG AATCCATGAA GGATTACAAG GTTCACGAAA TGACTAATTA CTTGATAAAC TTTCTAATTG AGGATGTCAG TAGGTTTTAC ATAAGGCTAA TAAGAAAAAG GGCGTGGATA GAAGCTAATA CTCAAGATAA GATAGCAATG TATTATATTC TTTATTATAT ATTAAAACAA TGGATTATAT TAGCTTCTAC TATAATTCCA TTTATTTCTG AAAAAATATA TAAATCCTTT GTAGTTAATG CTAAAGAATC AGTTTCAATG GAATCTAGTA TCAATTATGA TGAGAGATTT ATCGATAATG AATTGGAAAG AGCCTTTGAA GTTGCTAGAG AAATAAACGA GGCTTCGTTG AACGCTCGAG CTAAAGCTGG AATAAAATTA AGATGGCCAT TGGCTAAAGT TTATATCTTC ATAGAAAACG AGGATACATT GGCTAAGGTT GGTAGAATAA AAGATGTCTT GATATCTATG CTAAACGCTA AAGATATAGA AATAAGCAAA ATAGAAGGAT TTAAGAGTTT CAGTAAATAT AAGGTCGAGC CGAATAGGTC GATCATAGGG AAGGAATATA AGAGTATGTC GCCAAAAATA GTAGAATATA TTGAGAATAA TAGAGATATA ATAGCTATGG ATATACTTAA TAAAAAGCAG CATGTCGCTA AGATAGATAA TTTTGATATA ATACTTAATG CTTCGTATGT GATTATCTCA GAAGAAACAG TTGAAGGATT CATTTCATCT AAATTTAGTA AGGGTATTGT GGTTATAAGT AAGGAAATTT CGGAGAGTGA AGAGGAAGAG GGATTAATTA GGGACATTAT AAGGAGGATA CAATTCATGA GGAAACAACT AAAACTAAAT GTTTTAGATT ATATTGAGAT TAGCATGAAG GTACCAGAAG AAAGAGTAAA AACTATTCAG AAATGGGAGG AATTTATTAA GAGTGAGACA AGGGCTAGTA ACATAATTCT AGGTGAAGCT AAGGGAGATA TTACAATGGA TTGGGATATA GAAGGGGAAT CTTATATAAT AGGGATAAAG AAGTCTACAT GA
|
Protein sequence | MSAVIKPLTG NYDPKRIEEE IISYWEENKI YNKLKDIVSK RREKFLFIDG PPYPSSPTPH IGTIWNKVIK DCILRYQRVL GKRVHDQPGY DTHGLPIEVA TEKLLGISNK QEIIDKIGVE TFINKCKEFA LSNADKMTQN FKNVGVFMDW ERPYYTLDPS YISSSWSVIK KAYEKGMLDK GTAVLHWCPR CETTLSDYEV SEYRDLEDPS IYVKFKIKGE KNRYLLIWTT TPWTIPSNVF VMINKDYDYA DVEVNGEILV IAKDRVEAVM KEASITNYKI LRTYKGSELI GIKYEHPLRE FVSAQTKLDD FHQVVDAGNI VKLTDGTGLV HSATGHGEED FTVGQKYGFP VVMFVNDRGE FTEEGGKYKG LKVRDASKAI ISDLKSKNTL FFEGKIVHRY PVCWRCKTPL ILRAIDQWFI RVTKIKDKML NEIEKVNWIP DWGKSRISNM VKELRDWVIS RQRFWGTPLP IWICERCNNV MVVGSKEELE SIAIDPVPND LHRPWIDNVR VKCNKCGGVA KRIPDVADVW FDSGVAFFAS LGKDWQEKWK ELGPVDLVLE GHDQLRGWFF SLLRSGLILL DRAPYTSVLV HGFMLDEQGR EMHKSLGNYV EPSVVVEKYG RDILRLWLLR NTTWEDAKFS WKALELTKRD LQIIWNTFVF ASMYMNLDNF EPDKYTLDDI IKYAKIEDLW ILSRFNSMLK KVNESMKDYK VHEMTNYLIN FLIEDVSRFY IRLIRKRAWI EANTQDKIAM YYILYYILKQ WIILASTIIP FISEKIYKSF VVNAKESVSM ESSINYDERF IDNELERAFE VAREINEASL NARAKAGIKL RWPLAKVYIF IENEDTLAKV GRIKDVLISM LNAKDIEISK IEGFKSFSKY KVEPNRSIIG KEYKSMSPKI VEYIENNRDI IAMDILNKKQ HVAKIDNFDI ILNASYVIIS EETVEGFISS KFSKGIVVIS KEISESEEEE GLIRDIIRRI QFMRKQLKLN VLDYIEISMK VPEERVKTIQ KWEEFIKSET RASNIILGEA KGDITMDWDI EGESYIIGIK KST
|
| |