Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1650 |
Symbol | ileS |
ID | 4617048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 1498435 |
End bp | 1501368 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639784733 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_931145 |
Protein GI | 119873138 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0112735 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAATG GGGATTTTAA ACTCCTGCCC AGTTATAATC CTCATAAAGT AGAGACAGCA GTACTAGAGT TTTGGGATAA GAACAGAATA TTTGAGAAAT GGAAGACTTG GCGCGGCGGC CCTATATTCG CCTTTCTAGA GGGGCCTCCT ACCACAAACG GCATGCCACA TGTAGGACAT ATCAGAGGCC GTACGTATAA AGACGTAGTA TTGAGGTTTT ATAGACTTTT GGGCTACGAC GTATGGCCAC AAGGCGGCTG GGATATGCAA GGCATGCCCG TCGAGTGGGA GGTAGAGAAG AGACTTAAGC TAAAGAGTAA GAAAGAGGTG GAACTGTACG GGCTTGAACA GTTTGCAAAA GAGTGCAATA AGCTAGTAGA CGAATACCTC GCCTACTGGA GAGAGTGGGG AACCAGGAGA CTAGGACTTT GGCTAGACAT AGATAACGCA TATGAGACTA GGAGACCTGC ATATATAGAA TACGTTTGGC GGGTTATTAA GAGGGCGTAT GAACGCGGCC TGCTCGTAGA GGACTACAGA GTGTTGTGGT TCTGCCCCAG GTGCGAGACT TCGCTTAGCG ACCACGAAGT GGCGCTTGGC TACGAGGAGA GGAAAGACCC CTCGATATAC GTCAAATTCA AAGTAGAGGG GCGTGAAAAT GAGTATTTAA TCATTTGGAC CACAACGCCT TGGACTCTAG TTGATAATGA GGCCGTGGCG GTCCACCCGG AGTATGCCTA CGTCAAGGTT GAGGTGGAAA ATGGGGAAAG GTGGTGGGTC GCAGAACAGC TAGCGCCTAG GCTGATGGAA CTGTTCGGGG TAAAGAGGTG GCATATCGTA GAGGTGAGGA AGGGCTCCGA GCTCTTCGGC CTCCGCTACA TGCACCCACT TGCTGAAGAG GTGCCGGAGA GGGCGGGGAG AACCTACACA GTGGTCACCG CAGATTTCGT GACGCTAGAT CAAGGCACAG GCCTTGTACA CATAGCGCCC GGCCACGGCC CCGAGGACTT TGAAGTTGCT AGAAAGTACG GTCTCAGGGT TACAAACAGC GTGGAGATCA ACGGCATATA CAACGAAATG GGTGGCAAAT ACGCTGGGAA GTATGTACAC GACGTAGATA AAGAAATCAT CGAAGACCTC CGAAAGAAGG GCCTCCTAGT CAAGGCGGAG GAGATAAAGC ATGAGTACCC CCACTGCTGG AGGTGCGGCA CCAAGCTCAT ACTTAGGGCA GATAGACAGT GGTTCATTGC GATATCTAAG ATCAGAGAGC ATATGTACAA GGAGCTGAGA GGAGTAAACA TAGTACCTCA GAAGCTCAGG GACAGATTCG ATATCTTTGT CCAAAACGCT CGCGACTGGA ACATTTCAAG GAGTAGAGTG TGGGGTACCC CCCTGCCGAT ATGGCGCTGT AAAAAAGATG GGAGGATCCT CGTCGTAGGG TCCCTGGAGG AGTTGAAGAA GCTGGCTAAG GAGCTCCCGC CGGTAGACGA CTTCTGGCTT GTGCACAGGC CCTGGATAGA CAGAGTAGTG CTGAAGACTG AGGACTGCGA CGAGTGGGTT AGAGAGCCCT ACGTGATTGA CGTGTGGCTA GACAGCGGCG TTGCCTGGAT CGCGGCTGTA GACGGAGAGA GGAACAGAGA ACTCTGGTCT AAACTGTTCC CATACGACTT TGTGACCGAG GGCATAGACC AGACCAGGGG GTGGTTCTAC TCGCTACTGG CGTCTGCGAT GGTATACATC GGGAGAGCGC CGTATAAAAC TGTGTTAATA CAAGGCCTCA TCCTAGATAA ATACGGCCAG AAGATGTCTA AGAGCAGAGG CAACGTCATC TGGGCCAAGG ACCTCTTTGA GAAGTACGGC GCAGACCCAG TCCGTCTGTA CATCCTATTG AAGGCGGCGC CTTGGGAAGA CCTCGCCTTT GACCCCGACG AGGTAAAGAT GGCTATAAGC AATTTAAACA TACTATGGAA CATCGTAAAA TTTGCCGACA TGTATATGTC ACTTGATGGA TTCTCTGCAG AGAAGTATCC GCTTGAGGAG TGGATAGACA AAGCTCTTGA CGAAGATAGA TGGCTATTGT CTGAACTCAA CAAGTTGATA GAAGACTTTA CGCAATATAT CAGAAACTTT GAGTTTCACA AAGCGGCAAA CCTCTGGAGG GACTTTATAG TTGAGACTTT AAGCCACCGC TATATAAGAC TTCTACGTAG ACGTGTTTGG ACCGAAGAAC CTAGCGCCGA TAAATATGCC GCATATGCTG TCTTACACCA TGTGTTAAAA AACGTGCTAA TATTAGGCTC TATACTTGTA CCATTTATCA CCGAATATCT ATGGCAGACA TATGTTAAAA AATTCGAGAG AGAGACCGCA GAGTCTGTAC ATTTGGCGAA TTATCCAACC GCAGGCCCCA TAGATAAAGA GCTTATGGAA ATCTTCCATG AGTTATTCAC TGTCTTCTCT GCTTTAGCTG AGGCCAGAAA TAAGGCTGGC ATAAAACTCC GCTGGCCTAT AAGAGAGGTT TACATAAACG GAGGGAGATA TGTAGATAGA TATAAGGAAC TTCTAAAGTA TCTAAGCAAT GTAAAAGAGG TCAAAGTCGG CACATGTCCT AGCGAATATA TAAAAGCTAC AGAGGGCACA ATCGAGGTTT GTATACCTGC TAAGCTAGAG CCTGAGCTTT ACTATGAAGC GTTGGCAAGA GAAATCGTCC GTAGGATACA GGTCATGCGT AAAGAAGCTG GACTAGAGAT AAACGACGTC ATACAAATAG TAATAGACAC AGAGTTGGAA GATGTTAAAA AAGCAGTAGA GATCTTCCAA GACTACATAA AACGAGAAAC TCGCGCCATG GAGTTAAAAA TTGCAAAAAC GACAAGCGGC AAAGAGTGGG ACATATTAGG CGAAAGAGTG ACTATAGAAA TTAGAAAAAT ATGA
|
Protein sequence | MTNGDFKLLP SYNPHKVETA VLEFWDKNRI FEKWKTWRGG PIFAFLEGPP TTNGMPHVGH IRGRTYKDVV LRFYRLLGYD VWPQGGWDMQ GMPVEWEVEK RLKLKSKKEV ELYGLEQFAK ECNKLVDEYL AYWREWGTRR LGLWLDIDNA YETRRPAYIE YVWRVIKRAY ERGLLVEDYR VLWFCPRCET SLSDHEVALG YEERKDPSIY VKFKVEGREN EYLIIWTTTP WTLVDNEAVA VHPEYAYVKV EVENGERWWV AEQLAPRLME LFGVKRWHIV EVRKGSELFG LRYMHPLAEE VPERAGRTYT VVTADFVTLD QGTGLVHIAP GHGPEDFEVA RKYGLRVTNS VEINGIYNEM GGKYAGKYVH DVDKEIIEDL RKKGLLVKAE EIKHEYPHCW RCGTKLILRA DRQWFIAISK IREHMYKELR GVNIVPQKLR DRFDIFVQNA RDWNISRSRV WGTPLPIWRC KKDGRILVVG SLEELKKLAK ELPPVDDFWL VHRPWIDRVV LKTEDCDEWV REPYVIDVWL DSGVAWIAAV DGERNRELWS KLFPYDFVTE GIDQTRGWFY SLLASAMVYI GRAPYKTVLI QGLILDKYGQ KMSKSRGNVI WAKDLFEKYG ADPVRLYILL KAAPWEDLAF DPDEVKMAIS NLNILWNIVK FADMYMSLDG FSAEKYPLEE WIDKALDEDR WLLSELNKLI EDFTQYIRNF EFHKAANLWR DFIVETLSHR YIRLLRRRVW TEEPSADKYA AYAVLHHVLK NVLILGSILV PFITEYLWQT YVKKFERETA ESVHLANYPT AGPIDKELME IFHELFTVFS ALAEARNKAG IKLRWPIREV YINGGRYVDR YKELLKYLSN VKEVKVGTCP SEYIKATEGT IEVCIPAKLE PELYYEALAR EIVRRIQVMR KEAGLEINDV IQIVIDTELE DVKKAVEIFQ DYIKRETRAM ELKIAKTTSG KEWDILGERV TIEIRKI
|
| |