Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0677 |
Symbol | ileS |
ID | 6165322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 606899 |
End bp | 609832 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641667830 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001794062 |
Protein GI | 171185143 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGTG GGGATTTTAA GCTACAGCCT ACCTACGTCC CATACGAGGT TGAGAGACGT GTCCTAGAGT ACTGGCGGAC AAACGGGATA TTCCAGAAGT GGAAGAGCTG GCGCGGAGGC CCCATCTTCG CCTTCCTTGA GGGCCCGCCT ACCACCAACG GGATGCCGCA CGTAGGCCAC GTCAGAGGAC GGACTTACAA AGATGTGGTG CTTAGGTTCC ACAGGCTTCT GGGCTACGAC GTCTGGCCGC AGGGTGGTTG GGACATGCAG GGCATGCCGG TTGAGTGGGA GGTGGAGAAG AGGCTTAAGC TAAGGAGCAA GAAGGAGGTT GAGCGATATG GGCTTGAGCA GTTCGCCAAA AAGTGCAACG AGCTTGTGGA GGAGTATCTC ACATACTGGA GGGAGTGGGG GACCGAGAGG CTGGGGCTCT GGCTTGACGT AGAAAACGCG TACGAAACCA GGCAGCCTAC ATACATTGAA TACGCTTGGC GCGTGATCAA GAAGGCGTAT GAACGCGGTT TGTTGGTAGA GGACTACAGA GTGTTGTGGT TCTGCCCCCG GTGTGAGACC TCGCTCAGCG ACCACGAGGT GGCCCTTGGC TACGAGGAGA GGAGAGACCC CTCGATATAC GTCAAGTTTA GGGTTGAGGG CCGCGGGGAC GAGTACCTCG TCATCTGGAC CACGACCCCC TGGACCCTAG TCGACAACGA GGCCGTGGCG GTTCACCCGG AGTACGCCTA CGCCAAGGTC GAGGTTGAGA ACGGGGAGAA GTGGTGGGTC GCGGAGCAGC TAGCGCCAAG GCTGATGGAG CTGTTTGGGA TAAGGAGGTG GCGCATCGTA GAGGTGAAGA GGGGCTCCGA GCTCTTCGGC CTCCGCTACA CGCACCCACT TGCTGAAGAG GTGCCGGAGA GGGCGGGGAG GACATACACG GTGGTCACCG CCGATTTTGT GACTCTAGAC CAAGGCACGG GACTTGTACA CATGGCGCCC GGCCACGGCC CCGAGGACTT CGAAGTCGCG AAGAAGTACG GTCTCAGGGT TACTAACAGC GTGGAGATCA ACGGCATATA CAACGAAATG GGCGGCAAAT ACGCTGGGAA GTATGTACAC GACGTAGATA AAGAAATCAT CGAAGACCTC CGAAAGAAGG GACTCCTAGT CAAGGCGGAG GAGATAAAGC ATGAGTACCC CCACTGCTGG AGGTGCGGCA CCAAGCTCAT ACTTAGGGCA GATAGGCAGT GGTTCATCGC TATATCTAAG ATCAGGGAGC ACATGTACAA AGAACTAAGG GGGGTAAACG TGGTCCCCCA GAAGCTCAGG GATAGATTCG ACATCTTTGT CCAAAACGCC CGCGACTGGA ACATCTCAAG GAGCAGGGTG TGGGGCACCC CGCTCCCGAT ATGGCGCTGT AGAAAAGACG GGAGGATCCT CGTCGTGGGG TCTCTGGAGG AGCTGAAGAG GCTGGCTAAG GAGCTCCCGC CAGTAGACGA CTTCTGGCTT GTGCACAGGC CCTGGATAGA TAGAGTAGTG CTGAAGACCG AGGACTGCGA CGAGTGGGTT AGAGAGCCCT ATGTGATGGA CGTGTGGCTA GACAGCGGAG TTGCTTGGAT CGCAGCCGTA GACGGAGAGA GAAACGGAGA ACTCTGGTCT AGGCTGTTCC CATACGACTT CGTAACAGAG GGCATAGACC AGACCAGAGG GTGGTTCTAT TCGCTACTGG CCTCCGCGAT GGTTTACGTC GGAAAGGCCC CATACAAAAC CGTATTGATC CAGGGCCTCA TCCTAGACAA ACACGGCCAG AAGATGTCTA AGAGCAAGGG CAACGTCATA TGGGCGAGGG ACCTCTTTGA GAAATACGGC GCGGACCCGG TCCGGCTCTA CATCCTATCG AAGGCGGCCC CCTGGGAGGA CCTTGCCTTT GACCCCGACG AGGTGAAGAC CACAATAAGC GATTTAAACA TCCTATGGAA CGTCGTAAAA TTCGCAGACA CCTACATGGC GCTAGACGGC TTCACAGCCG AGAAGTACCC ACTCGAGAAG TGGCTCAGTA AGGCGCTAGA GGAGGATAGA TGGCTGCTCT CCGAATTCAA CCAGCTTGTG GAGGCGTTTA CACAATACAT GAAAAACTAC GAGTTCCACA AGGCCGCCAA CCTCTGGAGA GAGTTCGTTG TCGAGACGCT CAGCCACCGC TACATAAGAC TACTACGTAG GCGCGTCTGG AGCGAAGAAC CAAGCGACGA CAAATACGCA GCATACGCCG TCTTACACCA CGTGCTGAAA AACGTGATAG TACTCGGCTC TATATTTACG CCCTTTGTGG CGGAGTACCT ATGGCAGGCC TACGTGAAAA AGTACGAAGG TGGAGCCGCC GAGTCGGTGC ACCTCGCGAG CTACCCAACC GCGGGCCCCA TCGAGAGAGA GCTGGTGGAC GCCTTCCGCG AACTGTTCAC GGCTTTTTCA GCCCTAGCCG AAGCCAGAAA CAGAGCCGGT ATAAAACTCC GCTGGCCGAT AAGAGAGGTC TACATAAACG GCGGAAGATA TCTCGATAGA TACAGGGAGC TTCTGAAATA CCTCGGCAAC GTGAAAGAGG TGAAGACGGG GAGTTGCCCC AGTGGATATG TAAAAGCCTC TGAAGACGCC GTCGAGGCTT GTATACCCCC CAAGCTTGAG CCAGAGCTTT ACTACGAGGC GTTGGCTAGG GAGATCGTCC GCAGGATCCA GGTGATGCGT AAGGAGGCTG GGCTAGAGAT AAACGACATG ATCAAGGTCG CCGTTGGCAC CAAGTCGGAA GACGTCAGAA AGGCTGTGGA GACGCTTAAG GACTACATAC AGCGGGAGAC CCGTGCAGTG GAGCTGACTA TTGGAGAGGA AGTAGACGGC AAGGTGTGGG AGATATCAGG CGAAAAGGTG GCCATAGCGA TAAGGAAGGC CTAG
|
Protein sequence | MSRGDFKLQP TYVPYEVERR VLEYWRTNGI FQKWKSWRGG PIFAFLEGPP TTNGMPHVGH VRGRTYKDVV LRFHRLLGYD VWPQGGWDMQ GMPVEWEVEK RLKLRSKKEV ERYGLEQFAK KCNELVEEYL TYWREWGTER LGLWLDVENA YETRQPTYIE YAWRVIKKAY ERGLLVEDYR VLWFCPRCET SLSDHEVALG YEERRDPSIY VKFRVEGRGD EYLVIWTTTP WTLVDNEAVA VHPEYAYAKV EVENGEKWWV AEQLAPRLME LFGIRRWRIV EVKRGSELFG LRYTHPLAEE VPERAGRTYT VVTADFVTLD QGTGLVHMAP GHGPEDFEVA KKYGLRVTNS VEINGIYNEM GGKYAGKYVH DVDKEIIEDL RKKGLLVKAE EIKHEYPHCW RCGTKLILRA DRQWFIAISK IREHMYKELR GVNVVPQKLR DRFDIFVQNA RDWNISRSRV WGTPLPIWRC RKDGRILVVG SLEELKRLAK ELPPVDDFWL VHRPWIDRVV LKTEDCDEWV REPYVMDVWL DSGVAWIAAV DGERNGELWS RLFPYDFVTE GIDQTRGWFY SLLASAMVYV GKAPYKTVLI QGLILDKHGQ KMSKSKGNVI WARDLFEKYG ADPVRLYILS KAAPWEDLAF DPDEVKTTIS DLNILWNVVK FADTYMALDG FTAEKYPLEK WLSKALEEDR WLLSEFNQLV EAFTQYMKNY EFHKAANLWR EFVVETLSHR YIRLLRRRVW SEEPSDDKYA AYAVLHHVLK NVIVLGSIFT PFVAEYLWQA YVKKYEGGAA ESVHLASYPT AGPIERELVD AFRELFTAFS ALAEARNRAG IKLRWPIREV YINGGRYLDR YRELLKYLGN VKEVKTGSCP SGYVKASEDA VEACIPPKLE PELYYEALAR EIVRRIQVMR KEAGLEINDM IKVAVGTKSE DVRKAVETLK DYIQRETRAV ELTIGEEVDG KVWEISGEKV AIAIRKA
|
| |