Gene Information |
Locus tag | Hlac_0887 |
Symbol | ileS |
ID | 7401258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 878566 |
End bp | 881718 |
Gene Length | 3153 bp |
Protein Length | 1050 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643707952 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_002565555 |
Protein GI | 222479318 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
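The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch only: the function names are illustrative and not part of any database tool, and it assumes 1-based inclusive coordinates and a protein length that excludes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Hlac_0887.
length_bp = gene_length(878566, 881718)
print(length_bp, protein_length(length_bp))  # prints: 3153 1050
```

Under these assumptions the values agree with the Gene Length (3153 bp) and Protein Length (1050 aa) fields.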
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACGATA TCGACGACCA GTACACGCCC GCGGACGTCG AGTCGGCGGT CGACGAGTAC TGGGACGAGA GCGACGCCTA CGATTCGGCG AAGGAGGCGC ACGCCGACGA CCCGCCCTTC TTCTTCGTGG ACGGGCCGCC GTACACCTCC GGCCAGATGC ACCTCGGCAC GGCGTGGAAC AAGACGCTGA AGGACGCGAT CATCCGCCAC AAGCGGATGA CCGGCCACCG CGTCACCGAC CGCCCGGGCT ACGACATGCA CGGACTCCCG ATCGAGGTGA AAGTCGAGGA GGAACTCGGC TTCGAGAACA AGCGGGACAT CGAGGAGTAC GGCATGGAAT CGTTCATCGA GAAGTGCAAA GAGTTCGCCC TACGCAATCG GGCGGCGATG GACGAGGACT TCCAGTCGAT CGGCGTGTGG ATGGACTGGG ACGACCCCTA CGAGACGATC GATCCCGAGT ACATGGAGGC CGCGTGGTGG GCGTTCTCGA ACGTCGCCGA TAACGGGCTC GTCGAGCAGG GGAAGCGCTC CATCTCGCAA TGCCCGCGCT GTGAGACCGG ACTCGCCAAC AACGAGGTCG AATACGAGGA CGTTGAGGAC CCCTCCATCT ACGTGAAGTT CCCGCTCGCC GACCGCGAGG GAAGTCTCGT CATCTGGACG ACGACGCCGT GGACCATTCC CGCGAACACC TTCGTCGCCG TCGACGAGGA CCTCACCTAC AACGCGGTCC GCGCCGAGAG GGACGGCGAG AGCGAACTCA TCTACGTCGC CGCCGACTGC GTCGAGGACG TGCTCCAGGA GGGTCGCTAC GAGGAGTACG AGGTCGAAGC CGAGCTGTCG GGCGCAGAGC TCGTCGGCTG GGCGTACGAC CACCCGCTCG CCGACCACGT CGCCGAATAC CCGAACTTCG AAGGCGCCGG GCAGGTGTAC GCCGCCGACT ACGTCGAGGC CGACCGCACC GGACTCGTCC ATTCCGCGCC GGGACACGGG GAGGAGGACT TCCACCGCGG TAGCGAGTTG GGCCTCGACA TCTACTGTCC GGTCGGTCCC AACGGCGAAT TCACCGAGGC CGCCGGGGAG TACGCGGGCG AGTTCGTCCG CGACGCCAAC GACGACATCG TCGACGACCT GGTCGCCGAC GGCCACATGC TCGCGCACGG CACAGTGAAC CACAGCTACG GGCACTGCTG GCGGTGTGAC ACCGGCATCA TCCAGCTGGT CACCGACCAG TGGTTCATCT CGATCACGGA CGTGAAGGAG GACCTCCTCG ACAACATGGA GCAGTCCGAG TGGCACCCGC AGTGGGCGCG GGACAACCGG TTCCGCGACT TCATCGAGGA CGCGCCCGAC TGGAACGTCT CCCGCCAGCG CTACTGGGGG ATCCCGATCC CGATTTGGGT GCCGGAGGAC GCCGAGGCCG GGAACCTCGA CGACGATATG ATCGTGATCG GCACCCGCGA GGAGCTCGGC GAGCGCGTCG AGGAGGACAT CGACCCCAAG ACGATCGACC TTCATCGCCC CGCCGTCGAC GACCTGACGG TCGTCGAGGA CGGGACCACC TACCGCCGAG TCGAGGACGT GTTCGACGTG TGGCTCGACT CCTCAGTGGC CTCGTGGGGC ACGCTCGGGT ACCCGAGCGA TGAGACCGCC CACGACGAGC TGTGGCCCGC CGACTTCATC GTCGAGGCGC ACGACCAGAC CCGCGGCTGG TTCTGGTCGC AGCTCGGGAT GGGAACCGCC GCAGTCGGAG AAATTCCCTA CGAAGAGGTC CTCATGCACG GGTTCGCCAA CGACGAGAAC 
GGCCGAAAGA TGTCCAAGTC TGTCGGTAAC ATCGTCACGC CCGAGGAGGC GATCGAGCGC GCCGGCCGCG ACCCGCTCCG CACCTACCTG CTCAGCCACG ACCAACAGGG AGTCGACCTC GCGTTCGAAT GGGACGGACT CGGTGAGTTG CAGGGGAAGC TCAACATTCT CTGGAACGTC TTCCGATTCC CGCTAGAGTA TATGGATCTC GACGGCTACG ACCCCGCCGA GGCCGACCTC GATGACGGCG AACTCGAACT CGTCGACGAG TGGGTGCTCT CCCGGCTCCA GTCCGTCGAG ACGGAGGTGA CGGAGGCGTG GGCCGACTAC CGCGTCAGCG ACGCCGTCAA CGCCGTGATC GAGTTCGTTA CGCAGGACGT GTCGCGCTTC TACGTGAAGG CGGTCCGCGA CCGCATGTGG GAGGAGGCCG ACTCCGCCTC CAAGCGCGGC GCCTACGCGA CGCTCGCGAC CGCCCTCGAC GAGACGACCC GGCTGCTCGC GCCGATCGCG CCGTACATGA CCGAGCGCAT GTACCAGACG CTCGACGGTG AGGCCACGAC CGTCCACCAG CTCGACTACC CGGAGCCCGA CGAGGATCTC CACGATCCCG AGCTTGAGCG TGACGTGGCC GTCCTCCGCG ACGTGGAGGA GGCCGCCGCG AACGCCCGTC AGCAGGCCGG TCGCAAGCTC CGCTGGCCCG TGCCGCGCGT GGTCGTCGAG AGCGACGACG AGAACGTGAT CGCCGCGGTC GAGCGGCTCT CGGATCTGAT CGCCGACCGC GTCAACGCCC GCGAAGTGAC CGTCACCGAC GCCTTCGACG AGTTAGTCGA GACCGCCGAG CCGCAGATGG GTGCGATTGG TCCCGCCTTC GGCGCCGACG CCCAGAAGGT GATGAACGCG GTGCAGGGCG CGACCCGCGC GGCGGTCGAG GGCGGCGAGG TGACCGTCGA CGGCGATCCG GTCGACCTCG CCGACGAGAT GGTCGAGTAC GTCGCGGAGC CGCCGGAACA CGTCTCCGGT GCCGACTTCG ACGGCGGCGC CGTCTACGTC GATACCTCCC TGACCCCGGA GATCGAATCG GAGGGCTACG CCCGCGACGT GATCCGCCGG ATTCAGGAGA TGCGCAAGGA GCTCGATCTG GACGTGGAGG CCCGGATCCG CGTCGGCGTC ACGGTCGACG ACGATCGGGT GGCCGACTTC GTCGACGAGC ACGCCGACCT GATCGCCGGC GAGGTGCGCG CCGACGCGTG GATCGACGAC CCGAGCGACG CCGCGGACGC CGAGGGCGGT CTCGTCGAGG AGTGGGAGGT CGAAGGTGTC GCGGTCACGA TCGGAATCGA ACCGGTCGCG TGA
Protein sequence | MDDIDDQYTP ADVESAVDEY WDESDAYDSA KEAHADDPPF FFVDGPPYTS GQMHLGTAWN KTLKDAIIRH KRMTGHRVTD RPGYDMHGLP IEVKVEEELG FENKRDIEEY GMESFIEKCK EFALRNRAAM DEDFQSIGVW MDWDDPYETI DPEYMEAAWW AFSNVADNGL VEQGKRSISQ CPRCETGLAN NEVEYEDVED PSIYVKFPLA DREGSLVIWT TTPWTIPANT FVAVDEDLTY NAVRAERDGE SELIYVAADC VEDVLQEGRY EEYEVEAELS GAELVGWAYD HPLADHVAEY PNFEGAGQVY AADYVEADRT GLVHSAPGHG EEDFHRGSEL GLDIYCPVGP NGEFTEAAGE YAGEFVRDAN DDIVDDLVAD GHMLAHGTVN HSYGHCWRCD TGIIQLVTDQ WFISITDVKE DLLDNMEQSE WHPQWARDNR FRDFIEDAPD WNVSRQRYWG IPIPIWVPED AEAGNLDDDM IVIGTREELG ERVEEDIDPK TIDLHRPAVD DLTVVEDGTT YRRVEDVFDV WLDSSVASWG TLGYPSDETA HDELWPADFI VEAHDQTRGW FWSQLGMGTA AVGEIPYEEV LMHGFANDEN GRKMSKSVGN IVTPEEAIER AGRDPLRTYL LSHDQQGVDL AFEWDGLGEL QGKLNILWNV FRFPLEYMDL DGYDPAEADL DDGELELVDE WVLSRLQSVE TEVTEAWADY RVSDAVNAVI EFVTQDVSRF YVKAVRDRMW EEADSASKRG AYATLATALD ETTRLLAPIA PYMTERMYQT LDGEATTVHQ LDYPEPDEDL HDPELERDVA VLRDVEEAAA NARQQAGRKL RWPVPRVVVE SDDENVIAAV ERLSDLIADR VNAREVTVTD AFDELVETAE PQMGAIGPAF GADAQKVMNA VQGATRAAVE GGEVTVDGDP VDLADEMVEY VAEPPEHVSG ADFDGGAVYV DTSLTPEIES EGYARDVIRR IQEMRKELDL DVEARIRVGV TVDDDRVADF VDEHADLIAG EVRADAWIDD PSDAADAEGG LVEEWEVEGV AVTIGIEPVA
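The sequences above are displayed in space-separated blocks of 10 characters. As a rough sketch of how such a blocked string might be cleaned and its GC fraction computed (the helper names are hypothetical, and only the first two display blocks of the gene sequence are used as a short demonstration):

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and newlines that separate the display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGGACGATA TCGACGACCA")
print(len(demo), gc_fraction(demo))  # prints: 20 0.5
```

Applied to the full gene sequence, the result of `gc_fraction` can be compared against the GC content field listed under Gene Information.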