Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0089 |
Symbol | |
ID | 8740652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 98661 |
End bp | 101867 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646510652 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_003401663 |
Protein GI | 284163384 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGAT TCGGCGAGGT CGACGACCAG TACGATCCGC ACGAACTCGA GGGGCGGATC TTCGACTACT GGGACGACGT CGACGCCTAC GAGCAGACGG TCGAGCACCG ATCCGACGGC GAGTCGTTCT TCTTCGTCGA CGGGCCGCCG TACACGTCTG GGTCGGCGCA CATGGGAACG ACCTGGAACA AGTCGCTGAA GGACGTCTAC CTGCGGTTCC TGCGGATGCA GGGGTACGAC GTCACCGACC GGCCGGGCTA CGACATGCAC GGCCTGCCCA TCGAGACCCG CGTCGAGGAG CGACTCGGCT TCGAGAACAA GAAGGACATC GAGGAGTTCG GCGAGGAGAA CTTCATCCAG GAGTGTAAGG ACTACGCCGA CGAGCAACTC GAGGGGCTCC AGGAGGACTT CCAAGACTTC GGCGTCTGGA TGGACTGGGA CGACCCCTAC CGGACGGTCG AACCGGAGTA CATGGAGGCC GCCTGGTGGG GCTTTTCGAA GGCCGCCGAC CGCGGGTTAG TCGAGAAGGG CCACCGCTCG ATCTCCCAGT GTCCGCGCTG CGAGACCGCG ATCGCGAACA ACGAGGTCGA GTACGAGGAC GTCGAGGACC CCTCGATCTA CGTCAAGTTC GACCTCGAGG ACCGCGAGGG CAAGATCGTC ATCTGGACGA CCACGCCGTG GACGGTTCCG GCGAACACGT TCGTCGCCGT TGACCCCGAA GGCGACTACG TCGGCGTCCG GGCCGAGAAG GACGGCGAGG AGGAACTGCT GTACGTCGCT GACGCGAAAC ACGAGGAGGT CCTCCGAGAG GGCCGCTACG ACGACTACGA GGTCGTCGAG GAGCGCTCCG GCGAGGAGCT GATCGGCTGG TCCTACGAGC ACCCGCTCGC CGAGGAGGTA CCCGACCACG TCGACGTCGA GGGGGCGCTC GAGGTCTACG CCGCCGACTA CGTCGACACC GACGGCGACG GCACGGGGCT CGTCCACTCC GCGCCCGGCC ACGGTGAAGT GGACTTCGAG CGCGGCCGCG AACTCGGCTT CCCGATCTTC TGTCCCGTCG GCAGCGACGG CGTCTACACC GAGGAGGCCG GCACGTACGA GGGCCAGTTC GTCAAGGACG CCGATCCGGA GATCACCGAC GACCTCGAGG ACAACGGCGC GCTGCTCGCC TCGGGTACCG TCCACCACAG TTACGGCCAC TGCTGGCGCT GTGACACGGG GATCCTCCAG ATCGTCACCG ACCAGTGGTT CATCACGATC ACCGACGTCA AGGACGAACT GCTGGACAAC ATCGAGGACA GCGAGTGGCA CCCCGAGTGG GCCCGCGACA ACCGTTTCCG GGACTTCGTC GAGGAGGCCC CCGACTGGAA CGTCTCCCGC CAGCGCTACT GGGGCATCCC GCTGCCCGTT TGGACGCCCG AGGACCGAGA TGATGACGAG GACATGATCG TCATCGGCGA CCGCGAGGAA CTCGCCGATC GGGTCGATCA GGACGTCGAC CCCGAGGACG TCGACCTCCA CAAGGACACG GTCGACGACC TCACCATCAC CGAGGACGGG ACCACCTACA CGCGCGTCCC CGACGTCTTC GACGTCTGGC TCGACTCCTC GGTCGCCTCC TGGGGCACGC TGAACTACCC CGAAGACGAC AGCCGCTTCG ACGAGCTCTG GCCCGCCGAC TTCATCCTCG AGGCCCACGA CCAGACGCGG GGCTGGTTCT GGTCCCAGCT GGGGATGAGC ACCGCCGCGC TCGGCGAGAG CCCGTATCAG GAGGTGCTGA TGCACGGCCA CGCGCTGATG CCCGACGGCC GCGCGATGTC CAAGTCCAAG GACATCCTGA TCGACCCCCA CGAGGCCATC GACCGCCACG GCCGCGACGT CATGCGCATG TTCCTCCTGT CGAACAACCC GCAGGGCGAG GACATGCGCT TCGACTGGGA CGGGATGCAG ACGATGGAGA ACCACCTCCG GACGCTGTGG AACGTCTTCC GGTTCCCGCT GCCCTACATG CGCTTAGACG AGTTCGATCC CACCGCAACC ACTCTCGAGG ACGTCGATTC GGACCTCGAA CTCATCGACG AATGGGTGCT CGCCCGCCTC CAGTCCACGA AGGCCGAGAT GACCGAGGCC TTCGAGGACC GCCGGCAGGA CCGCGCGCTC GACGCCCTGA TCGAGTTCGT CGTCGAGGAC GTCTCGCGAT TCTACGTGCA GGCCGTCCGC GAGCGCATGT GGGCCGAGGA GGACAGCGGC TCGAAGCGGG CCGCCTACGC GACGATCTAT CAGGTCCTCC GGGAGAGCGT CGCCCTGCTC GCGCCGTACG CGCCGTTCAT CAGCGAGGAA ATCTACGGGA CGCTGACCGG CGACGACGGG TTCGACACCG TCCACATGGA GGACTGGCCC GAGGTCGACG AGTACTGGCA GGACGAACAG CTCGAGGACG ACGTCGCCAT CCTCCGCGCG ATCGAAGAGG CCGGCGCGAA CGCCCGCCAG CAGGCCGGCC GCAAGCTGCG CTGGCCCGTC CCGCGGGTCG TCGTGGCTGC TGACGACGAC CGCGTCGTCG AGGCCGTCGA GCGCCACACG CCGCTGCTCG AGGACCGACT CAACGCCCGC GAGATCGAGC TCGTCTCCGC GGAGGACCGC TGGGAAGAAC TGCAGTACAG CGCCGAGGCC GACATGAGCG AACTCGGACC GGCTTTCGGC GACCGCGCCG GGCAGGTCAT GAACGCGCTC AACGAGGCCC GCATCGACGA GCCGAGCCTC GAGGCGATCG AAGAGGCCGT TGCAGACGTG CTCGAGGAGG GCGAGGAGAT CACCGACGAG ATGGTCTCGT TCGTCACCCA GACGCCCGAC GGCATCGCCG GCACCGCGTT CGGACTGAAC GGCGACGATC GCGGCGTCGC CTACGTCGAC GCCTCGCTGA CCGACGACAT CGAGAGCGAG GGCTACGCCC GCGAGGTCAT CCGCCGCGTC CAGGAGATGC GCAAGGACCT CGATCTCGAC GTCGAGGAAC GGATCGCCCT GGATCTGGAG ATCGACGACG ACCGCGTCGC CGACCTCGTC GCCGAGCGCG AGGACCTGAT CCGCGAGGAG GTCCGCGCCG ACGAGATCGG CGAGATCGAC GACGGCCACC GCAAGGAGTG GGAGGTCGAG GACGTGACGA TGGAAATCGG GGTCGAGCCG CTGGCAGCGG CCGAAGCGTC GGATTAG
|
Protein sequence | MSRFGEVDDQ YDPHELEGRI FDYWDDVDAY EQTVEHRSDG ESFFFVDGPP YTSGSAHMGT TWNKSLKDVY LRFLRMQGYD VTDRPGYDMH GLPIETRVEE RLGFENKKDI EEFGEENFIQ ECKDYADEQL EGLQEDFQDF GVWMDWDDPY RTVEPEYMEA AWWGFSKAAD RGLVEKGHRS ISQCPRCETA IANNEVEYED VEDPSIYVKF DLEDREGKIV IWTTTPWTVP ANTFVAVDPE GDYVGVRAEK DGEEELLYVA DAKHEEVLRE GRYDDYEVVE ERSGEELIGW SYEHPLAEEV PDHVDVEGAL EVYAADYVDT DGDGTGLVHS APGHGEVDFE RGRELGFPIF CPVGSDGVYT EEAGTYEGQF VKDADPEITD DLEDNGALLA SGTVHHSYGH CWRCDTGILQ IVTDQWFITI TDVKDELLDN IEDSEWHPEW ARDNRFRDFV EEAPDWNVSR QRYWGIPLPV WTPEDRDDDE DMIVIGDREE LADRVDQDVD PEDVDLHKDT VDDLTITEDG TTYTRVPDVF DVWLDSSVAS WGTLNYPEDD SRFDELWPAD FILEAHDQTR GWFWSQLGMS TAALGESPYQ EVLMHGHALM PDGRAMSKSK DILIDPHEAI DRHGRDVMRM FLLSNNPQGE DMRFDWDGMQ TMENHLRTLW NVFRFPLPYM RLDEFDPTAT TLEDVDSDLE LIDEWVLARL QSTKAEMTEA FEDRRQDRAL DALIEFVVED VSRFYVQAVR ERMWAEEDSG SKRAAYATIY QVLRESVALL APYAPFISEE IYGTLTGDDG FDTVHMEDWP EVDEYWQDEQ LEDDVAILRA IEEAGANARQ QAGRKLRWPV PRVVVAADDD RVVEAVERHT PLLEDRLNAR EIELVSAEDR WEELQYSAEA DMSELGPAFG DRAGQVMNAL NEARIDEPSL EAIEEAVADV LEEGEEITDE MVSFVTQTPD GIAGTAFGLN GDDRGVAYVD ASLTDDIESE GYAREVIRRV QEMRKDLDLD VEERIALDLE IDDDRVADLV AEREDLIREE VRADEIGEID DGHRKEWEVE DVTMEIGVEP LAAAEASD
|
| |