Gene Htur_0089 details


Gene Information

Locus tag: Htur_0089
Symbol:
ID: 8740652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 98661
End bp: 101867
Gene Length: 3207 bp
Protein Length: 1068 aa
Translation table: 11
GC content: 67%
IMG OID: 646510652
Product: isoleucyl-tRNA synthetase
Protein accession: YP_003401663
Protein GI: 284163384
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0060] Isoleucyl-tRNA synthetase
TIGRFAM ID: [TIGR00392] isoleucyl-tRNA synthetase
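
The start, end, gene length and protein length listed above are mutually consistent, which can be checked with a minimal sketch. It assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(98661, 101867)
print(length)                  # 3207
print(protein_length(length))  # 1068
```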


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAGAT TCGGCGAGGT CGACGACCAG TACGATCCGC ACGAACTCGA GGGGCGGATC 
TTCGACTACT GGGACGACGT CGACGCCTAC GAGCAGACGG TCGAGCACCG ATCCGACGGC
GAGTCGTTCT TCTTCGTCGA CGGGCCGCCG TACACGTCTG GGTCGGCGCA CATGGGAACG
ACCTGGAACA AGTCGCTGAA GGACGTCTAC CTGCGGTTCC TGCGGATGCA GGGGTACGAC
GTCACCGACC GGCCGGGCTA CGACATGCAC GGCCTGCCCA TCGAGACCCG CGTCGAGGAG
CGACTCGGCT TCGAGAACAA GAAGGACATC GAGGAGTTCG GCGAGGAGAA CTTCATCCAG
GAGTGTAAGG ACTACGCCGA CGAGCAACTC GAGGGGCTCC AGGAGGACTT CCAAGACTTC
GGCGTCTGGA TGGACTGGGA CGACCCCTAC CGGACGGTCG AACCGGAGTA CATGGAGGCC
GCCTGGTGGG GCTTTTCGAA GGCCGCCGAC CGCGGGTTAG TCGAGAAGGG CCACCGCTCG
ATCTCCCAGT GTCCGCGCTG CGAGACCGCG ATCGCGAACA ACGAGGTCGA GTACGAGGAC
GTCGAGGACC CCTCGATCTA CGTCAAGTTC GACCTCGAGG ACCGCGAGGG CAAGATCGTC
ATCTGGACGA CCACGCCGTG GACGGTTCCG GCGAACACGT TCGTCGCCGT TGACCCCGAA
GGCGACTACG TCGGCGTCCG GGCCGAGAAG GACGGCGAGG AGGAACTGCT GTACGTCGCT
GACGCGAAAC ACGAGGAGGT CCTCCGAGAG GGCCGCTACG ACGACTACGA GGTCGTCGAG
GAGCGCTCCG GCGAGGAGCT GATCGGCTGG TCCTACGAGC ACCCGCTCGC CGAGGAGGTA
CCCGACCACG TCGACGTCGA GGGGGCGCTC GAGGTCTACG CCGCCGACTA CGTCGACACC
GACGGCGACG GCACGGGGCT CGTCCACTCC GCGCCCGGCC ACGGTGAAGT GGACTTCGAG
CGCGGCCGCG AACTCGGCTT CCCGATCTTC TGTCCCGTCG GCAGCGACGG CGTCTACACC
GAGGAGGCCG GCACGTACGA GGGCCAGTTC GTCAAGGACG CCGATCCGGA GATCACCGAC
GACCTCGAGG ACAACGGCGC GCTGCTCGCC TCGGGTACCG TCCACCACAG TTACGGCCAC
TGCTGGCGCT GTGACACGGG GATCCTCCAG ATCGTCACCG ACCAGTGGTT CATCACGATC
ACCGACGTCA AGGACGAACT GCTGGACAAC ATCGAGGACA GCGAGTGGCA CCCCGAGTGG
GCCCGCGACA ACCGTTTCCG GGACTTCGTC GAGGAGGCCC CCGACTGGAA CGTCTCCCGC
CAGCGCTACT GGGGCATCCC GCTGCCCGTT TGGACGCCCG AGGACCGAGA TGATGACGAG
GACATGATCG TCATCGGCGA CCGCGAGGAA CTCGCCGATC GGGTCGATCA GGACGTCGAC
CCCGAGGACG TCGACCTCCA CAAGGACACG GTCGACGACC TCACCATCAC CGAGGACGGG
ACCACCTACA CGCGCGTCCC CGACGTCTTC GACGTCTGGC TCGACTCCTC GGTCGCCTCC
TGGGGCACGC TGAACTACCC CGAAGACGAC AGCCGCTTCG ACGAGCTCTG GCCCGCCGAC
TTCATCCTCG AGGCCCACGA CCAGACGCGG GGCTGGTTCT GGTCCCAGCT GGGGATGAGC
ACCGCCGCGC TCGGCGAGAG CCCGTATCAG GAGGTGCTGA TGCACGGCCA CGCGCTGATG
CCCGACGGCC GCGCGATGTC CAAGTCCAAG GACATCCTGA TCGACCCCCA CGAGGCCATC
GACCGCCACG GCCGCGACGT CATGCGCATG TTCCTCCTGT CGAACAACCC GCAGGGCGAG
GACATGCGCT TCGACTGGGA CGGGATGCAG ACGATGGAGA ACCACCTCCG GACGCTGTGG
AACGTCTTCC GGTTCCCGCT GCCCTACATG CGCTTAGACG AGTTCGATCC CACCGCAACC
ACTCTCGAGG ACGTCGATTC GGACCTCGAA CTCATCGACG AATGGGTGCT CGCCCGCCTC
CAGTCCACGA AGGCCGAGAT GACCGAGGCC TTCGAGGACC GCCGGCAGGA CCGCGCGCTC
GACGCCCTGA TCGAGTTCGT CGTCGAGGAC GTCTCGCGAT TCTACGTGCA GGCCGTCCGC
GAGCGCATGT GGGCCGAGGA GGACAGCGGC TCGAAGCGGG CCGCCTACGC GACGATCTAT
CAGGTCCTCC GGGAGAGCGT CGCCCTGCTC GCGCCGTACG CGCCGTTCAT CAGCGAGGAA
ATCTACGGGA CGCTGACCGG CGACGACGGG TTCGACACCG TCCACATGGA GGACTGGCCC
GAGGTCGACG AGTACTGGCA GGACGAACAG CTCGAGGACG ACGTCGCCAT CCTCCGCGCG
ATCGAAGAGG CCGGCGCGAA CGCCCGCCAG CAGGCCGGCC GCAAGCTGCG CTGGCCCGTC
CCGCGGGTCG TCGTGGCTGC TGACGACGAC CGCGTCGTCG AGGCCGTCGA GCGCCACACG
CCGCTGCTCG AGGACCGACT CAACGCCCGC GAGATCGAGC TCGTCTCCGC GGAGGACCGC
TGGGAAGAAC TGCAGTACAG CGCCGAGGCC GACATGAGCG AACTCGGACC GGCTTTCGGC
GACCGCGCCG GGCAGGTCAT GAACGCGCTC AACGAGGCCC GCATCGACGA GCCGAGCCTC
GAGGCGATCG AAGAGGCCGT TGCAGACGTG CTCGAGGAGG GCGAGGAGAT CACCGACGAG
ATGGTCTCGT TCGTCACCCA GACGCCCGAC GGCATCGCCG GCACCGCGTT CGGACTGAAC
GGCGACGATC GCGGCGTCGC CTACGTCGAC GCCTCGCTGA CCGACGACAT CGAGAGCGAG
GGCTACGCCC GCGAGGTCAT CCGCCGCGTC CAGGAGATGC GCAAGGACCT CGATCTCGAC
GTCGAGGAAC GGATCGCCCT GGATCTGGAG ATCGACGACG ACCGCGTCGC CGACCTCGTC
GCCGAGCGCG AGGACCTGAT CCGCGAGGAG GTCCGCGCCG ACGAGATCGG CGAGATCGAC
GACGGCCACC GCAAGGAGTG GGAGGTCGAG GACGTGACGA TGGAAATCGG GGTCGAGCCG
CTGGCAGCGG CCGAAGCGTC GGATTAG
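
The gene sequence above is laid out in space-separated blocks of 10 bases. A minimal sketch of how such text might be joined into one contiguous string and its GC fraction computed is shown here. The helper names are hypothetical, and only the last line of the sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Strip all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Last line of the gene sequence, copied from the record.
last_line = "CTGGCAGCGG CCGAAGCGTC GGATTAG"
seq = join_blocks(last_line)
print(len(seq))             # 27
print(seq.endswith("TAG"))  # True: the CDS ends with a TAG stop codon
print(round(gc_fraction(seq), 2))
```

Passing the full block of sequence lines to `join_blocks` would give the complete CDS, whose length and GC fraction can then be compared with the values in the Gene Information table.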
 
Protein sequence
MSRFGEVDDQ YDPHELEGRI FDYWDDVDAY EQTVEHRSDG ESFFFVDGPP YTSGSAHMGT 
TWNKSLKDVY LRFLRMQGYD VTDRPGYDMH GLPIETRVEE RLGFENKKDI EEFGEENFIQ
ECKDYADEQL EGLQEDFQDF GVWMDWDDPY RTVEPEYMEA AWWGFSKAAD RGLVEKGHRS
ISQCPRCETA IANNEVEYED VEDPSIYVKF DLEDREGKIV IWTTTPWTVP ANTFVAVDPE
GDYVGVRAEK DGEEELLYVA DAKHEEVLRE GRYDDYEVVE ERSGEELIGW SYEHPLAEEV
PDHVDVEGAL EVYAADYVDT DGDGTGLVHS APGHGEVDFE RGRELGFPIF CPVGSDGVYT
EEAGTYEGQF VKDADPEITD DLEDNGALLA SGTVHHSYGH CWRCDTGILQ IVTDQWFITI
TDVKDELLDN IEDSEWHPEW ARDNRFRDFV EEAPDWNVSR QRYWGIPLPV WTPEDRDDDE
DMIVIGDREE LADRVDQDVD PEDVDLHKDT VDDLTITEDG TTYTRVPDVF DVWLDSSVAS
WGTLNYPEDD SRFDELWPAD FILEAHDQTR GWFWSQLGMS TAALGESPYQ EVLMHGHALM
PDGRAMSKSK DILIDPHEAI DRHGRDVMRM FLLSNNPQGE DMRFDWDGMQ TMENHLRTLW
NVFRFPLPYM RLDEFDPTAT TLEDVDSDLE LIDEWVLARL QSTKAEMTEA FEDRRQDRAL
DALIEFVVED VSRFYVQAVR ERMWAEEDSG SKRAAYATIY QVLRESVALL APYAPFISEE
IYGTLTGDDG FDTVHMEDWP EVDEYWQDEQ LEDDVAILRA IEEAGANARQ QAGRKLRWPV
PRVVVAADDD RVVEAVERHT PLLEDRLNAR EIELVSAEDR WEELQYSAEA DMSELGPAFG
DRAGQVMNAL NEARIDEPSL EAIEEAVADV LEEGEEITDE MVSFVTQTPD GIAGTAFGLN
GDDRGVAYVD ASLTDDIESE GYAREVIRRV QEMRKDLDLD VEERIALDLE IDDDRVADLV
AEREDLIREE VRADEIGEID DGHRKEWEVE DVTMEIGVEP LAAAEASD