Gene Information |
Locus tag | Hlac_0366 |
Symbol | leuS |
ID | 7399759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 386724 |
End bp | 389444 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643707430 |
Product | leucyl-tRNA synthetase |
Protein accession | YP_002565039 |
Protein GI | 222478802 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0495] Leucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00396] leucyl-tRNA synthetase, eubacterial and mitochondrial family |
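
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the reported gene length includes the stop codon; both assumptions are consistent with the numbers in this record, but neither is stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Hlac_0366.
length_bp = gene_length(386724, 389444)
length_aa = protein_length(length_bp)
print(length_bp, "bp")
print(length_aa, "aa")
```

With the coordinates listed here, the two results agree with the Gene Length and Protein Length rows.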
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAGG AGAGTTACGA CCACACTGCG GTCGAGAAAC GGTGGCAGGA GGCGTGGGAC GACGCCTCCG TGTATCGCAC GCCGGACGAG GTGGACGATC CCACCTACGT GCTGGGAATG TACCCGTACC CGTCCGGGAA GCTCCACATG GGCCACGTCC GCAACTACAC GATCACGGAC GCGTACGCCC GGTACCGCCG AATGCGCGGC GACGACGTGC TCCATCCGAT GGGGTGGGAC GCGTTCGGGC TCCCCGCCGA GAACGCCGCC AAGGAGCGGG ACACGAACCC CCGCGACTGG ACGTTCGACT GCATCGACAC GATGAAAGGG CAGATGCAGT CGATGGGGTT CGGCTACGAC TGGGACCGCG AGGTCACCAC CTGTACCCCC GACTACTACC AGTGGAACCA GTGGCTGTTC CGCCGCTTCC ACGAGGCGGG GCTCGTCGAG CGCCGCGACG CCGAGGTGAA CTGGTGTCCC TCCTGTGAGA CCGTCCTCGC CGACGAGCAG GTGGAAGGGG ACGACGAGCT GTGCTGGCGG TGTGACACCC CGGTCGAGAC CCGCGATCTG GACCAGTGGT TCCTGGAGAT TACGGAGTAC GCCGACGAGC TGTTGGAGGC GATCGACGGG TTGGCGGGGT GGCCCGACTC CGTGCGCCAG ATGCAGCGCA ACTGGATCGG TCGCCAGCAC GGGACGACCT TGGACTTCGA AGTGAGCGAG GCGCGAAGCG CCTCGAACGG ACGCGGCGAA GGGACCGATC CGCGGGAGTA CGGCCCCGTC GAGGCGTTCA CCACCCGCGT GGACACGATC CACGGCGCGA CGTTCTTCGC GCTCGCGCCG GACCACCCGA TCAGCGAGGA GCTGGCGGAA TCGGACGCGG ACGTGCGACA CTTCATCGAG GAGGAGGCCG ACCCCGAGGG CGACGAGCCG AACGGCGTCG CGACCGGCCT CACCGCCACC AACCCGGTCA CGAGCGAGGA GATCCCCGTC TTCGTCGCCG ACTTCGTCCT CTCCGACGTG GGGACGGGCG CGCTGATGGC GGTGCCCGGC CACGACGACC GCGACCACGC GTTCGCCGAG AAGATGGGCG TCGAGATCAA GCCGGTGATC GCCCCGAAGC CCGAGGACTG GGACGGCGAG ACGATTCCCG ACGCGCCGGA CGTGAGCGAG GGCGCCTTCA CTGACGACGG CGTCGTGATC GACTCCGGCG AGTACAGCGG CCTCGACAGC GAGACGGCCC GCGAGCGGCT CACAGAGGAC ATCGAAAGCG CCGAGACGAC CACCCAGTAT CGCCTGCGCG ACTGGGGGAT CTCCCGACAG CGCTACTGGG GGACCCCGAT CCCGGTCGTC CACTGCGACG ACTGCGGCTC GGTGCTGGTG CCGGAAGCGG ACCTGCCGGT CGAGCTGCCG GAGTTCATCA ACACGACCGG GAACCCGCTC GACGCCGCCG AGGAGTGGAA GGAGACGACC TGTCCGGAGT GCGGCGAGCC CGCCACCCGC GAGACCGACA CGATGGACAC GTTCGTCGAC TCCTCGTGGT ACTTCCTGCG CTACGTCTCG CCCGGCGCCG AGGACGTACC CTTCGACCTG GACCGCGCGA ACGACTGGAT GCCGGTCGAC CAGTACGTCG GCGGCATCGA ACACGCCGTG ATGCACCTGC TGTACTCGCG GTTCGTCACC AAGGTGTTAG CCGACGAGGA GGGGCTGGCA CACCGCGAGC CGTTCACGAA CCTGCTGGCG CAGGGGATGG TCCAGCTGGA GGGCGAGAAG ATGTCCAAGT CGAAGGGGAA CACGGTCTCT 
CCCCAGCGCA TCGTCGACGA GTACGGTGCC GACACGGCCC GGCTGTTCAT TATGCAGGCG GCCCAGCCCG AGCGCGACTT CGACTGGGCC GAGGAGGGCG TCAAGTCGAC GCACCGGTTC CTGGCGCGCC TGACGGACTT GGTCGAGGAG TACGCGGCGG GGGAGGCGGA GACCGCGAGC TCGGACGCCG ACCGCGACAC GATCGACGAT TACGTCGCCG ACGAGGTCGA CGCCGCGGTC GCCATCGCGG GCGCGGAGTA CGACGACCTG ACGTTCAACG TCGCGCTCCG CGAGGCGCAG GACCTCGTGG GGACGCTCCG GAGCTACCGG GGCCACGCCG ACCCGCACCC GGAGACCTAC GAGCGCGGGC TGGACGTTGC GGTCCGCCTG CTCGCACCGG TCGTCCCGCA CCTCGCCGAG GAGCTGTGGG AGACGCTGGA CCGCGAGGGG TTCGTCGTCG AGGCCGAGTG GCCGACCGCG ACGGTCGACC GCGAGACGGT CGAGCGCCGC CGCCGGCTGG TCGCGAACAC CCGTGAGGAC GTGCGCGACA TCGTCGAGGT CGCCGGCATC GAGGACCCCG AGCGGATCGA CGTCGTCGTC GCGCCCGACT GGAAGTACGA CGCGCTCTCG ATCGCGATCG ACAGCGACGC CGACAACCTC ATCTCGGAGC TGATGGGGGA GCCGCATATC CGCGAACAGG GCGACGACGC CGCCTCGTAC GGCCAAGATC TGCAGGCGAA CCGCGAGGCG CTGCAGGAGA CGCTCTCGGG AGACGACGAG TACGACGCCC TTCGCGCGGC CTCGTGGCTG ATCGAGCGCG AGTTCGACGC GCCGGTCCGC GTCGAGCGCG CCGCGGACGC CGACGAGTCG GTGGTTCGCA AGGCGGAGCC CGGCCGGCCG GCGATCGACA TCGTCGAGTA G
|
Protein sequence | MSQESYDHTA VEKRWQEAWD DASVYRTPDE VDDPTYVLGM YPYPSGKLHM GHVRNYTITD AYARYRRMRG DDVLHPMGWD AFGLPAENAA KERDTNPRDW TFDCIDTMKG QMQSMGFGYD WDREVTTCTP DYYQWNQWLF RRFHEAGLVE RRDAEVNWCP SCETVLADEQ VEGDDELCWR CDTPVETRDL DQWFLEITEY ADELLEAIDG LAGWPDSVRQ MQRNWIGRQH GTTLDFEVSE ARSASNGRGE GTDPREYGPV EAFTTRVDTI HGATFFALAP DHPISEELAE SDADVRHFIE EEADPEGDEP NGVATGLTAT NPVTSEEIPV FVADFVLSDV GTGALMAVPG HDDRDHAFAE KMGVEIKPVI APKPEDWDGE TIPDAPDVSE GAFTDDGVVI DSGEYSGLDS ETARERLTED IESAETTTQY RLRDWGISRQ RYWGTPIPVV HCDDCGSVLV PEADLPVELP EFINTTGNPL DAAEEWKETT CPECGEPATR ETDTMDTFVD SSWYFLRYVS PGAEDVPFDL DRANDWMPVD QYVGGIEHAV MHLLYSRFVT KVLADEEGLA HREPFTNLLA QGMVQLEGEK MSKSKGNTVS PQRIVDEYGA DTARLFIMQA AQPERDFDWA EEGVKSTHRF LARLTDLVEE YAAGEAETAS SDADRDTIDD YVADEVDAAV AIAGAEYDDL TFNVALREAQ DLVGTLRSYR GHADPHPETY ERGLDVAVRL LAPVVPHLAE ELWETLDREG FVVEAEWPTA TVDRETVERR RRLVANTRED VRDIVEVAGI EDPERIDVVV APDWKYDALS IAIDSDADNL ISELMGEPHI REQGDDAASY GQDLQANREA LQETLSGDDE YDALRAASWL IEREFDAPVR VERAADADES VVRKAEPGRP AIDIVE
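
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example assumes the gene sequence is shown in its coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard sense-codon assignments, which translation table 11 shares with the standard genetic code. `translate_cds` is a hypothetical helper written for this illustration, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate_cds("ATGTCTCAGG AGAGTTACGA CCACACTGCG"))
# Last 12 bases of the gene sequence, ending in the TAG stop codon.
print(translate_cds("ATCGTCGAGTAG"))
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above.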