Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0568 |
Symbol | leuS |
ID | 4600612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 516368 |
End bp | 519301 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639773338 |
Product | leucyl-tRNA synthetase |
Protein accession | YP_919976 |
Protein GI | 119719481 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0495] Leucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00395] leucyl-tRNA synthetase, archaeal and cytosolic family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTGTCA TTGCTAGCAA TAATCAAAGG ATGAAGTTCC CGGAAGTGAA TGCTGAGAGA AGGGATTTTC TACGCAAAGT GGAAGAGAAG TGGCAGCGCA GGTGGGAAGA GAGCGGTTTA TTCGAAGCGG ATCCAGACCC CTCGCGCCCC AAGTTCTTCG TTACCTTTCC TTACCCCTAC ATAAACTCCT TCCCACACCT CGGCACAGCG TATACCGTTC TCCGCGTCGA TATACTCGCC AGGTTTAAGA GGATGCAGGG CTTCAACGTC CTCTTCCCGC AAGGGTGGCA CGCGACGGGA GGGCCCATAG TAGCCGCCGC GCTAAGAGTC AGGGAGGGCG ACGAGAAGCA GATACGCATA TTGAAGTCTA TCGGAATCCC CGACTCGGAA ATCCCGAAGT TCAGGGACCC TGAGTACTGG GTCGAGTTCT TCAGAAAGGG TTTCAAACAG GACTTTTCGA GGTACGGGCT CTCGATAGAC TGGAGGAGGG AGTTTTTCAC CACGTACCTA AATCCCCCCT ACAGCAAGTT CATACAGTGG CAGTACACAG TCCTGAGAGA GAAGGGTCTG ATAACCAGAG GGTCTCACCC GGTTGTGTGG TGCCCGAAAG AGCATAAAGT GGTCGGAGAC CACGATAGGC CGGACGAGTA CGCTGGCATA GGACCGGAAA GAGTCGTGAT TATAAAGTTT AAGGGAGAGG ACGGCCTCGT CTACCCCTGT CTCACCTATA GACCCGAGAC GGTTTACGGG GCTGTAAACA TCTGGGTGAA CCCGGAGTCC AAGTACCTGG TAGCAGAGGT GGACGGTGAG AAATGGGTTA TAGGGGAGTA CGGCGCCAGG GAGCTCGCCG ACCAGGATCA CTCCGTGAAG ATAGTGGGAG AGGTAAAGGG CTCCGAGCTC GTGGGTAGGT TTGCGAGGAA CCCCGTTACC GGCTGGAGGA TCCCAGTTCT GCCCGCGTAC TTCGTCCAGG CGGACGCAGG TACAGGTATA GTGATGTCGG TCCCAGCGCA TGCCCCCTAC GACTTCGCGG GCCTGGAGGA CCTAAAGAAG GATCCCTACC TCCTAGAGAA GTTTGGCCTG GACCCCGCTA TCTTAGACGC TGTTAGACCA GTAAAGCTGA TAGACGTCGA GGAATACGGC GGGCTACCCG CGGAGGAGGT GGTTAGACGC CTTGGCGTGA CGTCGCAGTT TGACCGTGAA AAGCTCGAGG AGGCGACGAA GGAGGTTTAC TCGAAGGAAT TTTACAAGGG AGTCCTAAAG CCCGAGGTAT TCGGGGAGCG CTGGGGCGGC AGGAAAGTCT TCGAGGTAAA GGAGGACGTC GTGGAGAACC TTGTGAGCAG AGGCATCGCG CTGAGACACT ACACTCTGCC GAGCCCCGTG TACTGCCGTT GCGGGGCTAG GACGCACGTA AAGCTGGTCA AAGACCAGTG GTTCCTCAGG TACAGCGACC CGGAGTGGAA GAGGAGGGCC CACGAGTGTA TTGACAGGAT GAGGTTCGTG CCGGAGGAAG TTAGGCAGGA GTTCCACAGG CTCGTTGACT GGTACGAGGA CTGGGCCTGC ACGCATGAAA GAGAGCTCGG GACACCGCTC CCGTGGGACG AGAGGTGGGT CCTGGAGTCA CTCAGCGATT CGACCATATA CATGGCCTAC TACACGCTGG CGAAGTACTT ACAGCACCCG GAGAAGTACG GCATAGACTG GTCCAAGCTG AACAACGAGT TCTTCGACTA CGTCCTCCTG GGAAAGGGGG ATCCCGGTAG CGTCGCCGAG AGGACGGGGA TTCCCAAGGA GTTGCTCGAA GAGATGAGGA AGGAGTTCCT CTACTGGTAC CCTGTCGATA TGAGGGTTTC CGGGAAGGAC CTGATCGGGA ACCACCTCGT ATTCTTCATA ATGCACCACG TGGCGATATT CCCGGAAGAA CACTGGCCGA GAGGTATAGG CGTCAACGGC TGGGTCCTGG TTGCCGGGAA GAAGATGTCC AAGTCCGCCG GGAACTTCAT ACTTCTACGC GAGGCCCTGG AGTACTGGGG CGCGGATGCT ACGCGTTTCG CCGAGGCCTA CGCGGGTAAC TCGGGGCTCG ACGACGGAAA CTTTGAGCCC GAGGTTGCGA GTAAGGCTGT AGACCTGCTG TACGAGTGGT ACGAATTCGC AGTGAACAAT TACGGCAAGG GAGATGAAAA CAGAAGATTC GTGGACGACT GGTTTGAAAG CGTTCTCTAC AGAACCCTGG AAAAGGTGAC TAAAGAGTAC GAAGAGCTTA ACACGAAAAA CGTGCTCGTA GAGGGCTTCT TCAATCTCCA GAACGCGTAT AGGTGGTACG TAAAGAGGCG GGGCGGCACA GCGAATAAAG AGGTGCTTAA GAAGTTCGTA GAGATACAAA CCCTCATCCT TGCCCCCATA ACACCCCACA TAGCCGAGGA GATATGGGAG GCAACAGGGC ACAAAGAATT CATATCAAGG ACGAGCTGGC CCGCAGTCGA TAAGAGCAAG ATCAAGGACG AGGTGGAAAA AGCCGAGTCC ATAGTCGTCA AGCTATACGA GGATATTCAG GAGGTCCTGA AACTGAAGAA GAGCGGCGTA GAAAGGATAA CAATAGTCGC GCCGTCCAAG TGGAAGTACG GCTTCCTCGA AGGAGTCAAG CGGAGATACT CGACCTACGG GAAGCTTTCT CAGGCTATAA GCGAGACAAT AAAGGAAGTC GAGCCCAGCC TTAAACCCGC CGCTGGACAG CTAGCATCGC TTATCCAGAA GAACCCAGAG GTCCTAGACC TTCTAGTAAG CCCCGAAGCG GAGCAGAAAG CACTCTCCGA CGCGCTCGAA TTCCTTAAAG ACTCTCTAGG AGTCCCCGTA GAACTTGTAG CGGAGGAAGA GCTAAGGGAA AACCCGAGAG CCAGAACAAC CCTCCCAGGT AGACCCTCGA TCATTCTTTC GTAG
|
Protein sequence | MSVIASNNQR MKFPEVNAER RDFLRKVEEK WQRRWEESGL FEADPDPSRP KFFVTFPYPY INSFPHLGTA YTVLRVDILA RFKRMQGFNV LFPQGWHATG GPIVAAALRV REGDEKQIRI LKSIGIPDSE IPKFRDPEYW VEFFRKGFKQ DFSRYGLSID WRREFFTTYL NPPYSKFIQW QYTVLREKGL ITRGSHPVVW CPKEHKVVGD HDRPDEYAGI GPERVVIIKF KGEDGLVYPC LTYRPETVYG AVNIWVNPES KYLVAEVDGE KWVIGEYGAR ELADQDHSVK IVGEVKGSEL VGRFARNPVT GWRIPVLPAY FVQADAGTGI VMSVPAHAPY DFAGLEDLKK DPYLLEKFGL DPAILDAVRP VKLIDVEEYG GLPAEEVVRR LGVTSQFDRE KLEEATKEVY SKEFYKGVLK PEVFGERWGG RKVFEVKEDV VENLVSRGIA LRHYTLPSPV YCRCGARTHV KLVKDQWFLR YSDPEWKRRA HECIDRMRFV PEEVRQEFHR LVDWYEDWAC THERELGTPL PWDERWVLES LSDSTIYMAY YTLAKYLQHP EKYGIDWSKL NNEFFDYVLL GKGDPGSVAE RTGIPKELLE EMRKEFLYWY PVDMRVSGKD LIGNHLVFFI MHHVAIFPEE HWPRGIGVNG WVLVAGKKMS KSAGNFILLR EALEYWGADA TRFAEAYAGN SGLDDGNFEP EVASKAVDLL YEWYEFAVNN YGKGDENRRF VDDWFESVLY RTLEKVTKEY EELNTKNVLV EGFFNLQNAY RWYVKRRGGT ANKEVLKKFV EIQTLILAPI TPHIAEEIWE ATGHKEFISR TSWPAVDKSK IKDEVEKAES IVVKLYEDIQ EVLKLKKSGV ERITIVAPSK WKYGFLEGVK RRYSTYGKLS QAISETIKEV EPSLKPAAGQ LASLIQKNPE VLDLLVSPEA EQKALSDALE FLKDSLGVPV ELVAEEELRE NPRARTTLPG RPSIILS
|
| |