Gene Information |
Locus tag | Pars_0260 |
Symbol | leuS |
ID | 5054166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 231969 |
End bp | 234806 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640467838 |
Product | leucyl-tRNA synthetase |
Protein accession | YP_001152525 |
Protein GI | 145590523 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0495] Leucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00395] leucyl-tRNA synthetase, archaeal and cytosolic family |
|
|
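The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 231969
end_bp = 234806

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (Is gene spliced = No) and ends with one stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (2838 bp) and Protein Length (945 aa) fields.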
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.76888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0524168 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGAGT TAGCTAGGTT TTTCATAGAG CTTGGCGAGA AGTGGCAGAA GAGGTGGGCA GAGGCACGGG TGTACGAGCC GGCTCCGACG CTTGGGGTTC CCAAGTTTTT CATAACAGCG GCGTATCCCT ATCCCAATGG CGCAATCCAC ATCGGCCACG GCCGCACCTA CCTAATTGCC GACGTCTTGG CAAGATTCCA TCGCCACTTG GGCCGCGCCG TGCTTTTTCC CATGGCCTTC CACTACACGG GCACCCCCAT CCTCACCATC GCCGAGGCAA TCGCCGCAGG CGACGAGACT GTGGTAGAGG AGTATATGGC CATCTACGGC GTTCCCGAAG AGGAGATGAG GAAGATGGGG GATCCGCTCT ACCTGGCGCG GTACTTCCAC GAGCAGTCTA AGCGGGCTAT GCAGAAGTTC GGCCTCTCTA TTGACTGGAC GCGTGAGTTC ACCACCATCG ACCCCGAGTA CCAGCGCTTC ATCCAGTGGC AGTTCGAGAA GCTTAGGAAG AAGGGGCTAA TCGTCAGGGG GAGGCACCCA GTTGGCTGGT GCCCCCGCCA CTCGATGCCG GTGGGGGCCC ACGACACAAA AGACGACAAG GAACCCGACA TAGGGGAGTG GACTTTGATA TACTTCGCGG ATAGGGATGG GCTGATTTTC CCCGCCGCCA CGCTTAGGCC GGAGACCGTC CTGGGTGTTA CGAATATGTG GATTAATCCT GAGGGGGAGT ACGTCGTGGC CGAGTACGAC GGTAGGAAGA TGGTGTTGAG CAGAGACGCC GCGTACCGCC TTTCTTTCCA AGGCTCTGTG AAGGTGTTAC GCGAAGCGAA GGGGCGGGAG TTCGTGGGTA GAGAGGTGCA GAACCCCGTG ACGGGGGAGT GGGTGCCGAT ATACGGGGCA AAGTTCGTCG ACCCCAAGGT GGGCACCGGC GTCGTGATGT CTGTGCCGGC CCACGCGCCT TACGACTACG CCGCGCTCCG GGACATCGGC GCAATTAGGC TAATCCCGCT TATCAAGGTG GAGGGCTACG GCGAGTACCC AGCTAAGGAT GTTGTGGAGC GGATGGAGAT TAAGAGCCAG ACAGATCCGG CGCTGGAGGA GGCCACGAAG GAGGTGTACT CGGCTGAGTA CGCCCGGGGC GTGATGCGGG AGGACGTGGT GGAGCTGGTC GGCCGCCATC TTCCCGAGCC TGCGAGGTCT ATGGTTATGG CCGTGTTTAA GATGTACTTC GCCGGGAGGC CGGTGAGGGA GGCTAGGGAG TTCATTTCTA AGTGGCTCGC CGAGGCGGGG CTGGGCGGCG TCATGTACGA TATTATGAAC AAGCCGGTGT ACTGCCGTTG CGGCACGGAG ATCGTTGTGA AGGTGCTTGA GGATCAGTGG TTTATAAACT ACGGCGAGCC TAGGTGGAAG GAGCTCGCCA AGAAGCTAGT GGAGGAGATG ACCATAGTGC CCCCCGAGGC GAAGGCCCAG TTCTTCGCCA CTATAGACTG GCTGGATAAG AGGGCTTGCG CCCGGACTAG GGGCCTTGGG ACGCCGCTTC CGTGGAGCCA TGGTTGGGTT ATTGAGAGCC TGAGCGACTC CACAATATAC ATGGCCTACT ACGCGGTGAT TAAGGGGATA AGGAGGCACA ACCTCAGGCC GGAGCAACTG ACAGAGGAGT TCTGGGACTA CGTCTTCCTC GGAGTTGGGA CGCCAGAGGA GGTGTCTGCG AAGACAGGAA TACCAGCCGA GGCGTTGAGG GCGATTAGGG AGGAGTTCGA GTTCTGGTAC CCCCTGGACT CGAGGAATTC CGGCAAGGAC 
CTCATCCCTA ACCACCTCAC CTTCTTCATC TTTAACCACG TGGCGATTTT CCCCAGGGAG AAGTGGCCGA GGCAGATCGT GGCCAACGGC TGGGTTCTGA GGGAGGGGGA GAAGATGTCT AAGTCGAAGC GCAACGTCCT GCCGCTGGAC AAGGCAGTGG AGATGTACGG TCCCGACCCG CTGAGGGCCA CGCTGGCCAT CTCCGCGGAG GTGGAGCAAG ACCTCGATTT CAGACACGCC GAAGCCGTTA GGAACGCCCA ACAACTCATG TCCATATACA CGCTGGCCCA GAGGCTGGCC CAATCGGCCG AGGATCGGGA ACCCACTTGG CTAGACCGCT GGTTGCTGTC CGAGGTGGCT CTGGCGCTTG AGAGAGTGAG GGATGCTTAT GAAAAGGTGA GGGTGCGCCA AGCCGCCGTG GAGCTCCTCT ACAACATCAA GAATATCTTC GACTCGTACA TGACTGCGGT GGAGCGGCCG TCGAGGCTCG CAGTGGAGGT AGCCAAGGCG TGGGCTGTGG CTCTTGAGCC AATTGCACCG CACCTCGCCG AGGAGGTTTG GAGCCTCTTG GGAGGGGAGG GGTTCGTGAC AAGCGCAAAG TGGCCTCAGC TCAAGCCCGA CCCGGCGGCG TTGCTGGCGA GGAGGTATGT CGACATGGTG GTGGAGGACG TAAAGAAGAT CCCGGCGTAC GGCGAGGGCG TAAGGCGGGT TGTGCTGTAC ATCAACCCCA ACTTCACGTG GGTTAAGGCC GCCCTAAACA ACGACGTCAA GTCCGCCATA GCGGCAGGTA CCCCGCCTCA GCTGGCCAAG CGCCTTGTGG AGCTGGTGAG GACGCTGGGC GACGAGGTCA GGTCGCTCAT CGCCGCGGTT GAGAACTTCG ACGAGCGGGA GGCCCTCCTG TCTTACAAAA ACTACGTGGA GAAGGCGCTG GGAGCCCCGG TGGAGGTCTA CACTGCGGAG GACCCGGCGG CGCCCGATCT CGGCGGGAAG AAAAAGGCGG CGCTCCCGCT GAAGCCCGGG ATATTTATAG AGCGCTAG
|
Protein sequence | MSELARFFIE LGEKWQKRWA EARVYEPAPT LGVPKFFITA AYPYPNGAIH IGHGRTYLIA DVLARFHRHL GRAVLFPMAF HYTGTPILTI AEAIAAGDET VVEEYMAIYG VPEEEMRKMG DPLYLARYFH EQSKRAMQKF GLSIDWTREF TTIDPEYQRF IQWQFEKLRK KGLIVRGRHP VGWCPRHSMP VGAHDTKDDK EPDIGEWTLI YFADRDGLIF PAATLRPETV LGVTNMWINP EGEYVVAEYD GRKMVLSRDA AYRLSFQGSV KVLREAKGRE FVGREVQNPV TGEWVPIYGA KFVDPKVGTG VVMSVPAHAP YDYAALRDIG AIRLIPLIKV EGYGEYPAKD VVERMEIKSQ TDPALEEATK EVYSAEYARG VMREDVVELV GRHLPEPARS MVMAVFKMYF AGRPVREARE FISKWLAEAG LGGVMYDIMN KPVYCRCGTE IVVKVLEDQW FINYGEPRWK ELAKKLVEEM TIVPPEAKAQ FFATIDWLDK RACARTRGLG TPLPWSHGWV IESLSDSTIY MAYYAVIKGI RRHNLRPEQL TEEFWDYVFL GVGTPEEVSA KTGIPAEALR AIREEFEFWY PLDSRNSGKD LIPNHLTFFI FNHVAIFPRE KWPRQIVANG WVLREGEKMS KSKRNVLPLD KAVEMYGPDP LRATLAISAE VEQDLDFRHA EAVRNAQQLM SIYTLAQRLA QSAEDREPTW LDRWLLSEVA LALERVRDAY EKVRVRQAAV ELLYNIKNIF DSYMTAVERP SRLAVEVAKA WAVALEPIAP HLAEEVWSLL GGEGFVTSAK WPQLKPDPAA LLARRYVDMV VEDVKKIPAY GEGVRRVVLY INPNFTWVKA ALNNDVKSAI AAGTPPQLAK RLVELVRTLG DEVRSLIAAV ENFDEREALL SYKNYVEKAL GAPVEVYTAE DPAAPDLGGK KKAALPLKPG IFIER
|
| |
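The relationship between the Gene sequence and Protein sequence fields can be spot-checked by translating the first and last few codons. This is a minimal sketch, not the database's own pipeline: the helper `translate` and its `is_gene_start` parameter are hypothetical names, the codon table is written out from the NCBI table 11 amino-acid string (which matches the standard code for elongation), and the listed set of alternative start codons is an assumption taken from the commonly cited table 11 definition. The two DNA fragments are copied from the start and end of the Gene sequence field.

```python
# Codon table built from the NCBI-style amino-acid string, base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: start codons permitted by translation table 11; an initiating
# codon is read as methionine regardless of its usual meaning.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, is_gene_start=False):
    """Translate an in-frame DNA fragment; blanks between 10-base blocks are ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_gene_start and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


# First 30 bases and last 18 bases of the Gene sequence field.
first_residues = translate("GTGAGCGAGT TAGCTAGGTT TTTCATAGAG", is_gene_start=True)
last_residues = translate("ATATTTATAG AGCGCTAG")

print(first_residues)  # MSELARFFIE
print(last_residues)   # IFIER*
```

The GTG start codon would read as valine inside a gene, but as the initiating codon it yields the leading M of the protein. The trailing `*` marks the TAG stop codon, which does not appear in the Protein sequence field.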