Gene Information |
Locus tag | Pars_1612 |
Symbol | |
ID | 5056275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1454735 |
End bp | 1455859 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640469153 |
Product | tryptophanyl-tRNA synthetase |
Protein accession | YP_001153818 |
Protein GI | 145591816 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0180] Tryptophanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00233] tryptophanyl-tRNA synthetase |
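
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene span includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1454735
END_BP = 1455859
GENE_LENGTH_BP = 1125
PROTEIN_LENGTH_AA = 374


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1125, matches Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # 374, matches Protein Length
```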
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCGAGG ATTTTGTAGT AACGCCCTGG GAGGTAAGGG GCAAGGTCGA CTATGAAAAG TTGTTGAGAC AGTTCGGCGC CAAGCCCCTC ACAGCTGAGG AGGTAGCCCT TCTTGAAAAA TACGCCGGAG ATGTGCACCC CCTTATCAAG AGGGGGTTCT TCTACGCCCA CCGCGACTTC GACTTCATAT TGAAGTGGCA CGGCGAGGGG AGGCCTTGGG CCCTCTACAC GGGGCGGGGT CCCAGCGGAC CTGTACACAT CGGCCACATG GTGCCTTGGA TACTACTCAA GTGGTTCTCG GACAAATTCG GCGTAGAGGT CTACTTCCAG ATGACAGACG ACGAGAAGTT TTTCGACGAC CCGGAGATGA AGCTGGAGGA GGCCACCGGC TGGGCCTACG AAAACGCCCT GGACGTAATC GCCCTTGGCT TTGGGCCAGA TAAGCTACAT CTCATAGTGG ACACAAAAGA CATAGCCCCG CTCTACCCCA TAGCAGTCCG CGTGGCCAAG AAGCTTACTT GGAATACAGT AAAGGCCACC TTCGGCTTTA CCGACTCCTC CAACATAGGC CTTATCTTCT ACCCCTCTCT GCAAATAGCC GTGGCCTTCC TGCCGACAGA GCTTAAGAAA GAGCCGACGC CCGTACTCAT CCCCTGCGCC ATTGACCAGG ACCCCTACTT CCGCCTGGCG AGGGACATAG CGGACTCCCT AAGCTACCCC AAGCCCACGA CCCTGTACTC CAAGTTCATA ATGGCGCTGA CTGGGGAGAG CAAGATGTCT GCATCAAACC CCGACTCGGC CATATATACC ATGGACGACG ACAAGACCGT GAAGCACAAG ATATTGAACG CCTTCACCGG CGGCCGCCCC ACAGCTGAGG AGCAGAGGAA GTACGGCGGA AATCCAGACA TCTGCCCCGT GTTTCACTAC CACATGCTCT TTGACCCAGA CGACGCCTCA GTGGAGAAGA TTAGGCAAGA CTGCAAGTCT GGCGCCCTCC TCTGCGGCGA GTGCAAGCTT AAGCTTCACG AAAAGATCTC AAAATTCCTC AAAGAGCACC GAGAAAGGAG AGAAAAGGCG CGTGGAAAAG TAGACGAGTA CCGGCTAAGC GTGAAGCTTA AGTGA
|
Protein sequence | MGEDFVVTPW EVRGKVDYEK LLRQFGAKPL TAEEVALLEK YAGDVHPLIK RGFFYAHRDF DFILKWHGEG RPWALYTGRG PSGPVHIGHM VPWILLKWFS DKFGVEVYFQ MTDDEKFFDD PEMKLEEATG WAYENALDVI ALGFGPDKLH LIVDTKDIAP LYPIAVRVAK KLTWNTVKAT FGFTDSSNIG LIFYPSLQIA VAFLPTELKK EPTPVLIPCA IDQDPYFRLA RDIADSLSYP KPTTLYSKFI MALTGESKMS ASNPDSAIYT MDDDKTVKHK ILNAFTGGRP TAEEQRKYGG NPDICPVFHY HMLFDPDDAS VEKIRQDCKS GALLCGECKL KLHEKISKFL KEHRERREKA RGKVDEYRLS VKLK
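
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: `clean`, `gc_percent`, and `translate` are hypothetical helpers, the codon assignments are those shared by the standard code and translation table 11, and treating only ATG, GTG, and TTG as initiator codons is a simplifying assumption. The example input is the first 30 bases of the gene sequence above.

```python
# Codon table shared by the standard code and translation table 11,
# built in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: a simplified set of initiator codons, each read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_30 = "GTGGGCGAGG ATTTTGTAGT AACGCCCTGG"
    print(translate(first_30))  # MGEDFVVTPW, the first 10 residues of the protein
```

Applied to the full gene sequence, `gc_percent` could be compared against the GC content field and `translate` against the protein sequence.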