Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1497 |
Symbol | valS |
ID | 5054816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1354940 |
End bp | 1357339 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640469039 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001153705 |
Protein GI | 145591703 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.702651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCAAA GGCTACCCAC TCGTTGGGAT ATTCGGTGGG AAGAAGACCT TATAAAGATT TGGGACAACG AGGGCAGATT CAAGACAAAG ATAAGCGGAA CGCGGCCGGT CTTCGTCATC GACACGCCGC CTCCCTACCT CTCCAGCAAC AGGCCCCACA TCGGCCAGAC GGCTTCCTAC GCCCACTTCG ACATGATAGC TCGGTTTTTG AGAATGCGCG GTATCGACGT GATCTTCCCC TTCTACGCAG ACAGAAACGG CCTCCCCATA GAGGTGCAGG TAGAGAAGAA GTACGGCATC GTGGCGCACG AGGTCCCGCG GGAGAAGTTC ATTCAGATGT GCAAAGAAGA GCTGGACAGA TACGAGGGCG AGTTCGTCGC ATCGCTGAGG AGGTGGGGGC TCTCCTTCGA CTACTGGCCC AACGGCACGG ACAGCCCCGA GTACCGCAGA ATGACCCAGA GCACCTTCAT AGAGCTGTGG CGCCGGGGCC TCGTCTACGA GGCGGAGAGA CCCACGCCTT GGTGCCCCCG CTGTAGGACG GCGCTGGCGG AACCTGAGAT CGAGTACAAG GAGGAGGAGA CATACCTAAA CTACATCAAG TTCAAGGTAA AGGAGACCGG TGAGGATATA ATCATCGCCA CGACGCGCCC CGAGCTCCTC CCCGCCACTG TGGCCGTCAT CTTCCACCCA GACGACCAGC GCTACAATAG GCTTGAGGGC CTCCACGCCG TGGTGCCTCC GGAGGGGCAG GTGGTGCCCA TCCTCCCTCA CAGAGCCGCC AATCCGCAGT ACGGAACTGG GCTGGTCATG ATCTCCACCT TCGGCGACAC GAGAGACCTC ATGATAGTCA ATGAGCTGAA GCTACCCATA AGGATAATTG TAGACGAGGC GGGCCGTATC AACTCCGGGA AGTACGCCGG CTTGACAATA AGGGAGGCCA GGGCCAAGAT AATAGAAGAT TTAAAGGTAG CCGGCCTCCT TGTCAAGCAG GAGAGGCTGG TGCACAACGT CCCCGTCTGC TGGCGTTGCA AGACGCCGCT TGAGATCATC GTCACCAGGG AGCTGTTCAT AAAACAGATA GAGTTCAAGG ACAAGCTCAT AGAGCTGGCC AACAAGATGG AGTTTAAGCC CCCCGAGTAC AGGCAAGTCC TCATCGACTG GATCAAGTCG CTGGAACTCG ACTGGCCCGT GTCGCGGAGG CGGTACTACG CCACCGAGAT CCCCATATGG TGGTGCGTTA AGCCAAACGG CGAGAGGGTA CCCATAGTCC CCAAGGGAGG CGAGTACTAC GTCCCGTGGA GAGACCCGCC GCCGCCCGAG GTCAAGGAGG CGTGTAAAGA CGGCAGGCTC GAGGGCGACA CCCGCGTATT CGACACCTGG ATGGACTCCT CCATCTCGTG GATGTACGCC TCGGGCTACA CCAAGGAGTT CAACGTCTTC CCGAAGGTCT ACCCGCACTC CATAATGAGA CCCCAGGGCT ACGACATCAT CAGGACGTGG CTCTACTACT CCCTCCTCCG TGCCTATCTC CTATACGGCA ACGTGCCGTT TAGGTACGTG AGGATAAACG GCATGGGCCT CGACGAGAAG GGAGAAGCCA TGCACAAGTC GAAGGGCAAC GTCATAGACC TCCTAGCCCC AGTGGAGAAG TACGGCGCCG ACGCCGTGAG GTTCTGGGCC GCCGCCGCCG GGAGGCTGGG CTCAGACTAC CGCTACAACG AAAACATCAT AAAAGAGGGC AAGGAGTTCG TCACCAAGGT GTGGAACATA TCCCGCTTCG TCCTCTCCTT CCCCGAACCC ACAGATAAGC CGGAGCTAAC GCCGGTAGAC AAGGCCATAC TTGCCAAGCT ATACCAAGTA GCGAAAAGAG CCATCTCCGC ATTTTCAGAT CTCGACGTCT ACGAGCCGGC CCACCTCCTC TACAACTTCA TATGGCACGA ATTCGCCGAC CACTACATAG AGCTGGCTAA GTCAAGGGCA TACAACAGAG ACAGCACATT TACACAAGAG GAGCAGAGGG CAGCCATCTG GACTCTCTAC GCAGTGTGGA AGTACAGCCT GAAGCTCCTG GCGCCGATAA TGCCCTTCGT CACCGACAAG ATCTGGCGCC TCGCCTACGG CAGATCCATC CACGACGAGA CAATAGAAGA CCCACCAGAG GAGTGGAGCT GGGGCGATGC CTCGCTCTTC GAGCTGTTGA AGAAGATAAA CAGCGCTGTG TGGCGGTATA AGAACAGACA AGGCATGAGC CTCGCGGCGA GCCTAGACAA GGCTCTGTAC GTGCCTGAGG CCGCTCTGCA GGCAGCCAAG GACTTGAAAT ATACGCATAA GGTCCTCGAC GTGAGGCCTG GCCGCGGCGC CGAGCAGATA GACGACGAGG GCCTCGTCTG GATAGGCTAA
|
Protein sequence | MSQRLPTRWD IRWEEDLIKI WDNEGRFKTK ISGTRPVFVI DTPPPYLSSN RPHIGQTASY AHFDMIARFL RMRGIDVIFP FYADRNGLPI EVQVEKKYGI VAHEVPREKF IQMCKEELDR YEGEFVASLR RWGLSFDYWP NGTDSPEYRR MTQSTFIELW RRGLVYEAER PTPWCPRCRT ALAEPEIEYK EEETYLNYIK FKVKETGEDI IIATTRPELL PATVAVIFHP DDQRYNRLEG LHAVVPPEGQ VVPILPHRAA NPQYGTGLVM ISTFGDTRDL MIVNELKLPI RIIVDEAGRI NSGKYAGLTI REARAKIIED LKVAGLLVKQ ERLVHNVPVC WRCKTPLEII VTRELFIKQI EFKDKLIELA NKMEFKPPEY RQVLIDWIKS LELDWPVSRR RYYATEIPIW WCVKPNGERV PIVPKGGEYY VPWRDPPPPE VKEACKDGRL EGDTRVFDTW MDSSISWMYA SGYTKEFNVF PKVYPHSIMR PQGYDIIRTW LYYSLLRAYL LYGNVPFRYV RINGMGLDEK GEAMHKSKGN VIDLLAPVEK YGADAVRFWA AAAGRLGSDY RYNENIIKEG KEFVTKVWNI SRFVLSFPEP TDKPELTPVD KAILAKLYQV AKRAISAFSD LDVYEPAHLL YNFIWHEFAD HYIELAKSRA YNRDSTFTQE EQRAAIWTLY AVWKYSLKLL APIMPFVTDK IWRLAYGRSI HDETIEDPPE EWSWGDASLF ELLKKINSAV WRYKNRQGMS LAASLDKALY VPEAALQAAK DLKYTHKVLD VRPGRGAEQI DDEGLVWIG
|
| |