Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_1846 |
Symbol | valS |
ID | 3690354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 2014316 |
End bp | 2017183 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637728302 |
Product | valyl-tRNA synthetase |
Protein accession | YP_333247 |
Protein GI | 162210068 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.344373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACA CCACGCTTGC GAAAAGTTTC GAGCCCCAGA CCATCGAATC CCAATGGGGG CCGGAATGGG AAAAGCGCGG CTATGCGACC CCCGCGCTCG ATCCGAGCCG GCCGGACTTC TCGATCCAGT TGCCGCCCCC GAACGTGACG GGCACGCTGC ACATGGGCCA CGCGTTCAAT CAGACGATCA TGGACGGGCT CGTGCGCTAC CACCGGATGC TCGGCCACAA CACGCTGTGG GTGCCCGGCA CCGACCACGC GGGCATCGCG ACGCAGATCG TCGTCGAGCG CCAGCTCGAT GCGCAGGGCG TGTCGCGCCA CGATCTCGGC CGCGAGAAAT TCGTCGAGCG CGTATGGGAA TGGAAGGAGC GGTCCGGCTC GACGATCACG GGCCAGGTTC GCCGCATCGG CGCGTCGCCC GACTGGTCGC GCGAATACTT CACGATGAAC GACAAGATGT CGGAGGCCGT GCGCGAAGTG TTCGTCCGCC TCTATGAACA AGGGCTCATC TATCGCGGCA AGCGCCTCGT GAACTGGGAC CCCGTGCTGC TCACCGCCGT GTCCGATCTC GAAGTGGTGA GCGAGGAGGA AAACGGCCAT CTGTGGCACA TCCGATACCC GCTCGCGGAC GGCTCGGGCC ACCTGAGCGT CGCGACGACG CGCCCCGAGA CGATGCTCGG CGACGTCGCG GTGATGGTGC ATCCGGAAGA CGAACGCTAC CGGCACCTCG TCGGCCGGCA CGTGAAGCTG CCGCTGTGCG AGCGCGAGAT TCCGATCATC GCGGACGACT ACGTCGATCG CGAGTTCGGC ACGGGCGTCG TGAAGGTCAC GCCCGCGCAC GATTTCAACG ACTACCAGGT CGGCCTGCGC CACGCGCTCG CGCCGATCGA GATTCTCACG CTCGACGCGA AGATCAACGA CAACGCGCCC GCCGCTTACC GCGGCCTCGA TCGCTTCGAC GCGCGCAAGG CCATCGTCGA CGAGCTCGAC GCGCAGGGCT TGCTCGAATC GGTGAAGCCG CACAAGCTGA TGGTGCCGCG CGGCGACCGC ACGGGCGTCG TGATCGAGCC GATGCTGACC GACCAGTGGT TCGTCGCGAT GACGAAGCCC GCGCCGCAAG GCACCTTCCA TCCGGGCAAG TCGATCACCG AGGTCTCGCT CGAGGTCGTG CGCCGCGGCG AGATCAAGTT CGTGCCCGAG AACTGGACGA CCACCTACTA CCAGTGGCTC GAGAACATCC AGGACTGGTG CATCTCGCGC CAGCTGTGGT GGGGCCACCA GATTCCCGCG TGGTATGGCG AAAACGGCGA GATCTTCGTC GCGCGCAACG AAGAGGACGC GCGCGCGCAA GCCGCCGCGA AGGGCTACAC GGGTGCGCTC AAGCGCGACG ACGACGTGCT CGACACGTGG TTCTCGTCGG CGCTCGTGCC GTTCTCCTCG CTCGGCTGGC CGAACGAGAC GCCCGAGATG AAACACTTCC TGCCGTCGTC GGTGCTCGTC ACCGGCTTCG ACATCATCTT CTTCTGGGTC GCCCGGATGG TGATGATGAC GACGCACTTC ACGGGCAAGG TGCCGTTCGG GACGGTCTAC GTGCACGGGC TCGTGCGCGA CGCCGAAGGC CAGAAGATGT CCAAGAGCAA GGGCAACACG CTCGACCCGA TCGACATCGT CGACGGCATC GGCCTCGACG CGCTCGTCGC AAAGCGCACG ACGGGGCTGA TGAATCCGAA GCAGGCGGCG ACGATCGAGA AGAAGACGCG CAAGGAATTC CCCGACGGCA TCCCCGCGTT CGGCACCGAC GCGCTGCGCT TCACGATGGC GTCGATGGCG ACGCTCGGGC GCAACGTGAA CTTCGATCTC GCGCGCTGCG AAGGCTATCG CAACTTCTGC AACAAGCTGT GGAACGCGAC GCGCTTCGTG CTGATGAACT GCGAAGGCCA CGACTGCGGC TTCGACAAGC CGGAAGTCTG CGGCGCGGGC GATTGCGGCC CCGGCGGCTA TCTCGACTTC TCGCCGGCGG ACCGCTGGAT CGTCTCGCTC ATGCAGCGCG TCGAGGCGGA CATCGCGAAG GGCTTCGCCG ACTATCGCTT CGACAACATC GCGAACGCGA TCTACAAGTT CGTCTGGGAC GAATACTGCG ACTGGTATCT CGAGCTCGCG AAGGTGCAGA TCCAGAACGG CACGCCCGAG CAGCAGCGCG CGACGCGCCG CACGCTGCTG CGCGTGCTGG AAACGGTGCT GCGCCTCGCG CACCCGATCA TCCCGTTCAT CACCGAGGCG CTGTGGCAGA AGGTCGCGCC GCTCGCCGGC CGCTATCCGG CGGGCAAGGC GGAGGGCGAA GCGTCGCTGA TGGTGCAGGC GTATCCGGTG GCCGAGCCGA AGAAGCTCGA CGAGGCTTGC GAACAGTGGG CGGCCGAACT GAAGGCCGTG GTCGATGCGT GTCGTAATCT ACGCGGCGAG ATGAATCTGT CTCCGGCGAC CAAGGTGCCG CTTCTCGCGG CCGGCGACGC GGCGCAACTG CAGGCGTTCG CGCCCTATGT GCAGGCGCTC GCGCGCCTGT CCGAAGTGCG CGTGCTGCCG GACGAAGCGG CGCTCGACGC CGACGCGCAC GGCGCGCCGA TCGCGATCGT CGGCGGCAAC AAGCTGGTGC TGAAGGTCGA GATCGACGTC GCGGCCGAAC GCGAGCGCCT GTCGAAGGAA ATCGCGCGTC TCGAAGGCGA GATCGCCAAG TGCAACGCGA AGCTCGGCAA CGAGGCGTTC GTCGCGAAGG CACCGCCCGC GGTGGTCGCG CAGGAGCAAA AACGGCTGGC GGAGTTTCAG AGCACGTTGA CGAAACTCGG CGCGCAGCTC GCTCGCTTGC CGGCGTAA
|
Protein sequence | MSDTTLAKSF EPQTIESQWG PEWEKRGYAT PALDPSRPDF SIQLPPPNVT GTLHMGHAFN QTIMDGLVRY HRMLGHNTLW VPGTDHAGIA TQIVVERQLD AQGVSRHDLG REKFVERVWE WKERSGSTIT GQVRRIGASP DWSREYFTMN DKMSEAVREV FVRLYEQGLI YRGKRLVNWD PVLLTAVSDL EVVSEEENGH LWHIRYPLAD GSGHLSVATT RPETMLGDVA VMVHPEDERY RHLVGRHVKL PLCEREIPII ADDYVDREFG TGVVKVTPAH DFNDYQVGLR HALAPIEILT LDAKINDNAP AAYRGLDRFD ARKAIVDELD AQGLLESVKP HKLMVPRGDR TGVVIEPMLT DQWFVAMTKP APQGTFHPGK SITEVSLEVV RRGEIKFVPE NWTTTYYQWL ENIQDWCISR QLWWGHQIPA WYGENGEIFV ARNEEDARAQ AAAKGYTGAL KRDDDVLDTW FSSALVPFSS LGWPNETPEM KHFLPSSVLV TGFDIIFFWV ARMVMMTTHF TGKVPFGTVY VHGLVRDAEG QKMSKSKGNT LDPIDIVDGI GLDALVAKRT TGLMNPKQAA TIEKKTRKEF PDGIPAFGTD ALRFTMASMA TLGRNVNFDL ARCEGYRNFC NKLWNATRFV LMNCEGHDCG FDKPEVCGAG DCGPGGYLDF SPADRWIVSL MQRVEADIAK GFADYRFDNI ANAIYKFVWD EYCDWYLELA KVQIQNGTPE QQRATRRTLL RVLETVLRLA HPIIPFITEA LWQKVAPLAG RYPAGKAEGE ASLMVQAYPV AEPKKLDEAC EQWAAELKAV VDACRNLRGE MNLSPATKVP LLAAGDAAQL QAFAPYVQAL ARLSEVRVLP DEAALDADAH GAPIAIVGGN KLVLKVEIDV AAERERLSKE IARLEGEIAK CNAKLGNEAF VAKAPPAVVA QEQKRLAEFQ STLTKLGAQL ARLPA
|
| |