Gene BURPS1710b_1846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1846 
SymbolvalS 
ID3690354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2014316 
End bp2017183 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content66% 
IMG OID637728302 
Productvalyl-tRNA synthetase 
Protein accessionYP_333247 
Protein GI162210068 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.344373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA CCACGCTTGC GAAAAGTTTC GAGCCCCAGA CCATCGAATC CCAATGGGGG 
CCGGAATGGG AAAAGCGCGG CTATGCGACC CCCGCGCTCG ATCCGAGCCG GCCGGACTTC
TCGATCCAGT TGCCGCCCCC GAACGTGACG GGCACGCTGC ACATGGGCCA CGCGTTCAAT
CAGACGATCA TGGACGGGCT CGTGCGCTAC CACCGGATGC TCGGCCACAA CACGCTGTGG
GTGCCCGGCA CCGACCACGC GGGCATCGCG ACGCAGATCG TCGTCGAGCG CCAGCTCGAT
GCGCAGGGCG TGTCGCGCCA CGATCTCGGC CGCGAGAAAT TCGTCGAGCG CGTATGGGAA
TGGAAGGAGC GGTCCGGCTC GACGATCACG GGCCAGGTTC GCCGCATCGG CGCGTCGCCC
GACTGGTCGC GCGAATACTT CACGATGAAC GACAAGATGT CGGAGGCCGT GCGCGAAGTG
TTCGTCCGCC TCTATGAACA AGGGCTCATC TATCGCGGCA AGCGCCTCGT GAACTGGGAC
CCCGTGCTGC TCACCGCCGT GTCCGATCTC GAAGTGGTGA GCGAGGAGGA AAACGGCCAT
CTGTGGCACA TCCGATACCC GCTCGCGGAC GGCTCGGGCC ACCTGAGCGT CGCGACGACG
CGCCCCGAGA CGATGCTCGG CGACGTCGCG GTGATGGTGC ATCCGGAAGA CGAACGCTAC
CGGCACCTCG TCGGCCGGCA CGTGAAGCTG CCGCTGTGCG AGCGCGAGAT TCCGATCATC
GCGGACGACT ACGTCGATCG CGAGTTCGGC ACGGGCGTCG TGAAGGTCAC GCCCGCGCAC
GATTTCAACG ACTACCAGGT CGGCCTGCGC CACGCGCTCG CGCCGATCGA GATTCTCACG
CTCGACGCGA AGATCAACGA CAACGCGCCC GCCGCTTACC GCGGCCTCGA TCGCTTCGAC
GCGCGCAAGG CCATCGTCGA CGAGCTCGAC GCGCAGGGCT TGCTCGAATC GGTGAAGCCG
CACAAGCTGA TGGTGCCGCG CGGCGACCGC ACGGGCGTCG TGATCGAGCC GATGCTGACC
GACCAGTGGT TCGTCGCGAT GACGAAGCCC GCGCCGCAAG GCACCTTCCA TCCGGGCAAG
TCGATCACCG AGGTCTCGCT CGAGGTCGTG CGCCGCGGCG AGATCAAGTT CGTGCCCGAG
AACTGGACGA CCACCTACTA CCAGTGGCTC GAGAACATCC AGGACTGGTG CATCTCGCGC
CAGCTGTGGT GGGGCCACCA GATTCCCGCG TGGTATGGCG AAAACGGCGA GATCTTCGTC
GCGCGCAACG AAGAGGACGC GCGCGCGCAA GCCGCCGCGA AGGGCTACAC GGGTGCGCTC
AAGCGCGACG ACGACGTGCT CGACACGTGG TTCTCGTCGG CGCTCGTGCC GTTCTCCTCG
CTCGGCTGGC CGAACGAGAC GCCCGAGATG AAACACTTCC TGCCGTCGTC GGTGCTCGTC
ACCGGCTTCG ACATCATCTT CTTCTGGGTC GCCCGGATGG TGATGATGAC GACGCACTTC
ACGGGCAAGG TGCCGTTCGG GACGGTCTAC GTGCACGGGC TCGTGCGCGA CGCCGAAGGC
CAGAAGATGT CCAAGAGCAA GGGCAACACG CTCGACCCGA TCGACATCGT CGACGGCATC
GGCCTCGACG CGCTCGTCGC AAAGCGCACG ACGGGGCTGA TGAATCCGAA GCAGGCGGCG
ACGATCGAGA AGAAGACGCG CAAGGAATTC CCCGACGGCA TCCCCGCGTT CGGCACCGAC
GCGCTGCGCT TCACGATGGC GTCGATGGCG ACGCTCGGGC GCAACGTGAA CTTCGATCTC
GCGCGCTGCG AAGGCTATCG CAACTTCTGC AACAAGCTGT GGAACGCGAC GCGCTTCGTG
CTGATGAACT GCGAAGGCCA CGACTGCGGC TTCGACAAGC CGGAAGTCTG CGGCGCGGGC
GATTGCGGCC CCGGCGGCTA TCTCGACTTC TCGCCGGCGG ACCGCTGGAT CGTCTCGCTC
ATGCAGCGCG TCGAGGCGGA CATCGCGAAG GGCTTCGCCG ACTATCGCTT CGACAACATC
GCGAACGCGA TCTACAAGTT CGTCTGGGAC GAATACTGCG ACTGGTATCT CGAGCTCGCG
AAGGTGCAGA TCCAGAACGG CACGCCCGAG CAGCAGCGCG CGACGCGCCG CACGCTGCTG
CGCGTGCTGG AAACGGTGCT GCGCCTCGCG CACCCGATCA TCCCGTTCAT CACCGAGGCG
CTGTGGCAGA AGGTCGCGCC GCTCGCCGGC CGCTATCCGG CGGGCAAGGC GGAGGGCGAA
GCGTCGCTGA TGGTGCAGGC GTATCCGGTG GCCGAGCCGA AGAAGCTCGA CGAGGCTTGC
GAACAGTGGG CGGCCGAACT GAAGGCCGTG GTCGATGCGT GTCGTAATCT ACGCGGCGAG
ATGAATCTGT CTCCGGCGAC CAAGGTGCCG CTTCTCGCGG CCGGCGACGC GGCGCAACTG
CAGGCGTTCG CGCCCTATGT GCAGGCGCTC GCGCGCCTGT CCGAAGTGCG CGTGCTGCCG
GACGAAGCGG CGCTCGACGC CGACGCGCAC GGCGCGCCGA TCGCGATCGT CGGCGGCAAC
AAGCTGGTGC TGAAGGTCGA GATCGACGTC GCGGCCGAAC GCGAGCGCCT GTCGAAGGAA
ATCGCGCGTC TCGAAGGCGA GATCGCCAAG TGCAACGCGA AGCTCGGCAA CGAGGCGTTC
GTCGCGAAGG CACCGCCCGC GGTGGTCGCG CAGGAGCAAA AACGGCTGGC GGAGTTTCAG
AGCACGTTGA CGAAACTCGG CGCGCAGCTC GCTCGCTTGC CGGCGTAA
 
Protein sequence
MSDTTLAKSF EPQTIESQWG PEWEKRGYAT PALDPSRPDF SIQLPPPNVT GTLHMGHAFN 
QTIMDGLVRY HRMLGHNTLW VPGTDHAGIA TQIVVERQLD AQGVSRHDLG REKFVERVWE
WKERSGSTIT GQVRRIGASP DWSREYFTMN DKMSEAVREV FVRLYEQGLI YRGKRLVNWD
PVLLTAVSDL EVVSEEENGH LWHIRYPLAD GSGHLSVATT RPETMLGDVA VMVHPEDERY
RHLVGRHVKL PLCEREIPII ADDYVDREFG TGVVKVTPAH DFNDYQVGLR HALAPIEILT
LDAKINDNAP AAYRGLDRFD ARKAIVDELD AQGLLESVKP HKLMVPRGDR TGVVIEPMLT
DQWFVAMTKP APQGTFHPGK SITEVSLEVV RRGEIKFVPE NWTTTYYQWL ENIQDWCISR
QLWWGHQIPA WYGENGEIFV ARNEEDARAQ AAAKGYTGAL KRDDDVLDTW FSSALVPFSS
LGWPNETPEM KHFLPSSVLV TGFDIIFFWV ARMVMMTTHF TGKVPFGTVY VHGLVRDAEG
QKMSKSKGNT LDPIDIVDGI GLDALVAKRT TGLMNPKQAA TIEKKTRKEF PDGIPAFGTD
ALRFTMASMA TLGRNVNFDL ARCEGYRNFC NKLWNATRFV LMNCEGHDCG FDKPEVCGAG
DCGPGGYLDF SPADRWIVSL MQRVEADIAK GFADYRFDNI ANAIYKFVWD EYCDWYLELA
KVQIQNGTPE QQRATRRTLL RVLETVLRLA HPIIPFITEA LWQKVAPLAG RYPAGKAEGE
ASLMVQAYPV AEPKKLDEAC EQWAAELKAV VDACRNLRGE MNLSPATKVP LLAAGDAAQL
QAFAPYVQAL ARLSEVRVLP DEAALDADAH GAPIAIVGGN KLVLKVEIDV AAERERLSKE
IARLEGEIAK CNAKLGNEAF VAKAPPAVVA QEQKRLAEFQ STLTKLGAQL ARLPA