Gene BMASAVP1_A1460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A1460 
SymbolvalS 
ID4680646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp1427257 
End bp1430124 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content66% 
IMG OID639845730 
Productvalyl-tRNA synthetase 
Protein accessionYP_992791 
Protein GI121598960 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA CCACGCTTGC GAAAAGTTTC GAGCCCCAGA CCATCGAATC CCAATGGGGG 
CCGGAATGGG AAAAGCGCGG CTATGCGACC CCCGCGCTCG ATCCGAGCCG GCCGGACTTC
TCGATCCAGT TGCCGCCCCC GAACGTGACG GGCACGCTGC ACATGGGCCA CGCGTTCAAT
CAGACGATCA TGGACGGGCT CGTGCGCTAC CACCGGATGC TCGGCCACAA CACGCTGTGG
GTGCCCGGCA CCGACCACGC GGGCATCGCG ACGCAGATCG TCGTCGAGCG CCAGCTCGAT
GCGCAGGGCG TGTCGCGCCA CGATCTCGGC CGCGAGAAAT TCGTCGAGCG CGTATGGGAA
TGGAAGGAGC GGTCCGGCTC GACGATCACG GGCCAGGTTC GCCGCATCGG CGCGTCGCCC
GACTGGTCGC GCGAATACTT CACGATGAAC GACAAGATGT CGGAGGCCGT GCGCGAAGTG
TTCGTCCGCC TCTATGAACA AGGGCTCATC TATCGCGGCA AGCGCCTCGT GAACTGGGAC
CCCGTGCTGC TCACCGCCGT GTCCGATCTC GAAGTGGTGA GCGAGGAGGA AAACGGCCAT
CTGTGGCACA TCCGATACCC GCTCGCGGAC GGCTCGGGCC ACCTGAGCGT CGCGACGACG
CGCCCCGAGA CGATGCTCGG CGACGTCGCG GTGATGGTGC ATCCGGAAGA CGAACGCTAC
CGGCACCTCG TCGGCCGGCA CGTGAAGCTG CCGCTGTGCG AACGCGAGAT TCCGATCATC
GCGGACGACT ACGTCGATCG CGAGTTCGGC ACGGGCGTCG TGAAGGTCAC GCCCGCGCAC
GATTTCAACG ACTACCAGGT CGGCCTGCGC CACGCGCTCG CGCCGATCGA GATTCTCACG
CTCGACGCGA AGATCAACGA CAACGCGCCC GCCGCTTACC GCGGCCTCGA TCGCTTCGAC
GCGCGCAAGG CCATCGTCGA CGAGCTCGAC GCGCAGGGCT TGCTCGAATC GGTGAAGCCG
CACAAGCTGA TGGTGCCGCG CGGCGACCGC ACGGGCGTCG TGATCGAGCC GATGCTGACC
GACCAGTGGT TCGTCGCGAT GACGAAGCCC GCGCCGCAAG GCACCTTCCA TCCGGGCAAG
TCGATCACCG AGGTCTCGCT CGAGGTCGTG CGCCGCGGCG AGATCAAGTT CGTGCCCGAG
AACTGGACGA CCACCTACTA CCAGTGGCTC GAGAACATCC AGGACTGGTG CATCTCGCGC
CAGCTGTGGT GGGGCCACCA GATTCCCGCG TGGTATGGCG AAAACGGCGA GATCTTCGTC
GCGCGCAACG AAGAGGACGC GCGCGCGCAA GCCGCCGCGA AGGGCTACAC GGGTGCGCTC
AAGCGCGACG ACGACGTGCT CGACACGTGG TTCTCGTCGG CGCTCGTGCC GTTCTCCTCG
CTCGGCTGGC CGAACGAGAC GCCCGAGATG AAACACTTCC TGCCGTCGTC GGTGCTCGTC
ACCGGCTTCG ACATCATCTT CTTCTGGGTC GCCCGGATGG TGATGATGAC GACGCACTTC
ACGGGCAAGG TGCCGTTCGG GACGGTCTAC GTGCACGGGC TCGTGCGCGA CGCCGAAGGC
CAGAAGATGT CCAAGAGCAA GGGCAACACG CTCGACCCGA TCGACATCGT CGACGGCATC
GGCCTCGACG CGCTCGTCGC GAAGCGCACG ACGGGGCTGA TGAATCCGAG GCAGGCGGCG
ACGATCGAGA AGAAGACGCG CAAGGAATTC CCCGACGGCA TCCCCGCGTT CGGCACCGAC
GCGCTGCGCT TCACGATGGC GTCGATGGCG ACGCTCGGGC GCAACGTGAA CTTCGATCTC
GCGCGCTGCG AAGGCTATCG CAACTTCTGC AACAAGCTGT GGAACGCGAC GCGCTTCGTG
CTGATGAACT GCGAAGGCCA CGACTGCGGC TTCGACAAGC CGGAAGTCTG CGGCGCGGGC
GATTGCGGCC CCGGCGGCTA TCTCGACTTC TCGCCGGCGG ACCGCTGGAT CGTCTCGCTC
ATGCAGCGCG TCGAGGCGGA CATCGCGAAG GGCTTCGCCG ACTATCGCTT CGACAACATC
GCGAACGCGA TCTACAAGTT CGTCTGGGAC GAATACTGCG ACTGGTATCT CGAGCTCGCG
AAGGTGCAGA TCCAGAACGG CACGTCCGAG CAGCAGCGCG CGACGCGCCG CACGCTGCTG
CGCGTGCTGG AAACGGTGCT GCGCCTCGCG CACCCGATCA TCCCGTTCAT CACCGAGGCG
CTGTGGCAGA AGGTCGCGCC GCTCGCCGGC CGCTATCCGG CGGGCAAGGC GGAGGGCGAA
GCGTCGCTGA TGGTGCAGGC GTATCCGGTG GCCGAGCCGA AGAAGCTCGA CGAGGCTTGC
GAACAGTGGG CGGCCGAACT GAAGGCCGTG GTCGATGCGT GTCGTAATCT ACGCGGCGAG
ATGAATCTGT CTCCGGCGAC CAAGGTGCCG CTTCTCGCGG CCGGCGACGC GGCGCAACTG
CAGGCGTTCG CGCCCTATGT GCAGGCGCTC GCGCGCCTGT CCGAAGTGCG CGTGCTGCCG
GACGAAGCGG CGCTCGACGC CGACGCGCAC GGCGCGCCGA TCGCGATCGT CGGCGGCAAC
AAGCTGGTGC TGAAGGTCGA GATCGACGTC GCGGCCGAAC GCGAGCGCCT GTCGAAGGAA
ATCGCGCGTC TCGAAGGCGA GATCGTCAAG TGCAACGCGA AGCTCGGCAA CGAGGCGTTC
GTCGCGAAGG CGCCGCCCGC GGTGGTCGCG CAGGAGCAAA AACGGCTGGC GGAGTTTCAG
AGCACGTTGA CGAAACTCGG CGCGCAGCTC GCTCGCTTGC CGGCGTAA
 
Protein sequence
MSDTTLAKSF EPQTIESQWG PEWEKRGYAT PALDPSRPDF SIQLPPPNVT GTLHMGHAFN 
QTIMDGLVRY HRMLGHNTLW VPGTDHAGIA TQIVVERQLD AQGVSRHDLG REKFVERVWE
WKERSGSTIT GQVRRIGASP DWSREYFTMN DKMSEAVREV FVRLYEQGLI YRGKRLVNWD
PVLLTAVSDL EVVSEEENGH LWHIRYPLAD GSGHLSVATT RPETMLGDVA VMVHPEDERY
RHLVGRHVKL PLCEREIPII ADDYVDREFG TGVVKVTPAH DFNDYQVGLR HALAPIEILT
LDAKINDNAP AAYRGLDRFD ARKAIVDELD AQGLLESVKP HKLMVPRGDR TGVVIEPMLT
DQWFVAMTKP APQGTFHPGK SITEVSLEVV RRGEIKFVPE NWTTTYYQWL ENIQDWCISR
QLWWGHQIPA WYGENGEIFV ARNEEDARAQ AAAKGYTGAL KRDDDVLDTW FSSALVPFSS
LGWPNETPEM KHFLPSSVLV TGFDIIFFWV ARMVMMTTHF TGKVPFGTVY VHGLVRDAEG
QKMSKSKGNT LDPIDIVDGI GLDALVAKRT TGLMNPRQAA TIEKKTRKEF PDGIPAFGTD
ALRFTMASMA TLGRNVNFDL ARCEGYRNFC NKLWNATRFV LMNCEGHDCG FDKPEVCGAG
DCGPGGYLDF SPADRWIVSL MQRVEADIAK GFADYRFDNI ANAIYKFVWD EYCDWYLELA
KVQIQNGTSE QQRATRRTLL RVLETVLRLA HPIIPFITEA LWQKVAPLAG RYPAGKAEGE
ASLMVQAYPV AEPKKLDEAC EQWAAELKAV VDACRNLRGE MNLSPATKVP LLAAGDAAQL
QAFAPYVQAL ARLSEVRVLP DEAALDADAH GAPIAIVGGN KLVLKVEIDV AAERERLSKE
IARLEGEIVK CNAKLGNEAF VAKAPPAVVA QEQKRLAEFQ STLTKLGAQL ARLPA