Gene BURPS1106A_1691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1691 
SymbolvalS 
ID4899607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1647272 
End bp1650139 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content66% 
IMG OID640134921 
Productvalyl-tRNA synthetase 
Protein accessionYP_001065962 
Protein GI126452676 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA CCACGCTTGC GAAAAGTTTC GAGCCCCAGA CCATCGAATC CCAATGGGGG 
CCGGAATGGG AAAAGCGCGG CTATGCGACC CCCGCGCTCG ATCCGAGCCG GCCGGACTTC
TCGATCCAGT TGCCGCCCCC GAACGTGACG GGCACGCTGC ACATGGGCCA CGCGTTCAAT
CAGACGATCA TGGACGGGCT CGTGCGCTAC CACCGGATGC TCGGCCACAA CACGCTGTGG
GTGCCCGGCA CCGACCACGC GGGCATCGCG ACGCAGATCG TCGTCGAGCG CCAGCTCGAT
GCGCAGGGCG TGTCGCGCCA CGATCTCGGC CGCGAGAAAT TCGTCGAGCG CGTATGGGAA
TGGAAGGAGC GGTCCGGCTC GACGATCACG GGCCAGGTTC GCCGCATCGG CGCGTCGCCC
GACTGGTCGC GCGAATACTT CACGATGAAC GACAAGATGT CGGAGGCCGT GCGCGAAGTG
TTCGTCCGCC TCTATGAACA AGGGCTCATC TATCGCGGCA AGCGCCTCGT GAACTGGGAC
CCCGTGCTGC TCACCGCCGT GTCCGATCTC GAAGTGGTGA GCGAGGAGGA AAACGGCCAT
CTGTGGCACA TCCGATACCC GCTCGCGGAC GGCTCGGGCC ACCTGAGCGT CGCGACGACG
CGCCCCGAGA CGATGCTCGG CGACGTCGCG GTGATGGTGC ATCCGGAAGA CGAACGCTAC
CGGCACCTCG TCGGCCGGCA CGTGAAGCTG CCGCTGTGCG AGCGCGAGAT TCCGATCATC
GCGGACGACT ACGTCGATCG CGAGTTCGGC ACGGGCGTCG TGAAGGTCAC GCCCGCGCAC
GATTTCAACG ACTACCAGGT CGGCCTGCGC CACGCGCTCG CGCCGATCGA GATTCTCACG
CTCGACGCGA AGATCAACGA CAACGCGCCC GCCGCTTACC GCGGCCTCGA TCGCTTCGAC
GCGCGCAAGG CCATCGTCGA CGAGCTCGAC GCGCAGGGCT TGCTCGAATC GGTGAAGCCG
CACAAGCTGA TGGTGCCGCG CGGCGACCGC ACGGGCGTCG TGATCGAGCC GATGCTGACC
GACCAGTGGT TCGTCGCGAT GACGAAGCCC GCGCCGCAAG GCACCTTCCA TCCGGGCAAG
TCGATCACCG AGGTCTCGCT CGAGGTCGTG CGCCGCGGCG AGATCAAGTT CGTGCCCGAG
AACTGGACGA CCACCTACTA CCAGTGGCTC GAGAACATCC AGGACTGGTG CATCTCGCGC
CAGCTGTGGT GGGGCCACCA GATTCCCGCG TGGTATGGCG AAAACGGCGA GATCTTCGTC
GCGCGCAACG AAGAGGACGC GCGCGCGCAA GCCGCCGCGA AGGGCTACAC GGGTGCGCTC
AAGCGCGACG ACGACGTGCT CGACACGTGG TTCTCGTCGG CGCTCGTGCC GTTCTCCTCG
CTCGGCTGGC CGAACGAGAC GCCCGAGATG AAACACTTCC TGCCGTCGTC GGTGCTCGTC
ACCGGCTTCG ACATCATCTT CTTCTGGGTC GCCCGGATGG TGATGATGAC GACGCACTTC
ACGGGCAAGG TGCCGTTCGG GACGGTCTAC GTGCACGGGC TCGTGCGCGA CGCCGAAGGC
CAGAAGATGT CCAAGAGCAA GGGCAACACG CTCGACCCGA TCGACATCGT CGACGGCATC
GGCCTCGACG CGCTCGTCGC GAAGCGCACG ACGGGGCTGA TGAATCCGAA GCAGGCGGCG
ACGATCGAGA AGAAGACGCG CAAGGAATTC CCCGACGGCA TCCCCGCGTT CGGCACCGAC
GCGCTGCGCT TCACGATGGC GTCGATGGCG ACGCTCGGGC GCAACGTGAA CTTCGATCTC
GCGCGCTGCG AAGGCTATCG CAACTTCTGC AACAAGCTGT GGAACGCGAC GCGCTTCGTG
CTGATGAACT GCGAAGGCCA CGACTGCGGC TTCGACAAGC CGGAAGTCTG CGGCGCGGGC
GATTGCGGCC CCGGCGGCTA TCTCGACTTC TCGCCGGCGG ACCGCTGGAT CGTCTCGCTC
ATGCAGCGCG TCGAGGCGGA CATCGCGAAG GGCTTCGCCG ACTATCGCTT CGACAACATC
GCGAACGCGA TCTACAAGTT CGTCTGGGAC GAATACTGCG ACTGGTATCT CGAGCTCGCG
AAGGTGCAGA TCCAGAACGG CACGCCCGAG CAGCAGCGCG CGACGCGCCG CACGCTGCTG
CGCGTGCTGG AAACGGTGCT GCGCCTCGCG CACCCGATCA TCCCGTTCAT CACCGAGGCG
CTGTGGCAGA AGGTCGCGCC GCTCGCCGGC CGCTATCCGG CGGGCAAGGC GGAGGGCGAA
GCGTCGCTGA TGGTGCAGGC GTATCCGGTG GCCGAGCCGA AGAAGCTCGA CGAGGCTTGC
GAACAGTGGG CGGCCGAACT GAAGGCCGTG GTCGATGCGT GTCGTAATCT ACGCGGCGAG
ATGAATCTGT CTCCGGCGAC CAAGGTGCCG CTTCTCGCGG CCGGCGACGC GGCGCAACTG
CAGGCGTTCG CGCCCTATGT GCAGGCGCTC GCGCGCCTGT CCGAAGTGCG CGTGCTGCCG
GACGAAGCGG CGCTCGACGC CGACGCGCAC GGCGCGCCGA TCGCGATCGT CGGCGGCAAC
AAGCTGGTGC TGAAGGTCGA GATCGACGTC GCGGCCGAAC GCGAGCGCCT GTCGAAGGAA
ATCGCGCGTC TCGAAGGCGA GATCGTCAAG TGCAACGCGA AGCTCGGCAA CGAGGCGTTC
GTCGCGAAGG CGCCGCCCGC GGTGGTCGCG CAGGAGCAAA AACGGCTGGC GGAGTTTCAG
AGCACGTTGA CGAAACTCGG CGCGCAGCTC GCTCGCTTGC CGGCGTAA
 
Protein sequence
MSDTTLAKSF EPQTIESQWG PEWEKRGYAT PALDPSRPDF SIQLPPPNVT GTLHMGHAFN 
QTIMDGLVRY HRMLGHNTLW VPGTDHAGIA TQIVVERQLD AQGVSRHDLG REKFVERVWE
WKERSGSTIT GQVRRIGASP DWSREYFTMN DKMSEAVREV FVRLYEQGLI YRGKRLVNWD
PVLLTAVSDL EVVSEEENGH LWHIRYPLAD GSGHLSVATT RPETMLGDVA VMVHPEDERY
RHLVGRHVKL PLCEREIPII ADDYVDREFG TGVVKVTPAH DFNDYQVGLR HALAPIEILT
LDAKINDNAP AAYRGLDRFD ARKAIVDELD AQGLLESVKP HKLMVPRGDR TGVVIEPMLT
DQWFVAMTKP APQGTFHPGK SITEVSLEVV RRGEIKFVPE NWTTTYYQWL ENIQDWCISR
QLWWGHQIPA WYGENGEIFV ARNEEDARAQ AAAKGYTGAL KRDDDVLDTW FSSALVPFSS
LGWPNETPEM KHFLPSSVLV TGFDIIFFWV ARMVMMTTHF TGKVPFGTVY VHGLVRDAEG
QKMSKSKGNT LDPIDIVDGI GLDALVAKRT TGLMNPKQAA TIEKKTRKEF PDGIPAFGTD
ALRFTMASMA TLGRNVNFDL ARCEGYRNFC NKLWNATRFV LMNCEGHDCG FDKPEVCGAG
DCGPGGYLDF SPADRWIVSL MQRVEADIAK GFADYRFDNI ANAIYKFVWD EYCDWYLELA
KVQIQNGTPE QQRATRRTLL RVLETVLRLA HPIIPFITEA LWQKVAPLAG RYPAGKAEGE
ASLMVQAYPV AEPKKLDEAC EQWAAELKAV VDACRNLRGE MNLSPATKVP LLAAGDAAQL
QAFAPYVQAL ARLSEVRVLP DEAALDADAH GAPIAIVGGN KLVLKVEIDV AAERERLSKE
IARLEGEIVK CNAKLGNEAF VAKAPPAVVA QEQKRLAEFQ STLTKLGAQL ARLPA