Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcenmc03_1419 |
Symbol | valS |
ID | 6123097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia MC0-3 |
Kingdom | Bacteria |
Replicon accession | NC_010508 |
Strand | - |
Start bp | 1569023 |
End bp | 1571890 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641637995 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001764716 |
Protein GI | 170732769 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA ACACGCTTGC GAAGAGCTTC GAGCCCCATA CCATCGAGTC CCAATGGGGG CCGGAGTGGG AAAAACGCGG CTATGCCGCC CCGGCATTCG ATCCGGCCCG CCCCGATTTC GCGATCCAGT TGCCGCCGCC GAACGTGACG GGCACGCTGC ACATGGGCCA CGCGTTCAAT CAGACGATCA TGGACGGCCT CGCCCGCTAC CACCGGATGC TCGGCGAGAA CACGCTGTGG GTGCCCGGCA CCGACCACGC GGGGATCGCC ACCCAGATCG TGGTCGAGCG CCAGCTCGAC GCGCAAGGCG TGTCGCGCCA CGACCTCGGC CGCGAGAAGT TCGTCGAGCG CGTGTGGGAG TGGAAGCAGA AATCCGGTTC GACGATCACC GACCAGGTGC GCCGCCTCGG CGCGTCGACC GACTGGTCGC GCGAATACTT CACGATGGAC GACAAGATGT CGGCCGCCGT GCGCGACGTG TTCGTCACGC TGTATGAACA AGGGCTGATC TATCGCGGCA AGCGCCTCGT CAACTGGGAT CCGGTGCTGC TCACCGCCGT GTCGGACCTG GAAGTCGTCA GCGAAGAGGA AAACGGCCAC CTGTGGCACA TCCGCTACCC GCTCGTCGAC GGCTCGGGCT CGCTGACGGT CGCCACCACG CGTCCCGAAA CGATGCTCGG CGACGTCGCG GTGATGGTCC ATCCGGAAGA CGAACGCTAC GCGCACCTGA TCGGCAAGCT CGTCACGCTG CCGCTGACGG GCCGCGAGAT TCCGGTGATC GCCGACGACT ACGTCGACCG CGAGTTCGGC ACGGGCGTCG TGAAGGTCAC GCCCGCGCAC GATTTCAACG ACTACCAGGT CGGCCTGCGC CACAAGCTCG CGCCGATCGA GATCCTGACG CTCGACGCGA AGATCAACGA CAACGGCCCC GAGCAGTATC GCGGCCTCGA CCGCTTCGAC GCGCGCAAGG CGATCGTCGC CGACCTCGAC GCGCAGGGCT TCCTCGACTC CGTGAAGCCG CACAAGCTGA TGGTGCCGCG CGGCGACCGC ACGGGCGTCG TGATCGAGCC GTTGCTGACC GACCAGTGGT TCGTCGCGAT GACGAAGCCG GCGCCGGAAG GCACGTTCCA TCCGGGCAAG TCGATCACCG AGGTATCGCT CGACGTGGTG CGCGACGGCC AGATCAAGTT CGTCCCGGAA AACTGGACGA CCACCTACTA CCAGTGGCTC GAGAACATCC AGGACTGGTG CATCTCGCGC CAACTGTGGT GGGGCCACCA GATTCCCGCG TGGTACGGCG AGAACGGCGA GGTATTCGTC GCGCGCAACG AGGAAGACGC ACGCGCGCAG GCCGCCGCGA AGGGTTACGC GGGGGCGCTC AAGCGCGACG AAGACGTGCT CGATACGTGG TTCTCGTCGG CGCTCGTGCC GTTCTCGTCG CTCGGCTGGC CGAACGAAAC GCCCGAACTG AAGCACTTCC TGCCGTCGTC GGTGCTCGTC ACCGGCTTCG ACATCATCTT CTTCTGGGTC GCCCGGATGG TGATGATGAC CACGCACTTC ACCGGCAAGG TGCCGTTCCA TACCGTCTAC ATGCACGGCC TCGTGCGCGA CGCGGAAGGC CAGAAGATGT CGAAGAGCAA GGGCAACACG CTCGACCCGA TCGACATCGT CGACGGCATC GACCTCGAAT CGCTGGTCGC GAAGCGCACG ACGGGCCTGA TGAACCCGAA GCAGGCCGCG ACCATCGAGA AGAAGACGCG CAAGGAATTC CCGGACGGCA TCCCCGCGTT CGGCACCGAC GCGCTGCGCT TCACGATGGC GTCGATGGCG ACGCTCGGCC GCAATGTGAA CTTCGACCTC GCGCGCTGCG AAGGCTATCG CAACTTCTGC AACAAGCTGT GGAACGCGAC GCGCTTCGTG CTGATGAACT GCGAAGGCCA CGACTGCGGC AACGACAAGC CGGAAGTGTG CGGCGCGGGC GACTGCGGCC CCGGCGGCTA CCTCGACTTC TCGGCGGCCG ACCGCTGGAT CGTGTCGCTC TTGCAGCGCA CCGAAGCCGA CATCGCGAAG GGTTTCGCCG ATTACCGCTT CGACAACATC GCGAGCAGTA TCTACAAGTT CGTGTGGGAC GAGTATTGCG ACTGGTATCT CGAACTCGCG AAGGTGCAGA TCCAGAACGG CACGCCGGAA CAGCAGCGCG CCACCCGCCG CACGCTGCTG CGCGTGCTGG AAACGGTGCT GCGCCTCGCC CATCCGGTCA TCCCGTTCAT CACCGAAGCG CTCTGGCAGA AGGTCGCGCC GCTCGCCGGC CGCTATCCGC AGGGCAAGGC CGAGGGCGAA GCATCGCTGA TGACGCAGCC GTACCCGGTC GCGAACCTGC AGAAGCTCGA CGAGGCGTCC GAGCAGTGGG CGGCCGACCT GAAGGCGATC GTCGACGCGT GCCGCAACCT TCGCGGCGAG ATGAACCTGT CCCCGGCGAC CAAGGTGCCG CTGCTGGCGG CCGGCGACGC CGAACGCCTG CGCTCGTTCG CGCCGTACGT CCAGGCGCTC GCGCGCCTGT CGGAAGTGCA GATCCTCGCG GACGAGGCGG CGCTCGACAA GGAGGCACAC GGCGCGCCGA TCGCGATCGT CGGCCCGAAC AAGCTGGTGC TGAAGGTGGA AATCGACGTC GCGGCCGAAC GCGAGCGCCT GTCGAAGGAA ATCACGCGCC TGACCGGCGA GATCGCGAAG TGCAACGCGA AGCTCGGCAA CGAGGCTTTC GTCGCGAAAG CACCGCCGGC CGTGGTCGAA CAGGAGCAGA AACGTGTCGC GGAATTCCAG AGCACGCTCG AAAAACTGCG TGCGCAGCTC GATCGGTTGC CTGCGTAA
|
Protein sequence | MSDNTLAKSF EPHTIESQWG PEWEKRGYAA PAFDPARPDF AIQLPPPNVT GTLHMGHAFN QTIMDGLARY HRMLGENTLW VPGTDHAGIA TQIVVERQLD AQGVSRHDLG REKFVERVWE WKQKSGSTIT DQVRRLGAST DWSREYFTMD DKMSAAVRDV FVTLYEQGLI YRGKRLVNWD PVLLTAVSDL EVVSEEENGH LWHIRYPLVD GSGSLTVATT RPETMLGDVA VMVHPEDERY AHLIGKLVTL PLTGREIPVI ADDYVDREFG TGVVKVTPAH DFNDYQVGLR HKLAPIEILT LDAKINDNGP EQYRGLDRFD ARKAIVADLD AQGFLDSVKP HKLMVPRGDR TGVVIEPLLT DQWFVAMTKP APEGTFHPGK SITEVSLDVV RDGQIKFVPE NWTTTYYQWL ENIQDWCISR QLWWGHQIPA WYGENGEVFV ARNEEDARAQ AAAKGYAGAL KRDEDVLDTW FSSALVPFSS LGWPNETPEL KHFLPSSVLV TGFDIIFFWV ARMVMMTTHF TGKVPFHTVY MHGLVRDAEG QKMSKSKGNT LDPIDIVDGI DLESLVAKRT TGLMNPKQAA TIEKKTRKEF PDGIPAFGTD ALRFTMASMA TLGRNVNFDL ARCEGYRNFC NKLWNATRFV LMNCEGHDCG NDKPEVCGAG DCGPGGYLDF SAADRWIVSL LQRTEADIAK GFADYRFDNI ASSIYKFVWD EYCDWYLELA KVQIQNGTPE QQRATRRTLL RVLETVLRLA HPVIPFITEA LWQKVAPLAG RYPQGKAEGE ASLMTQPYPV ANLQKLDEAS EQWAADLKAI VDACRNLRGE MNLSPATKVP LLAAGDAERL RSFAPYVQAL ARLSEVQILA DEAALDKEAH GAPIAIVGPN KLVLKVEIDV AAERERLSKE ITRLTGEIAK CNAKLGNEAF VAKAPPAVVE QEQKRVAEFQ STLEKLRAQL DRLPA
|
| |