Gene Information |
Locus tag | BBta_3947 |
Symbol | valS |
ID | 5151058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4142419 |
End bp | 4145286 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640558782 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001239923 |
Protein GI | 148255338 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
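The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
start_bp, end_bp = 4142419, 4145286

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2868, matching "Gene Length"
print(protein_length_aa(length))  # 955, matching "Protein Length"
```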
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.14713 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACA AGAATTATCA GCCCGCCGAC CTCGAAGGCC GCATGTCCCT GGTCTGGGAG GATGCGCGCG CATTCTCCGC CGGCCGTCCC GATCGCCGGG ATGCCGATCC CTTTACGATC GTGATCCCGC CGCCCAACGT GACCGGCTCG CTGCACATGG GGCACGCACT CAACAATACG CTGCAGGACA TCCTGTGCCG GTTCGAGCGG ATGCGCGGCC GGGACGTGCT GTGGCAGCCG GGCACCGACC ATGCGGGCAT CGCCACCCAG ATGGTGGTCG AGCGCCAGCT GATGGAACGC CAGCAGCCGG GCCGCCGCGA GATGGGTCGC GAGAAGTTCC TCGCACGCGT GTGGCAGTGG AAGGACGAGA GCGGCGGGAC CATCATCAAC CAGCTCAAGC GCCTCGGCGC CTCCTGCGAT TGGGGCCGCG AGCGCTTTAC CATGGACGAG GGCCTGTCGA AGGCCGTCGT GAAGGTGTTC GTCGAGCTGC ACCGCGCCGG CCTCATCTAC AAGGACAAGC GGCTGGTCAA CTGGGACCCG AAGCTGCTGA CGGCGATCTC GGATCTCGAG GTGCAGCAGA TCGAGGTCAA GGGCAACCTC TGGTACCTGC GCTATCCGAT CGAGGGCAAA ACCTTCAACC CCGAGGATCC GACCAGTTTC ATCGTCGTGG CCACCACGCG TCCCGAGACG ATGCTCGGCG ACAGCGCGGT TGCCGTGCAC CCGGATGACG AGCGCTATCA GCATCTCGTC GGCAAGCGCG TGATCCTGCC GCTGGTCGGC CGCGAGATCC CGATCGTCGC CGACACCTAT TCCGATCCGG AGAAGGGCAC TGGCGCGGTC AAGATCACGC CGGCGCATGA CTTCAACGAT TTCGAGGTCG GCCGCCGCCA CGGCCTGCCG CAGATCAGCA TTCTCGACCG CGAGGGCTGC ATCGCCCTCG TCGACAACGA GGACTATCTG CGCGGGCTGC CCGAGGGCTC GGAGGAGTTT GCGCGGGAGT TCCACGGCGT CGAGCGCTTC GCCGCGCGCA AGAAGATCCT GGAGCGGCTG GAGACCTTCG GCTTCCTCGA GCGCGTCGAG CCCAACACCC ACATGGTGCC GCATGGCGAC CGCTCCGGCG TCGTCATCGA GCCGTACCTC ACCGACCAGT GGTATGTCGA CGCCAAGACG CTGGCAAAGC CGGCGATTGC CGCCGTCAAG TCAGGCGCGA CGACGTTCGT GCCGCGCAAC TGGGAGAAGA CCTATTTCGA GTGGATGGAG AACATCCAGC CCTGGTGCAT CTCGCGCCAG CTGTGGTGGG GCCATCAGAT CCCGGCCTGG TACGGTCCCG ACGGCAAGGT GTTCGTCGCT GAGACCGAGG AGGAGGCCGT CAGCCACGCC ATTGCGTACT ATGTCGAGCA GGAGGTCATC ACGGCCGAGC AGGGCCGCGC AATGGCCCTC GACCGCAACA AGCGCGCGGG CTTCATCACA CGTGATGAAG ATGTTCTCGA CACCTGGTTC TCCTCGGCGC TGTGGCCGTT CTCGACGCTT GGCTGGCCGG ACGATACGCC GGAGGTGAAG CGCTACTACC CGACCAACGT GCTGGTCACC GGCTTCGACA TCATCTTCTT CTGGGTCGCC CGCATGATGA TGATGGGCCT GCACTTCATG AAGGAGGTGC CGTTCTCGAC TGTCTACATC CACGCGCTCG TCCGTGACGA GAAGGGCGCC AAGATGTCGA AGTCGAAGGG CAACGTCATC GATCCCTTGG CACTGATCGA CGAGCACGGC GCCGACGCGC TGCGCTTCAC GCTCGCGGCG 
ATGGCCGCCC AAGGCCGCGA CATCAAGCTG TCGACCCAGC GCGTCGACGG CTACCGCAAG TTCGCCAACA AGCTCTGGAA CGCCAGCCGC TTTGCCGAGA TGAACGGCTG CACGGTGCCG GTGGGCTTCG ATCCGGCCTC CGCCAAGGAG ACGCTCAACC GCTGGATCGC CCATGAGGCC GCCGCTGCCA CGCGCGAGGT CACCGAGGCC CTGGAGGCCT ATCGCTTCAA CGATGCGGCG AACGCCATCT ACCGCTTTGT CTGGGACGTC TATTGCGACT GGTATGTCGA GCTCGCCAAG CCGACTTTGC TCGGCGAGGA CAGCCCGGCC AAGGCCGAGA CCCGCGCCAT GGTCGCCTGG GCGCGCGACG AGATCCTCAA GCTGCTGCAT CCCTTCATGC CTTTCATCAC CGAGGCGCTG TGGGAATCGA CCGCCAAGCG CGACGGCCTG CTGACCCTGT CGCCCTGGCC GCATCGCACG GACGTGCCGA CCGTCGCCGA GATTGCCGCG CTGTCGGCCG CGAGCTTCGG CGATCCGCTG GTGCCGCCGG TCCTGGTCGC GTTCAACGCC GCCAGCTTCA AGGACGAGGC GGCCGAGGCC GAGATCGGCT GGGTGGTCGA TCTCGTCACG GCGATCCGCT CGGTCCGGGC CGAGATGAAC ATCCCGCCCG CGACCCTGAT GCCGCTGGCG CTGGTCGGTG CGTCCGCCGA GACGAAAGAG CGCGCGCCGC GCTGGAGCGA CGTGATCAAA CGGCTTGCGC GCCTTTCGGA CATCTCGTTC GCCGACACCG CGCCTGAGGG GGCTGCTCAA CTCGTGGTGC GTGGCGAGGT GGCGGCCCTG CCGCTCAAGG GCGTCGTCGA TCTCGCCGCC GAGCGCACCC GCTTGGCCAA GGAAATCGCC AAGGCCGACG CCGATATCGA GCGGGTCGAC AAGAAGCTTG GCAACGAGAA GTTCGTCGCC AACGCGCCGG AAGAGCTCGT CGAGGAAGAG AAGGAAAAGC GCGAGGCGGC GCTGGCCCGC AAGGCCAAGT TCCAGGACGC GCTGTCGCGG CTGCAGGGCG TGGGGTAG
|
Protein sequence | MIDKNYQPAD LEGRMSLVWE DARAFSAGRP DRRDADPFTI VIPPPNVTGS LHMGHALNNT LQDILCRFER MRGRDVLWQP GTDHAGIATQ MVVERQLMER QQPGRREMGR EKFLARVWQW KDESGGTIIN QLKRLGASCD WGRERFTMDE GLSKAVVKVF VELHRAGLIY KDKRLVNWDP KLLTAISDLE VQQIEVKGNL WYLRYPIEGK TFNPEDPTSF IVVATTRPET MLGDSAVAVH PDDERYQHLV GKRVILPLVG REIPIVADTY SDPEKGTGAV KITPAHDFND FEVGRRHGLP QISILDREGC IALVDNEDYL RGLPEGSEEF AREFHGVERF AARKKILERL ETFGFLERVE PNTHMVPHGD RSGVVIEPYL TDQWYVDAKT LAKPAIAAVK SGATTFVPRN WEKTYFEWME NIQPWCISRQ LWWGHQIPAW YGPDGKVFVA ETEEEAVSHA IAYYVEQEVI TAEQGRAMAL DRNKRAGFIT RDEDVLDTWF SSALWPFSTL GWPDDTPEVK RYYPTNVLVT GFDIIFFWVA RMMMMGLHFM KEVPFSTVYI HALVRDEKGA KMSKSKGNVI DPLALIDEHG ADALRFTLAA MAAQGRDIKL STQRVDGYRK FANKLWNASR FAEMNGCTVP VGFDPASAKE TLNRWIAHEA AAATREVTEA LEAYRFNDAA NAIYRFVWDV YCDWYVELAK PTLLGEDSPA KAETRAMVAW ARDEILKLLH PFMPFITEAL WESTAKRDGL LTLSPWPHRT DVPTVAEIAA LSAASFGDPL VPPVLVAFNA ASFKDEAAEA EIGWVVDLVT AIRSVRAEMN IPPATLMPLA LVGASAETKE RAPRWSDVIK RLARLSDISF ADTAPEGAAQ LVVRGEVAAL PLKGVVDLAA ERTRLAKEIA KADADIERVD KKLGNEKFVA NAPEELVEEE KEKREAALAR KAKFQDALSR LQGVG
|
| |
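The sequences above are printed in space-separated blocks of ten characters, so they must be joined before any computation. The sketch below is illustrative only. It strips the spacing, computes a GC fraction, and translates the first ten codons of the gene sequence to compare them with the start of the protein sequence. The helper names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares for sense codons. Only the first 30 bases of the gene are used here, so the GC value printed is for that prefix and not the 66% reported for the whole gene.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence and first block of the protein.
gene_prefix = clean_sequence("ATGATCGACA AGAATTATCA GCCCGCCGAC")
protein_prefix = clean_sequence("MIDKNYQPAD")

print(gc_content(gene_prefix))                    # GC fraction of the prefix only
print(translate(gene_prefix) == protein_prefix)   # True if the prefixes agree
```

To check the full record, pass the complete gene sequence to `clean_sequence` and compare `translate(...)` (minus the trailing `*`) with the cleaned protein sequence.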