Gene BBta_3947 details

Gene Information

Locus tag: BBta_3947
Symbol: valS
ID: 5151058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 4142419
End bp: 4145286
Gene Length: 2868 bp
Protein Length: 955 aa
Translation table: 11
GC content: 66%
IMG OID: 640558782
Product: valyl-tRNA synthetase
Protein accession: YP_001239923
Protein GI: 148255338
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
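
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions (the record does not state its coordinate convention) and that the unspliced CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4142419
end_bp = 4145286

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the result agrees with the listed Gene Length (2868 bp) and Protein Length (955 aa).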


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.14713
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACA AGAATTATCA GCCCGCCGAC CTCGAAGGCC GCATGTCCCT GGTCTGGGAG 
GATGCGCGCG CATTCTCCGC CGGCCGTCCC GATCGCCGGG ATGCCGATCC CTTTACGATC
GTGATCCCGC CGCCCAACGT GACCGGCTCG CTGCACATGG GGCACGCACT CAACAATACG
CTGCAGGACA TCCTGTGCCG GTTCGAGCGG ATGCGCGGCC GGGACGTGCT GTGGCAGCCG
GGCACCGACC ATGCGGGCAT CGCCACCCAG ATGGTGGTCG AGCGCCAGCT GATGGAACGC
CAGCAGCCGG GCCGCCGCGA GATGGGTCGC GAGAAGTTCC TCGCACGCGT GTGGCAGTGG
AAGGACGAGA GCGGCGGGAC CATCATCAAC CAGCTCAAGC GCCTCGGCGC CTCCTGCGAT
TGGGGCCGCG AGCGCTTTAC CATGGACGAG GGCCTGTCGA AGGCCGTCGT GAAGGTGTTC
GTCGAGCTGC ACCGCGCCGG CCTCATCTAC AAGGACAAGC GGCTGGTCAA CTGGGACCCG
AAGCTGCTGA CGGCGATCTC GGATCTCGAG GTGCAGCAGA TCGAGGTCAA GGGCAACCTC
TGGTACCTGC GCTATCCGAT CGAGGGCAAA ACCTTCAACC CCGAGGATCC GACCAGTTTC
ATCGTCGTGG CCACCACGCG TCCCGAGACG ATGCTCGGCG ACAGCGCGGT TGCCGTGCAC
CCGGATGACG AGCGCTATCA GCATCTCGTC GGCAAGCGCG TGATCCTGCC GCTGGTCGGC
CGCGAGATCC CGATCGTCGC CGACACCTAT TCCGATCCGG AGAAGGGCAC TGGCGCGGTC
AAGATCACGC CGGCGCATGA CTTCAACGAT TTCGAGGTCG GCCGCCGCCA CGGCCTGCCG
CAGATCAGCA TTCTCGACCG CGAGGGCTGC ATCGCCCTCG TCGACAACGA GGACTATCTG
CGCGGGCTGC CCGAGGGCTC GGAGGAGTTT GCGCGGGAGT TCCACGGCGT CGAGCGCTTC
GCCGCGCGCA AGAAGATCCT GGAGCGGCTG GAGACCTTCG GCTTCCTCGA GCGCGTCGAG
CCCAACACCC ACATGGTGCC GCATGGCGAC CGCTCCGGCG TCGTCATCGA GCCGTACCTC
ACCGACCAGT GGTATGTCGA CGCCAAGACG CTGGCAAAGC CGGCGATTGC CGCCGTCAAG
TCAGGCGCGA CGACGTTCGT GCCGCGCAAC TGGGAGAAGA CCTATTTCGA GTGGATGGAG
AACATCCAGC CCTGGTGCAT CTCGCGCCAG CTGTGGTGGG GCCATCAGAT CCCGGCCTGG
TACGGTCCCG ACGGCAAGGT GTTCGTCGCT GAGACCGAGG AGGAGGCCGT CAGCCACGCC
ATTGCGTACT ATGTCGAGCA GGAGGTCATC ACGGCCGAGC AGGGCCGCGC AATGGCCCTC
GACCGCAACA AGCGCGCGGG CTTCATCACA CGTGATGAAG ATGTTCTCGA CACCTGGTTC
TCCTCGGCGC TGTGGCCGTT CTCGACGCTT GGCTGGCCGG ACGATACGCC GGAGGTGAAG
CGCTACTACC CGACCAACGT GCTGGTCACC GGCTTCGACA TCATCTTCTT CTGGGTCGCC
CGCATGATGA TGATGGGCCT GCACTTCATG AAGGAGGTGC CGTTCTCGAC TGTCTACATC
CACGCGCTCG TCCGTGACGA GAAGGGCGCC AAGATGTCGA AGTCGAAGGG CAACGTCATC
GATCCCTTGG CACTGATCGA CGAGCACGGC GCCGACGCGC TGCGCTTCAC GCTCGCGGCG
ATGGCCGCCC AAGGCCGCGA CATCAAGCTG TCGACCCAGC GCGTCGACGG CTACCGCAAG
TTCGCCAACA AGCTCTGGAA CGCCAGCCGC TTTGCCGAGA TGAACGGCTG CACGGTGCCG
GTGGGCTTCG ATCCGGCCTC CGCCAAGGAG ACGCTCAACC GCTGGATCGC CCATGAGGCC
GCCGCTGCCA CGCGCGAGGT CACCGAGGCC CTGGAGGCCT ATCGCTTCAA CGATGCGGCG
AACGCCATCT ACCGCTTTGT CTGGGACGTC TATTGCGACT GGTATGTCGA GCTCGCCAAG
CCGACTTTGC TCGGCGAGGA CAGCCCGGCC AAGGCCGAGA CCCGCGCCAT GGTCGCCTGG
GCGCGCGACG AGATCCTCAA GCTGCTGCAT CCCTTCATGC CTTTCATCAC CGAGGCGCTG
TGGGAATCGA CCGCCAAGCG CGACGGCCTG CTGACCCTGT CGCCCTGGCC GCATCGCACG
GACGTGCCGA CCGTCGCCGA GATTGCCGCG CTGTCGGCCG CGAGCTTCGG CGATCCGCTG
GTGCCGCCGG TCCTGGTCGC GTTCAACGCC GCCAGCTTCA AGGACGAGGC GGCCGAGGCC
GAGATCGGCT GGGTGGTCGA TCTCGTCACG GCGATCCGCT CGGTCCGGGC CGAGATGAAC
ATCCCGCCCG CGACCCTGAT GCCGCTGGCG CTGGTCGGTG CGTCCGCCGA GACGAAAGAG
CGCGCGCCGC GCTGGAGCGA CGTGATCAAA CGGCTTGCGC GCCTTTCGGA CATCTCGTTC
GCCGACACCG CGCCTGAGGG GGCTGCTCAA CTCGTGGTGC GTGGCGAGGT GGCGGCCCTG
CCGCTCAAGG GCGTCGTCGA TCTCGCCGCC GAGCGCACCC GCTTGGCCAA GGAAATCGCC
AAGGCCGACG CCGATATCGA GCGGGTCGAC AAGAAGCTTG GCAACGAGAA GTTCGTCGCC
AACGCGCCGG AAGAGCTCGT CGAGGAAGAG AAGGAAAAGC GCGAGGCGGC GCTGGCCCGC
AAGGCCAAGT TCCAGGACGC GCTGTCGCGG CTGCAGGGCG TGGGGTAG
 
Protein sequence
MIDKNYQPAD LEGRMSLVWE DARAFSAGRP DRRDADPFTI VIPPPNVTGS LHMGHALNNT 
LQDILCRFER MRGRDVLWQP GTDHAGIATQ MVVERQLMER QQPGRREMGR EKFLARVWQW
KDESGGTIIN QLKRLGASCD WGRERFTMDE GLSKAVVKVF VELHRAGLIY KDKRLVNWDP
KLLTAISDLE VQQIEVKGNL WYLRYPIEGK TFNPEDPTSF IVVATTRPET MLGDSAVAVH
PDDERYQHLV GKRVILPLVG REIPIVADTY SDPEKGTGAV KITPAHDFND FEVGRRHGLP
QISILDREGC IALVDNEDYL RGLPEGSEEF AREFHGVERF AARKKILERL ETFGFLERVE
PNTHMVPHGD RSGVVIEPYL TDQWYVDAKT LAKPAIAAVK SGATTFVPRN WEKTYFEWME
NIQPWCISRQ LWWGHQIPAW YGPDGKVFVA ETEEEAVSHA IAYYVEQEVI TAEQGRAMAL
DRNKRAGFIT RDEDVLDTWF SSALWPFSTL GWPDDTPEVK RYYPTNVLVT GFDIIFFWVA
RMMMMGLHFM KEVPFSTVYI HALVRDEKGA KMSKSKGNVI DPLALIDEHG ADALRFTLAA
MAAQGRDIKL STQRVDGYRK FANKLWNASR FAEMNGCTVP VGFDPASAKE TLNRWIAHEA
AAATREVTEA LEAYRFNDAA NAIYRFVWDV YCDWYVELAK PTLLGEDSPA KAETRAMVAW
ARDEILKLLH PFMPFITEAL WESTAKRDGL LTLSPWPHRT DVPTVAEIAA LSAASFGDPL
VPPVLVAFNA ASFKDEAAEA EIGWVVDLVT AIRSVRAEMN IPPATLMPLA LVGASAETKE
RAPRWSDVIK RLARLSDISF ADTAPEGAAQ LVVRGEVAAL PLKGVVDLAA ERTRLAKEIA
KADADIERVD KKLGNEKFVA NAPEELVEEE KEKREAALAR KAKFQDALSR LQGVG
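
Both sequences are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below is one possible way to strip that layout, compute a GC fraction, and translate the gene. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. It uses the standard codon assignments, which translation table 11 shares for elongation. Table 11 also permits alternative start codons, which this sketch does not handle. That does not matter for this gene, which begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listing above.
first_line = "ATGATCGACA AGAATTATCA GCCCGCCGAC CTCGAAGGCC GCATGTCCCT GGTCTGGGAG"
print(translate(first_line))  # prints MIDKNYQPADLEGRMSLVWE
```

The printed residues match the first twenty residues of the protein sequence listing. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.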