Gene Daro_3085 details

Gene Information

Locus tag: Daro_3085
Symbol: valS
ID: 3566514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3324199
End bp: 3327030
Gene Length: 2832 bp
Protein Length: 943 aa
Translation table: 11
GC content: 60%
IMG OID: 637681556
Product: valyl-tRNA synthetase
Protein accession: YP_286285
Protein GI: 71908698
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
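
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3324199
end_bp = 3327030

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed gene length of 2832 bp and protein length of 943 aa.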


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.405108
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTCG CCAAAGCCTT TGAACCAGCC GATATCGAAC GCCGCTGGTA TCCCGAGTGG 
GAAACCCAGA ATTATTTCGC CGCCGGCGTA GACGCCAGCA AGGCCGACAA TTTCTGCATC
CTGCTGCCAC CGCCGAATGT TACCGGCACG CTGCACATGG GTCATGGCTT CAACCAGACG
ATCATGGACG CGCTGACCCG CTATTACCGG ATGCGCGGCC ACAACACGCT GTGGCAACCC
GGTACCGACC ACGCCGGTAT CGCGACGCAG ATCGTCGTCG AACGCCAGCT GGACGCCCAG
GGCATTTCGC GCCACGATCT CGGCCGCGAG AAGTTCCTGG AAAAGGTCTG GGAATGGAAA
GAATACTCCG GCAACACCAT CACCAAGCAG ATGCGCCGCA TGGGCACCAG CCCGGACTGG
AAGCGCGAGC GCTTCACGAT GGATGCCGGT CTCAACAAGG TTGTCACCGA AACCTTCGTC
CGCCTGTTTA ATGAAGGCCT GATCTACCGT GGCAAGCGCC TGGTGAACTG GGACCCGAAG
CTGAATACTG CCGTTTCCGA CCTCGAAGTG GTGCAGGAGG AAGAAGACGG CTTCATGTGG
CATATCCGCT ATCCGCTGGC TGATGGCAGT GATAGCCTGG TGGTCGCCAC GACACGGCCG
GAAACCATGC TCGGCGACAC CGCCGTGATG GTGCATCCGG AAGACGAGCG TTACAAGCAC
ATGATCGGCC AGATGGTGAA GCTGCCGCTG ACCGATAGAG AAATACCGAT CATCGCTGAC
AGCTACGTCG ATCTCGAATT CGGCACCGGT TGCGTCAAGG TCACGCCGGC GCACGACTTC
AACGACTACG CCGTCGGCCA GCGCCACGGC CTGCCGATGA TTTCCATTCT GACGCTGGAT
GCCAAGGTCA ACGAGAACGC CCCCGAGAAA TACCGCGGCC TGGATCGTTT CGATGCCCGC
AAGGCTGTCG TCGCCGACCT CGAAGCACTC GGCATCCTGG AAAAAACCGA CAAGCACAAG
CTCAAGGTGC CGCGCGGTGA CCGGACCAAT GTCGTGATCG AGCCGATGCT GACCGACCAG
TGGTTCGTCG CGATGAGCAA GCCGGGCGAC GACGGCAAGT CGATCACCGA GAAGGCACTC
GACGTCGTCC ATTCCGGCGA GATCAAGTTC TATCCGGAAA ACTGGGTCAA TACCTACAAC
CAGTGGCTGA ACAACATCCA GGACTGGTGC ATCTCCCGCC AGCTGTGGTG GGGCCACCAG
ATTCCGGCGT GGTACGGTGA TAATGGCCAG ATTTTCGTTG CCCACAGCGA AGCCGAAGCC
AAGGCCGAAG CTGCCAAGCA GGGTTACACC GGCACTCTCA AGCGTGACGA AGACGTTCTC
GACACCTGGT TCTCTTCAGC CTTGTGGCCG TTCTCGACAC TGGACTGGAC GGGCGATGAG
GCGATCGATG CCGCCAACCC GCTGCTCAAG CAATACCTGC CCTCCTCGGT GCTGGTCACC
GGTTTCGACA TCATCTTTTT CTGGGTCGCC CGCATGGTCA TGATGACCAA GCAGATCACT
GGCCAGATTC CGTTCAAGCA CGTTTATGTG CACGGCCTGA TCCGTGATGG CGAAGGCCAG
AAGATGTCCA AGTCCAAGGG CAACGTGCTC GATCCGATCG ACCTGATCGA TGGCATCGGC
CTCGAAGCGC TGATCGAGAA GCGTACAACC GGCCTGATGA ACCCAAAGCA GGCGGAAAGC
ATCGCCAAGA AGACGAAGAA GGAGTTCCCG GAAGGCATCG CCTCGTTCGG TACCGATGCG
CTGCGCTTCA CCTTCGCCAG CCTCGCCTCG CCCGGCCGCG ACATCAAGTT CGACCTCAAC
CGCTGCGACG GCTACCGCAA CTTCTGCAAC AAGCTGTGGA ACGCCACGCG CTTCGTACTG
ATGAACGTCG AAGGCCACGA TCTGGCCCTC GAACACCAAC AGAACGGCCC AGCCTGTGGC
GGTTCGGCTC CGCTCGAATT CTCCTTCGCC GACCGCTGGA TCGTCAGCCA GTTGCAACGC
GTCGAGCAGG AAGTCGAACA GCACTTCACC GACTACCGTT TCGACCTGAT CGCTCAGGCC
ATCTACAAGT TCATCTGGGA CGAGTTCTGC GACTGGTATC TGGAAATCGC CAAGGTCGAG
ATCCAGACCG GCAACGACGC CCAGCAGCGC GGCGCCCGCC GGACGCTGGT GCGTACGCTG
GAAGCTGTGC TGCGTCTGGC TCACCCGCTG ATTCCGTTCA TCACCGAAGA ACTGTGGCAA
ACCGTCGCCC CGATCGCCGG CCGCAAGACG CACGACTCGA TCATGCTGGC CGCCTACCCG
CGTGCCGAGG AATACAAGAT CGACGCCGCT TCGGAAGCAA AAGTCGAGCG CCTGAAGGCC
CTGGCCTATG CCTGCCGCAA CCTGCGCGGC GAGATGAACG TCTCCCCGGC CCTGCGCATG
CCCCTGCTGG TTGCTGGTGG CGGCGCTGAA ATTTCCGAGT TCGCAGCCAT CCTGCAAGCT
CTGGGTAAGC TCTCCGAGGT ACAAATCGTA GACGACATGC CGGCCGATGC GATGGCGCCG
GTAGCCGTGG TCGGTGAAAC CCGACTGATG CTGAAGGTGG AGATCGACGT TGCCGCCGAA
CGCATTCGTC TGGCCAAGGA AATCGAAAAG CTGGAAAAGC AGATTTCGAT TGCCCAAGGC
AAGCTGGCCA ACGAAGGTTT CGTCGCCCGC GCCCCGGCAG CGGTCATCGA TCAGGAAAAG
CAGCGCGTCG CCGATTTCAC GGCAACGCTG GAACAACTTA AACCACAACT GGCCAAGCTC
GGCCAGGCAT AG
 
Protein sequence
MELAKAFEPA DIERRWYPEW ETQNYFAAGV DASKADNFCI LLPPPNVTGT LHMGHGFNQT 
IMDALTRYYR MRGHNTLWQP GTDHAGIATQ IVVERQLDAQ GISRHDLGRE KFLEKVWEWK
EYSGNTITKQ MRRMGTSPDW KRERFTMDAG LNKVVTETFV RLFNEGLIYR GKRLVNWDPK
LNTAVSDLEV VQEEEDGFMW HIRYPLADGS DSLVVATTRP ETMLGDTAVM VHPEDERYKH
MIGQMVKLPL TDREIPIIAD SYVDLEFGTG CVKVTPAHDF NDYAVGQRHG LPMISILTLD
AKVNENAPEK YRGLDRFDAR KAVVADLEAL GILEKTDKHK LKVPRGDRTN VVIEPMLTDQ
WFVAMSKPGD DGKSITEKAL DVVHSGEIKF YPENWVNTYN QWLNNIQDWC ISRQLWWGHQ
IPAWYGDNGQ IFVAHSEAEA KAEAAKQGYT GTLKRDEDVL DTWFSSALWP FSTLDWTGDE
AIDAANPLLK QYLPSSVLVT GFDIIFFWVA RMVMMTKQIT GQIPFKHVYV HGLIRDGEGQ
KMSKSKGNVL DPIDLIDGIG LEALIEKRTT GLMNPKQAES IAKKTKKEFP EGIASFGTDA
LRFTFASLAS PGRDIKFDLN RCDGYRNFCN KLWNATRFVL MNVEGHDLAL EHQQNGPACG
GSAPLEFSFA DRWIVSQLQR VEQEVEQHFT DYRFDLIAQA IYKFIWDEFC DWYLEIAKVE
IQTGNDAQQR GARRTLVRTL EAVLRLAHPL IPFITEELWQ TVAPIAGRKT HDSIMLAAYP
RAEEYKIDAA SEAKVERLKA LAYACRNLRG EMNVSPALRM PLLVAGGGAE ISEFAAILQA
LGKLSEVQIV DDMPADAMAP VAVVGETRLM LKVEIDVAAE RIRLAKEIEK LEKQISIAQG
KLANEGFVAR APAAVIDQEK QRVADFTATL EQLKPQLAKL GQA
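
As a spot check that the protein sequence corresponds to the gene sequence, the first and last blocks of the gene sequence can be translated by hand or with a few lines of code. This is a minimal sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and the helper name `translate` is hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the standard codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and final 12 bases of the gene sequence, copied from the record.
first_block = "ATGGAACTCG CCAAAGCCTT TGAACCAGCC"
last_block = "GGCCAGGCAT AG"

print(translate(first_block))  # compare with the start of the protein sequence
print(translate(last_block))   # compare with the end of the protein sequence
```

The final block is in frame because the 2820 bases preceding it are a multiple of three, so its last codon is the stop codon that follows the final protein residues.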