Gene Dshi_2087 details

Gene Information

Locus tag: Dshi_2087
Symbol: valS
ID: 5713082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2209300
End bp: 2212413
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 66%
IMG OID: 641268009
Product: valyl-tRNA synthetase
Protein accession: YP_001533425
Protein GI: 159044631
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
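
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 2209300
END_BP = 2212413
REPORTED_GENE_LENGTH = 3114     # bp
REPORTED_PROTEIN_LENGTH = 1037  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the span 2209300 to 2212413 covers 3114 bp, which is 1038 codons, or 1037 residues once the stop codon is excluded.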


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATGG AAAAAACCTT CAACGCCGCC GAGGCCGAAC CCCGTCTCTA TGCCCGGTGG 
GAGGCGGAGG GATGCTTTGC GGCCGGCGCG AACGCATCGC GGGACGAGAC CTTCTGCGTG
ATGATCCCGC CGCCCAACGT CACGGGCTCT CTGCATATGG GGCACGCGTT CAACAACACG
CTCCAGGACA TCCTGATCCG CTGGAAACGG ATGCAGGGCT ACGACACGCT CTGGCAGCCG
GGCACGGACC ACGCGGGGAT CGCCACGCAG ATGGTGACCG AGCGCGAAAT GGCCGCGAAC
GGCGAGCCGA CCCGCGCCGA GATGGGCCGC GCGAAATTCC TCGCCCGCGT CTGGGAACAG
AAGGTCAAAT CCCGCGGCAC CATCATCGGC CAGCTCAAGC GCATCGGCGC CTCCTGCGAC
TGGTCGCGCG AAGCCTTCAC CATGGGCGGT GCCGACGGCG ACCCCGAGAA GGGCAACGGC
CCGAATTTCC ACGACGCGGT CATCAAGGTC TTCGTCGACA TGTACGACAA GGGCCTGATC
TACCGCGGCA AGCGGCTGGT GAACTGGGAC CCGCATTTCG AGACCGCGAT CTCCGATCTC
GAGGTCGAGA ATATCGAGAC CCCGGGTCAT ATGTGGCACT TCAAATACCC GCTCGCAGGC
GGCGAGACCT ACGAATATGT CGAGAAGGAC GCGGACGGCA CCGTCACCCT GCGCGAGACC
CGCGACTACA TCTCCATCGC CACGACCCGG CCCGAAACCA TGCTGGGCGA CGGCGCGGTC
GCGGTGCACC CATCGGATGA ACGTTACGCG CCAATCGTCG GAAAGCTCTG TGAAATCCCT
GTCGGACCCA AGGAACACCG CCGCCTGATC CCGATCATCA CCGATGAATA CCCGGACCCG
ACCTTCGGCT CGGGCGCGGT GAAGATCACC GGCGCGCATG ACTTCAACGA CTACCAGGTC
GCCAAGCGCG GCGGCATCCC GATGTACCGG CTGATGGACA CCAAGGCCCG GATGCGCGAT
GACGGCGCCC CCTATGCCAA GGCCGCCGCC ATCGCCATGG AAGTCGCCGA AGGCACCCGC
ACGCTCGACG AGGCCGAGGC CGACAGCCTC AACCTGGTGC CCGACGATCT GCGCGGACTC
GACCGGTTCG AAGCCCGCAA GGCGGTGGTT GACCAGATCA CCGCCGAAGG CCTCGCCGTC
ATGGTCCCCA ACCCCGCCGC AGCCGCATCC ACCGAGGAGG GCGCGGCCGC GACCGAGGAG
GTCCCGGCCT TCCTCCCCCT GGTCGAGTCC AAACCGATCA TGCAGCCCTT CGGCGACCGT
TCGAAAGTTG TGATCGAACC GATGCTCACC GACCAGTGGT TCGTGGATGC CGAGCAGATC
GTCGGCCCCG CGCTCGACGC CGTGCGCAAT GGCACCGTCA AGATCCTGCC CGAAAGCGGG
GAAAAAACCT ATTACCACTG GCTCGACAAC ATCGAGCCCT GGTGCATCTC CCGCCAGCTG
TGGTGGGGCC ACCAGATCCC GGTCTGGTAC GGTCCCCGCC GGGTCGAGGT GAACGGGGTC
GAAACCCTCG ATTTCGATCC CGCCAATGCC GTGCATTTCG TCGCCCACTC GGTGGACGAA
GCCCGGGCCA AGGCCGCGGG CTACTACGCC CTGCCCGACG CCGACAAGGT GATCATCGTG
CGCTCCTTCC CGCGCGGCAC CCCGGGCAGC GGCCCCACCG ATGGCCGGGT CGATCCGATG
ACCGACGCGG TGGCGGCGGC CCAACGGGCC GAAGCGGTGC CCGACGCGAT CCCGTTGGTG
CAGGACCCGG ATGTGCTCGA CACCTGGTTC TCCTCGGGGC TCTGGCCCAT CGGCACGCTG
GGCTGGCCCG AGGACACCGA GGAGCTGCGC AAGTATTTCC CCACCTCCAC TCTCGTGACC
GGCCAGGACA TCCTGTTCTT CTGGGTCGCG CGGATGATGA TGATGCAACT GGCCGTCACC
GGCGAGGTGC CCTTCCGCGA GGTCTACCTG CACGGCCTCG TGCGCGACGC CAAGGGCAAG
AAGATGTCCA AATCCGTGGG CAACGTGGTC GACCCGCTGG AGATCATCGA CGAGTACGGT
GCGGATGCGC TGCGCTTCTC CTCGGCGGCC ATGGCCAGCC TGGGCGGCGT GTTGAAACTC
GACCTCCAAC GGGTGCAGGG CTACCGCAAT TTCGGCACCA AGCTGTGGAA CGCCACTCGC
TTTGCCGAGA TGAACGAGGT CTTCACCGCC CACACCCAAT CCGCCATGCC CCCCGGCTGC
ACCGAAACCG TGAACCGCTG GATCATCGGC GAGACCGCGA AAGTGCGCGA GGCGGTCGAT
ACCGCCCTGG CCGAGTACAA GTTCAACGAC GCCGCCAATG CGCTTTATGC CTTCGTCTGG
GGCAAGGTCT GCGACTGGTA CGTGGAATTC GCCAAACCGC TCCTGCTCGA TGGCGATGAC
GCCACGAAGG CGGAAACCCG CGCCGTCATG GCCTGGGTGC TGGACCAATG CTTCATCCTG
CTGCACCCGA TCATGCCCTT CATCACCGAA GAACTCTGGG GCACAACCGG ACAGCGCGAC
AAGATGCTCG TGCATGCGGA CTGGCCGAGT TACGGGGCCG ACCTGGTCGA CGCCGACGCC
GACCGCGAAA TGAACTGGGT GATCTCGCTG ATCGAAAGCG TGCGCTCCGT GCGCGCCCAG
ATGCGGGTGC CCGCGGGGCT CTACGTGCCC GTGGTGCAGG TCGCGCTCGA TGAGGCCGGA
CAGCGCGCCT ATGCCAATAA CGAAACCCTG ATCAAGCGGC TCGCCCGGAT CGAAGGCATC
ACCAAGGCAG ACACGGCTCC CAAGGGCGCG CTGACCATCC CGGTCGAAGG CGGCACCTTC
GCCCTGCCTT TGGCGGACAT CATCGACGTC AGCGCCGAAA AGGACCGGCT GGGCAAGACC
CTCGCCAAGC TCCAGAAAGA CCTGGGCGGT CTCAGGGGGC GGCTGTCGAA CGCGAAGTTC
GTGGCCTCCG CCCCGGCCGA AGTGGTCGAG GAAAACCGCG AACGCCTCGC GGCTGGCGAA
GCCGAGCTTG CCACCCTCAG CGCCGCGCTG GAGCGTCTCG AAGAAGTCGG ATAA
 
Protein sequence
MPMEKTFNAA EAEPRLYARW EAEGCFAAGA NASRDETFCV MIPPPNVTGS LHMGHAFNNT 
LQDILIRWKR MQGYDTLWQP GTDHAGIATQ MVTEREMAAN GEPTRAEMGR AKFLARVWEQ
KVKSRGTIIG QLKRIGASCD WSREAFTMGG ADGDPEKGNG PNFHDAVIKV FVDMYDKGLI
YRGKRLVNWD PHFETAISDL EVENIETPGH MWHFKYPLAG GETYEYVEKD ADGTVTLRET
RDYISIATTR PETMLGDGAV AVHPSDERYA PIVGKLCEIP VGPKEHRRLI PIITDEYPDP
TFGSGAVKIT GAHDFNDYQV AKRGGIPMYR LMDTKARMRD DGAPYAKAAA IAMEVAEGTR
TLDEAEADSL NLVPDDLRGL DRFEARKAVV DQITAEGLAV MVPNPAAAAS TEEGAAATEE
VPAFLPLVES KPIMQPFGDR SKVVIEPMLT DQWFVDAEQI VGPALDAVRN GTVKILPESG
EKTYYHWLDN IEPWCISRQL WWGHQIPVWY GPRRVEVNGV ETLDFDPANA VHFVAHSVDE
ARAKAAGYYA LPDADKVIIV RSFPRGTPGS GPTDGRVDPM TDAVAAAQRA EAVPDAIPLV
QDPDVLDTWF SSGLWPIGTL GWPEDTEELR KYFPTSTLVT GQDILFFWVA RMMMMQLAVT
GEVPFREVYL HGLVRDAKGK KMSKSVGNVV DPLEIIDEYG ADALRFSSAA MASLGGVLKL
DLQRVQGYRN FGTKLWNATR FAEMNEVFTA HTQSAMPPGC TETVNRWIIG ETAKVREAVD
TALAEYKFND AANALYAFVW GKVCDWYVEF AKPLLLDGDD ATKAETRAVM AWVLDQCFIL
LHPIMPFITE ELWGTTGQRD KMLVHADWPS YGADLVDADA DREMNWVISL IESVRSVRAQ
MRVPAGLYVP VVQVALDEAG QRAYANNETL IKRLARIEGI TKADTAPKGA LTIPVEGGTF
ALPLADIIDV SAEKDRLGKT LAKLQKDLGG LRGRLSNAKF VASAPAEVVE ENRERLAAGE
AELATLSAAL ERLEEVG
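
Both sequences are laid out in blocks of ten characters, so they need to be stripped of whitespace before any analysis. The following is a minimal sketch of that clean-up step together with a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced here for illustration. Only the first line of the gene sequence is used as input. Applying the same functions to the full gene sequence should give a value comparable to the reported GC content, though that has not been verified here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as formatted in the record.
first_line = "ATGCCCATGG AAAAAACCTT CAACGCCGCC GAGGCCGAAC CCCGTCTCTA TGCCCGGTGG"

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases after clean-up
print(round(gc_fraction(seq), 3))  # GC fraction of this fragment only
```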