Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2087 |
Symbol | valS |
ID | 5713082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2209300 |
End bp | 2212413 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641268009 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001533425 |
Protein GI | 159044631 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATGG AAAAAACCTT CAACGCCGCC GAGGCCGAAC CCCGTCTCTA TGCCCGGTGG GAGGCGGAGG GATGCTTTGC GGCCGGCGCG AACGCATCGC GGGACGAGAC CTTCTGCGTG ATGATCCCGC CGCCCAACGT CACGGGCTCT CTGCATATGG GGCACGCGTT CAACAACACG CTCCAGGACA TCCTGATCCG CTGGAAACGG ATGCAGGGCT ACGACACGCT CTGGCAGCCG GGCACGGACC ACGCGGGGAT CGCCACGCAG ATGGTGACCG AGCGCGAAAT GGCCGCGAAC GGCGAGCCGA CCCGCGCCGA GATGGGCCGC GCGAAATTCC TCGCCCGCGT CTGGGAACAG AAGGTCAAAT CCCGCGGCAC CATCATCGGC CAGCTCAAGC GCATCGGCGC CTCCTGCGAC TGGTCGCGCG AAGCCTTCAC CATGGGCGGT GCCGACGGCG ACCCCGAGAA GGGCAACGGC CCGAATTTCC ACGACGCGGT CATCAAGGTC TTCGTCGACA TGTACGACAA GGGCCTGATC TACCGCGGCA AGCGGCTGGT GAACTGGGAC CCGCATTTCG AGACCGCGAT CTCCGATCTC GAGGTCGAGA ATATCGAGAC CCCGGGTCAT ATGTGGCACT TCAAATACCC GCTCGCAGGC GGCGAGACCT ACGAATATGT CGAGAAGGAC GCGGACGGCA CCGTCACCCT GCGCGAGACC CGCGACTACA TCTCCATCGC CACGACCCGG CCCGAAACCA TGCTGGGCGA CGGCGCGGTC GCGGTGCACC CATCGGATGA ACGTTACGCG CCAATCGTCG GAAAGCTCTG TGAAATCCCT GTCGGACCCA AGGAACACCG CCGCCTGATC CCGATCATCA CCGATGAATA CCCGGACCCG ACCTTCGGCT CGGGCGCGGT GAAGATCACC GGCGCGCATG ACTTCAACGA CTACCAGGTC GCCAAGCGCG GCGGCATCCC GATGTACCGG CTGATGGACA CCAAGGCCCG GATGCGCGAT GACGGCGCCC CCTATGCCAA GGCCGCCGCC ATCGCCATGG AAGTCGCCGA AGGCACCCGC ACGCTCGACG AGGCCGAGGC CGACAGCCTC AACCTGGTGC CCGACGATCT GCGCGGACTC GACCGGTTCG AAGCCCGCAA GGCGGTGGTT GACCAGATCA CCGCCGAAGG CCTCGCCGTC ATGGTCCCCA ACCCCGCCGC AGCCGCATCC ACCGAGGAGG GCGCGGCCGC GACCGAGGAG GTCCCGGCCT TCCTCCCCCT GGTCGAGTCC AAACCGATCA TGCAGCCCTT CGGCGACCGT TCGAAAGTTG TGATCGAACC GATGCTCACC GACCAGTGGT TCGTGGATGC CGAGCAGATC GTCGGCCCCG CGCTCGACGC CGTGCGCAAT GGCACCGTCA AGATCCTGCC CGAAAGCGGG GAAAAAACCT ATTACCACTG GCTCGACAAC ATCGAGCCCT GGTGCATCTC CCGCCAGCTG TGGTGGGGCC ACCAGATCCC GGTCTGGTAC GGTCCCCGCC GGGTCGAGGT GAACGGGGTC GAAACCCTCG ATTTCGATCC CGCCAATGCC GTGCATTTCG TCGCCCACTC GGTGGACGAA GCCCGGGCCA AGGCCGCGGG CTACTACGCC CTGCCCGACG CCGACAAGGT GATCATCGTG CGCTCCTTCC CGCGCGGCAC CCCGGGCAGC GGCCCCACCG ATGGCCGGGT CGATCCGATG ACCGACGCGG TGGCGGCGGC CCAACGGGCC GAAGCGGTGC CCGACGCGAT CCCGTTGGTG CAGGACCCGG ATGTGCTCGA CACCTGGTTC TCCTCGGGGC TCTGGCCCAT CGGCACGCTG GGCTGGCCCG AGGACACCGA GGAGCTGCGC AAGTATTTCC CCACCTCCAC TCTCGTGACC GGCCAGGACA TCCTGTTCTT CTGGGTCGCG CGGATGATGA TGATGCAACT GGCCGTCACC GGCGAGGTGC CCTTCCGCGA GGTCTACCTG CACGGCCTCG TGCGCGACGC CAAGGGCAAG AAGATGTCCA AATCCGTGGG CAACGTGGTC GACCCGCTGG AGATCATCGA CGAGTACGGT GCGGATGCGC TGCGCTTCTC CTCGGCGGCC ATGGCCAGCC TGGGCGGCGT GTTGAAACTC GACCTCCAAC GGGTGCAGGG CTACCGCAAT TTCGGCACCA AGCTGTGGAA CGCCACTCGC TTTGCCGAGA TGAACGAGGT CTTCACCGCC CACACCCAAT CCGCCATGCC CCCCGGCTGC ACCGAAACCG TGAACCGCTG GATCATCGGC GAGACCGCGA AAGTGCGCGA GGCGGTCGAT ACCGCCCTGG CCGAGTACAA GTTCAACGAC GCCGCCAATG CGCTTTATGC CTTCGTCTGG GGCAAGGTCT GCGACTGGTA CGTGGAATTC GCCAAACCGC TCCTGCTCGA TGGCGATGAC GCCACGAAGG CGGAAACCCG CGCCGTCATG GCCTGGGTGC TGGACCAATG CTTCATCCTG CTGCACCCGA TCATGCCCTT CATCACCGAA GAACTCTGGG GCACAACCGG ACAGCGCGAC AAGATGCTCG TGCATGCGGA CTGGCCGAGT TACGGGGCCG ACCTGGTCGA CGCCGACGCC GACCGCGAAA TGAACTGGGT GATCTCGCTG ATCGAAAGCG TGCGCTCCGT GCGCGCCCAG ATGCGGGTGC CCGCGGGGCT CTACGTGCCC GTGGTGCAGG TCGCGCTCGA TGAGGCCGGA CAGCGCGCCT ATGCCAATAA CGAAACCCTG ATCAAGCGGC TCGCCCGGAT CGAAGGCATC ACCAAGGCAG ACACGGCTCC CAAGGGCGCG CTGACCATCC CGGTCGAAGG CGGCACCTTC GCCCTGCCTT TGGCGGACAT CATCGACGTC AGCGCCGAAA AGGACCGGCT GGGCAAGACC CTCGCCAAGC TCCAGAAAGA CCTGGGCGGT CTCAGGGGGC GGCTGTCGAA CGCGAAGTTC GTGGCCTCCG CCCCGGCCGA AGTGGTCGAG GAAAACCGCG AACGCCTCGC GGCTGGCGAA GCCGAGCTTG CCACCCTCAG CGCCGCGCTG GAGCGTCTCG AAGAAGTCGG ATAA
|
Protein sequence | MPMEKTFNAA EAEPRLYARW EAEGCFAAGA NASRDETFCV MIPPPNVTGS LHMGHAFNNT LQDILIRWKR MQGYDTLWQP GTDHAGIATQ MVTEREMAAN GEPTRAEMGR AKFLARVWEQ KVKSRGTIIG QLKRIGASCD WSREAFTMGG ADGDPEKGNG PNFHDAVIKV FVDMYDKGLI YRGKRLVNWD PHFETAISDL EVENIETPGH MWHFKYPLAG GETYEYVEKD ADGTVTLRET RDYISIATTR PETMLGDGAV AVHPSDERYA PIVGKLCEIP VGPKEHRRLI PIITDEYPDP TFGSGAVKIT GAHDFNDYQV AKRGGIPMYR LMDTKARMRD DGAPYAKAAA IAMEVAEGTR TLDEAEADSL NLVPDDLRGL DRFEARKAVV DQITAEGLAV MVPNPAAAAS TEEGAAATEE VPAFLPLVES KPIMQPFGDR SKVVIEPMLT DQWFVDAEQI VGPALDAVRN GTVKILPESG EKTYYHWLDN IEPWCISRQL WWGHQIPVWY GPRRVEVNGV ETLDFDPANA VHFVAHSVDE ARAKAAGYYA LPDADKVIIV RSFPRGTPGS GPTDGRVDPM TDAVAAAQRA EAVPDAIPLV QDPDVLDTWF SSGLWPIGTL GWPEDTEELR KYFPTSTLVT GQDILFFWVA RMMMMQLAVT GEVPFREVYL HGLVRDAKGK KMSKSVGNVV DPLEIIDEYG ADALRFSSAA MASLGGVLKL DLQRVQGYRN FGTKLWNATR FAEMNEVFTA HTQSAMPPGC TETVNRWIIG ETAKVREAVD TALAEYKFND AANALYAFVW GKVCDWYVEF AKPLLLDGDD ATKAETRAVM AWVLDQCFIL LHPIMPFITE ELWGTTGQRD KMLVHADWPS YGADLVDADA DREMNWVISL IESVRSVRAQ MRVPAGLYVP VVQVALDEAG QRAYANNETL IKRLARIEGI TKADTAPKGA LTIPVEGGTF ALPLADIIDV SAEKDRLGKT LAKLQKDLGG LRGRLSNAKF VASAPAEVVE ENRERLAAGE AELATLSAAL ERLEEVG
|
| |