Gene Information |
Locus tag | Dd1591_3722 |
Symbol | valS |
ID | 8120721 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | + |
Start bp | 4209767 |
End bp | 4212622 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644854092 |
Product | valyl-tRNA synthetase |
Protein accession | YP_003006004 |
Protein GI | 251791283 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
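The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The constant and function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields; the names are illustrative.
START_BP = 4209767
END_BP = 4212622
GENE_LENGTH_BP = 2856
PROTEIN_LENGTH_AA = 951


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Amino-acid codons in an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: 4212622 - 4209767 + 1 = 2856 bp, and 2856 / 3 - 1 = 951 aa.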
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.12918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGA CATACAACCC GCACGACATT GAGCAGCCGC TGTACGAGCA CTGGGAAAAA CAGGGCTACT TCAAGCCCAA CGGCGACACC AGCAAAGAAA GCTTCAGCAT CATGATCCCG CCGCCCAACG TGACCGGCAG CCTGCACATG GGCCATGCTT TCCAGCAAAC CATTATGGAC ACGATGATCC GTTACCAGCG TATGCAGGGT AAAAACACCC TGTGGCAGGC CGGGACCGAT CACGCCGGTA TCGCCACCCA GATGGTCGTG GAACGTAAGA TCGCCGCCGA AGAAGGCAAA ACCCGCCACG ATTACGGCCG TGAAGCGTTT ATCGATAAAA TCTGGCAGTG GAAGGCGGAA TCCGGCGGCA CCATTACCCG CCAGATGCGC CGACTGGGCA ACTCGGTAGA CTGGGAGCGC GAGCGTTTCA CCATGGACGA CGGCTTGTCC AACGCGGTGA AGGAAGTGTT CGTTCGCCTG TATCAAGAAG ACCTGATCTA CCGCGGCAAA CGCCTGGTGA ACTGGGACCC GAAACTGCGT ACCGCCATTT CCGACCTGGA AGTGGAAAAC CGCGACGTCA AAGGATCGAT GTGGCATCTG CGCTACCCGC TGGCGGACGG CGCGAAAACC GCCGACGGCA AAGAGTATCT GGTGGTGGCG ACCACCCGTC CGGAAACCGT GCTGGGTGAT ACCGGCGTAG CGGTAAACCC GGAAGATCCG CGCTATAAAG ATCTGATCGG CAAGTTCCTG ATCCTGCCGC TGGTCGGTCG CCGCATTCCG ATCGTCGGCG ACGAACACGC CGATATGGAA AAAGGCACCG GCTGCGTGAA AATTACCCCG GCGCACGACT TCAACGACTA CGAAGTCGGC AAGCGTCACC AGTTGCCGAT GATCAACATC CTGACGTTCG ACGGTGATAT CCGCCAGGAA GCGGAAGTTT TCAATACCAA CGGCGAAGCC AGCACCGCCT ACAGCAGCGA CATCCCCGAC GCGTTTCGCG GACTGGAGCG TTTCGCCGCT CGCAAAGCCG TGGTAGCCGC GTTTGATGAG CTGGGCCTGT TGGAAGAAAT TAAAGCGCAC GACCTGACCG TGCCCTACGG CGACCGCGGC GGCGTGGTGA TCGAGCCGAT GCTGACCGAC CAGTGGTACG TTCGTGCCGG CGTGCTGGCT AAACCGGCGG TGGAAGCGGT GGAAGACGGC CGTATCCAGT TCGTGCCGAA GCAGTACGAA AACATGTACT TCAGTTGGAT GCGCGACATT CAGGACTGGT GTATCTCTCG CCAGCTGTGG TGGGGTCATC GCATCCCGGC GTGGTACGAC GATAACGGCA AAGTCTACGT CGGCCGTGAT GAAGCCGAAG TGCGCCGCGA GAATAATCTG GCCGCGGATG TAGCGCTGCG TCAGGACGAA GACGTGCTGG ACACCTGGTT CTCCTCCGGG TTGTGGACTT TCTCCACACT CGGCTGGCCG GAGCAAACCC CGGAGCTGAA AGCCTTCCAT CCCAGCAGTG TGATGGTCAG CGGCTTCGAC ATCATTTTCT TCTGGATCGC CCGCATGATC ATGATGACCA TGCACTTCAT CAAAGATGAA GACGGCAAGC CGCAGGTGCC GTTCCATACC GTGTATATGA CCGGTCTGAT CCGTGACGAG GAAGGCCAGA AGATGTCCAA ATCCAAGGGC AACGTGATTG ACCCGCTGGA TATGGTAGAC GGTATTTCGC TGGAAGCGCT GCTGGAAAAA CGCACCGGCA ACATGATGCA GCCGCAACTG GCGGAGAAAA TCCGCAAACG TACCGAAAAA 
CAGTTCCCGA ACGGCATCGA ACCGCACGGC ACCGACGCCC TGCGCTTTAC GCTGGCGGCG CTGGCCTCTA CCGGCCGCGA CATCAACTGG GACATGAAAC GCCTGGAAGG TTACCGCAAC TTCTGTAATA AGCTGTGGAA CGCCAGCCGC TTCGTGCTGA TGAACACCGA AGAACAGGAT TGCGGCTTTA ACGGCGGCGA GAAAGTGCTG TCGCTGGCAG ACCGCTGGAT TCTGGCGGAA TTCAACCGCA CAGTGAAAGC CTACCGTGAA GCGCTGGATA GCTACCGTTT CGATCTGGCC GCCAACGTGC TGTATGAGTT CACCTGGAAC CAGTTCTGCG ACTGGTATCT GGAACTGACC AAACCGGTGA TGAACGGCGG CAGTGAAGCC GAACTGCGCG GCACGCGCCA CACGCTGGTC ACCGTACTGG AAGCATTGTT GCGCCTGGCG CACCCGATCA TCCCGTTCAT TACCGAAACC ATCTGGCAAC GGGTGAAGGT ACTGAAAGGC GTGAGCGCCG ACACGATTAT GCTGCAACCG TTCCCGGCCT TTGATGCCAC GCTGGAAGAC GAGCAAGCAT TTAACGATCT GGAGTGGATC AAGCAGACGA TCATCGCAGT GCGTAATATC CGCGCTGAAA TGAACATCGC CCCCAGCAAA CCGCTGGCAC TGCTGCTGCG CGATGCGTCC GCCGATGCCA CCCGCCGCGT GCAGGACAAT CTCGGCTTTA TTCAGACGCT GGCGCGTCTG GAGAGCATCA CGCTGTTGCC GGCTGGCGAC AAAGGCCCGG TTTCCGTTAC CAAACTGGTG GATGGCGCCG AGTTGTTGAT CCCGATGGCC GGCTTGATTG ATAAAGTGGC CGAGCTGGAC AGGCTGGCAA AAGAAGTGGC GAAGCTTGAA GTGGAAATCG GCCGCATCGA CAGCAAGCTG TCCAACGACG GTTTTGTGGC GCGCGCACCG GAAGCGGTTG TCGCCAAAGA GCGTGAAAAA CGTGACGGTT ACGCCGCCGC CAAAGCCAAA CTGCTGGAGC AGCAGGCGAC TATCGCCGCA CTGTAA
|
Protein sequence | MEKTYNPHDI EQPLYEHWEK QGYFKPNGDT SKESFSIMIP PPNVTGSLHM GHAFQQTIMD TMIRYQRMQG KNTLWQAGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWQWKAE SGGTITRQMR RLGNSVDWER ERFTMDDGLS NAVKEVFVRL YQEDLIYRGK RLVNWDPKLR TAISDLEVEN RDVKGSMWHL RYPLADGAKT ADGKEYLVVA TTRPETVLGD TGVAVNPEDP RYKDLIGKFL ILPLVGRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHQLPMINI LTFDGDIRQE AEVFNTNGEA STAYSSDIPD AFRGLERFAA RKAVVAAFDE LGLLEEIKAH DLTVPYGDRG GVVIEPMLTD QWYVRAGVLA KPAVEAVEDG RIQFVPKQYE NMYFSWMRDI QDWCISRQLW WGHRIPAWYD DNGKVYVGRD EAEVRRENNL AADVALRQDE DVLDTWFSSG LWTFSTLGWP EQTPELKAFH PSSVMVSGFD IIFFWIARMI MMTMHFIKDE DGKPQVPFHT VYMTGLIRDE EGQKMSKSKG NVIDPLDMVD GISLEALLEK RTGNMMQPQL AEKIRKRTEK QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEEQD CGFNGGEKVL SLADRWILAE FNRTVKAYRE ALDSYRFDLA ANVLYEFTWN QFCDWYLELT KPVMNGGSEA ELRGTRHTLV TVLEALLRLA HPIIPFITET IWQRVKVLKG VSADTIMLQP FPAFDATLED EQAFNDLEWI KQTIIAVRNI RAEMNIAPSK PLALLLRDAS ADATRRVQDN LGFIQTLARL ESITLLPAGD KGPVSVTKLV DGAELLIPMA GLIDKVAELD RLAKEVAKLE VEIGRIDSKL SNDGFVARAP EAVVAKEREK RDGYAAAKAK LLEQQATIAA L
|
| |
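The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It relies on the amino-acid assignments of translation table 11, which match the standard code, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG). All function names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order for each position.
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGGAAAAGA CATACAACCC GCACGACATT"))  # MEKTYNPHDI
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence checks that the two fields agree. `gc_fraction` on the full gene sequence can be compared with the listed GC content.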