Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4861 |
Symbol | valS |
ID | 6871479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4709459 |
End bp | 4712314 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642787745 |
Product | valyl-tRNA synthetase |
Protein accession | YP_002218339 |
Protein GI | 198246120 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.522647 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGA CATATAACCC CCAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG CAGGGCTATT TCAAGCCTAA CGGCGACGAA AGCAAAGAGT CCTTCTGCAT CATGATCCCG CCGCCGAACG TCACCGGCAG TTTGCATATG GGACATGCTT TCCAGCAAAC CATCATGGAT ACCATGATCC GTTACCAGCG TATGCAGGGT AAAAACACCC TGTGGCAGGT CGGCACCGAC CACGCCGGGA TCGCAACCCA GATGGTGGTT GAGCGCAAGA TTGCCGCTGA AGAAGGTAAA ACCCGTCACG ACTACGGCCG CGATGCGTTT ATCGACAAAA TCTGGCAGTG GAAAGCGGAA TCCGGCGGCA CCATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTTGA CTGGGAGCGC GAGCGCTTCA CCATGGACGA AGGCCTTTCC AATGCCGTGA AAGAAGTCTT TGTTCGCCTG TACAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTGA ACTGGGACCC GAAACTGCGC ACCGCCATCT CTGACCTGGA AGTGGAAAAC CGCGAGTCCA AAGGCTCGAT GTGGCACATC CGCTATCCGC TGGCCGACGG CGCGAAAACC GCAGACGGTA AAGATTATCT GGTTGTCGCC ACCACCCGCC CGGAAACTAT TCTCGGCGAT ACCGGCGTGG CCGTGAACCC GGAAGATCCG CGCTACCAGA GTCTTATCGG TAAATTCGTT ATTCTGCCGC TGGTTAACCG CCGCATTCCG ATTGTTGGCG ACGAACACGC CGATATGGAA AAAGGCACCG GCTGCGTGAA GATCACCCCG GCGCACGACT TTAACGACTA TGAGGTCGGG AAACGTCATG CCCTGCCGAT GATCAACATC CTGACCTTTG ATGGCGACAT CCGTGAAAGC GCGGAAGTGT TCGATACCAA AGGTGAAGAG TCTGACGTTT ATTCCAGCGA GATTCCGGCT GAGTTCCAAA AGCTGGAACG CTTTGCTGCC CGTAAGGCCA TCGTTGCTGC CGTTGACGCG CTCGGCCTGC TGGAAGAAAT TAAACCGCAC GATCTGACCG TCCCTTACGG CGACCGTGGC GGCGTTGTTA TCGAACCGAT GCTGACCGAC CAGTGGTATG TGCGTGCAGA CGTGCTGGCG AAACCGGCGG TGGAAGCGGT TGAGAACGGC GACATTCAGT TCGTGCCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT CAGGACTGGT GTATCTCTCG TCAGCTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC AACGACGGCA ACGTCTACGT TGGCCGTACC GAAGACGAAG TGCGTCAGGA AAATAACCTC GGCGCCGACG TTCAGCTTCG TCAGGACGAA GACGTTCTTG ATACCTGGTT CTCCTCCGCG CTGTGGACTT TCTCTACCCT CGGCTGGCCG GAAAACACCG ACGCGCTGCG TCAGTTCCAC CCGACCAGCG TGATGGTTTC CGGCTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC ATGATGACCA TGCACTTCAT CAAAGATGAA AACGGCAAGC CGCAGGTGCC GTTCCATACC GTCTACATGA CCGGTCTGAT TCGCGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT AACGTTATCG ACCCGCTGGA TATGGTGGAC GGCATCTCCC TGCCGGAACT GCTGGAAAAA CGCACCGGCA ACATGATGCA GCCGCAGATG GCGGAGAAAA TCCGCAAGCG TACCGAGAAG CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACCGACGCCC TGCGCTTTAC CCTGGCGGCG CTGGCCTCGA CCGGTCGCGA CATCAACTGG GATATGAAGC GTCTGGAAGG TTACCGTAAC TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACCGA AGAGCAGGAT TGCGGCTTCA ACGGCGGTGA AATGACCCTG TCGCTGGCTG ACCGTTGGAT CCTGGCGGAA TTCAACCAGA CCGTTAAAGC GTACCGCGAC GCGCTGGACA GCTTCCGCTT CGATATCGCC GCGGGCATCC TGTACGAGTT CACCTGGAAC CAGTTCTGCG ACTGGTATCT GGAGCTGACC AAGCCGGTAA TGACCGGGGG TTCCGAGTCT GAACTGCGTG GCACCCGCCA TACGCTGGTC ACCGTACTGG AAGGTCTGCT GCGCCTGGCG CATCCGATCA TTCCGTTCAT CACCGAAACC ATCTGGCAGC GCGTGAAGGT TATCTGTGGC ATTACCGCCG ATACCATTAT GCTGCAGCCG TTCCCGGAAT ATAACGCCGC ACAGGTGGAT GAAGCCGCGC TGGCCGATAC CGAGTGGCTG AAGCAGGCGA TCGTCGCGGT ACGTAACATT CGTGCGGAAA TGAACATCGC CCCGGGCAAA CCGCTGGAAC TGCTGCTGCG CGGCTGTAGT GAAGAAGCCG TTCGTCGTGT TAACGACAAC CGTAGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGCATCA CCGTGCTGCC AGCCGATGAC AAAGGTCCGG TTTCCGTGAC CAAAATCATC GACGGCGCTG AACTGCTGAT CCCGATGGCA GGCCTCATCA ACAAAGACGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAAATCGAA GGCGAGATTG CCCGTATCGA AGGCAAACTG TCCAACGAAG GTTTCGTTGC CCGCGCGCCG GAAGCGGTCA TTGCCAAAGA GCGTGAGAAG CTGGACGGTT ACGCAGAAGC GAAAGCGAAG CTGATTGAGC AGCAGGCGGT TATTAGCGCG CTGTAA
|
Protein sequence | MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SKESFCIMIP PPNVTGSLHM GHAFQQTIMD TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGRDAF IDKIWQWKAE SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETILGD TGVAVNPEDP RYQSLIGKFV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI LTFDGDIRES AEVFDTKGEE SDVYSSEIPA EFQKLERFAA RKAIVAAVDA LGLLEEIKPH DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI QDWCISRQLW WGHRIPAWYD NDGNVYVGRT EDEVRQENNL GADVQLRQDE DVLDTWFSSA LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQM AEKIRKRTEK QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEEQD CGFNGGEMTL SLADRWILAE FNQTVKAYRD ALDSFRFDIA AGILYEFTWN QFCDWYLELT KPVMTGGSES ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVICG ITADTIMLQP FPEYNAAQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS EEAVRRVNDN RSFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKDDELA RLAKEVAKIE GEIARIEGKL SNEGFVARAP EAVIAKEREK LDGYAEAKAK LIEQQAVISA L
|
| |