Gene SeD_A4861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4861 
SymbolvalS 
ID6871479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4709459 
End bp4712314 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content57% 
IMG OID642787745 
Productvalyl-tRNA synthetase 
Protein accessionYP_002218339 
Protein GI198246120 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.522647 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA CATATAACCC CCAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG 
CAGGGCTATT TCAAGCCTAA CGGCGACGAA AGCAAAGAGT CCTTCTGCAT CATGATCCCG
CCGCCGAACG TCACCGGCAG TTTGCATATG GGACATGCTT TCCAGCAAAC CATCATGGAT
ACCATGATCC GTTACCAGCG TATGCAGGGT AAAAACACCC TGTGGCAGGT CGGCACCGAC
CACGCCGGGA TCGCAACCCA GATGGTGGTT GAGCGCAAGA TTGCCGCTGA AGAAGGTAAA
ACCCGTCACG ACTACGGCCG CGATGCGTTT ATCGACAAAA TCTGGCAGTG GAAAGCGGAA
TCCGGCGGCA CCATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTTGA CTGGGAGCGC
GAGCGCTTCA CCATGGACGA AGGCCTTTCC AATGCCGTGA AAGAAGTCTT TGTTCGCCTG
TACAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTGA ACTGGGACCC GAAACTGCGC
ACCGCCATCT CTGACCTGGA AGTGGAAAAC CGCGAGTCCA AAGGCTCGAT GTGGCACATC
CGCTATCCGC TGGCCGACGG CGCGAAAACC GCAGACGGTA AAGATTATCT GGTTGTCGCC
ACCACCCGCC CGGAAACTAT TCTCGGCGAT ACCGGCGTGG CCGTGAACCC GGAAGATCCG
CGCTACCAGA GTCTTATCGG TAAATTCGTT ATTCTGCCGC TGGTTAACCG CCGCATTCCG
ATTGTTGGCG ACGAACACGC CGATATGGAA AAAGGCACCG GCTGCGTGAA GATCACCCCG
GCGCACGACT TTAACGACTA TGAGGTCGGG AAACGTCATG CCCTGCCGAT GATCAACATC
CTGACCTTTG ATGGCGACAT CCGTGAAAGC GCGGAAGTGT TCGATACCAA AGGTGAAGAG
TCTGACGTTT ATTCCAGCGA GATTCCGGCT GAGTTCCAAA AGCTGGAACG CTTTGCTGCC
CGTAAGGCCA TCGTTGCTGC CGTTGACGCG CTCGGCCTGC TGGAAGAAAT TAAACCGCAC
GATCTGACCG TCCCTTACGG CGACCGTGGC GGCGTTGTTA TCGAACCGAT GCTGACCGAC
CAGTGGTATG TGCGTGCAGA CGTGCTGGCG AAACCGGCGG TGGAAGCGGT TGAGAACGGC
GACATTCAGT TCGTGCCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT
CAGGACTGGT GTATCTCTCG TCAGCTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC
AACGACGGCA ACGTCTACGT TGGCCGTACC GAAGACGAAG TGCGTCAGGA AAATAACCTC
GGCGCCGACG TTCAGCTTCG TCAGGACGAA GACGTTCTTG ATACCTGGTT CTCCTCCGCG
CTGTGGACTT TCTCTACCCT CGGCTGGCCG GAAAACACCG ACGCGCTGCG TCAGTTCCAC
CCGACCAGCG TGATGGTTTC CGGCTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA AACGGCAAGC CGCAGGTGCC GTTCCATACC
GTCTACATGA CCGGTCTGAT TCGCGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT
AACGTTATCG ACCCGCTGGA TATGGTGGAC GGCATCTCCC TGCCGGAACT GCTGGAAAAA
CGCACCGGCA ACATGATGCA GCCGCAGATG GCGGAGAAAA TCCGCAAGCG TACCGAGAAG
CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACCGACGCCC TGCGCTTTAC CCTGGCGGCG
CTGGCCTCGA CCGGTCGCGA CATCAACTGG GATATGAAGC GTCTGGAAGG TTACCGTAAC
TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACCGA AGAGCAGGAT
TGCGGCTTCA ACGGCGGTGA AATGACCCTG TCGCTGGCTG ACCGTTGGAT CCTGGCGGAA
TTCAACCAGA CCGTTAAAGC GTACCGCGAC GCGCTGGACA GCTTCCGCTT CGATATCGCC
GCGGGCATCC TGTACGAGTT CACCTGGAAC CAGTTCTGCG ACTGGTATCT GGAGCTGACC
AAGCCGGTAA TGACCGGGGG TTCCGAGTCT GAACTGCGTG GCACCCGCCA TACGCTGGTC
ACCGTACTGG AAGGTCTGCT GCGCCTGGCG CATCCGATCA TTCCGTTCAT CACCGAAACC
ATCTGGCAGC GCGTGAAGGT TATCTGTGGC ATTACCGCCG ATACCATTAT GCTGCAGCCG
TTCCCGGAAT ATAACGCCGC ACAGGTGGAT GAAGCCGCGC TGGCCGATAC CGAGTGGCTG
AAGCAGGCGA TCGTCGCGGT ACGTAACATT CGTGCGGAAA TGAACATCGC CCCGGGCAAA
CCGCTGGAAC TGCTGCTGCG CGGCTGTAGT GAAGAAGCCG TTCGTCGTGT TAACGACAAC
CGTAGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGCATCA CCGTGCTGCC AGCCGATGAC
AAAGGTCCGG TTTCCGTGAC CAAAATCATC GACGGCGCTG AACTGCTGAT CCCGATGGCA
GGCCTCATCA ACAAAGACGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAAATCGAA
GGCGAGATTG CCCGTATCGA AGGCAAACTG TCCAACGAAG GTTTCGTTGC CCGCGCGCCG
GAAGCGGTCA TTGCCAAAGA GCGTGAGAAG CTGGACGGTT ACGCAGAAGC GAAAGCGAAG
CTGATTGAGC AGCAGGCGGT TATTAGCGCG CTGTAA
 
Protein sequence
MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SKESFCIMIP PPNVTGSLHM GHAFQQTIMD 
TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGRDAF IDKIWQWKAE
SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR
TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETILGD TGVAVNPEDP
RYQSLIGKFV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI
LTFDGDIRES AEVFDTKGEE SDVYSSEIPA EFQKLERFAA RKAIVAAVDA LGLLEEIKPH
DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD NDGNVYVGRT EDEVRQENNL GADVQLRQDE DVLDTWFSSA
LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT
VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQM AEKIRKRTEK
QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEEQD
CGFNGGEMTL SLADRWILAE FNQTVKAYRD ALDSFRFDIA AGILYEFTWN QFCDWYLELT
KPVMTGGSES ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVICG ITADTIMLQP
FPEYNAAQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS EEAVRRVNDN
RSFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKDDELA RLAKEVAKIE
GEIARIEGKL SNEGFVARAP EAVIAKEREK LDGYAEAKAK LIEQQAVISA L