Gene SNSL254_A4824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4824 
SymbolvalS 
ID6482517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4695549 
End bp4698404 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content57% 
IMG OID642740037 
Productvalyl-tRNA synthetase 
Protein accessionYP_002043715 
Protein GI194444143 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA CATATAACCC CCAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG 
CAGGGCTATT TCAAGCCTAA CGGCGACGAA AGCAAAGAGT CCTTCTGCAT CATGATCCCG
CCGCCGAACG TCACCGGCAG TTTGCATATG GGACATGCTT TCCAGCAAAC CATCATGGAT
ACCATGATCC GTTACCAGCG TATGCAGGGT AAAAACACCC TGTGGCAGGT CGGCACCGAC
CACGCCGGGA TCGCCACCCA GATGGTGGTT GAGCGCAAGA TTGCCGCTGA AGAAGGTAAA
ACCCGTCACG ACTACGGCCG CGATGCGTTT ATCGACAAAA TCTGGCAGTG GAAAGCGGAA
TCCGGCGGCA CCATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTTGA CTGGGAGCGC
GAGCGCTTCA CCATGGACGA AGGTCTTTCC AATGCCGTGA AAGAAGTCTT TGTCCGCCTG
TACAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTGA ACTGGGACCC GAAACTGCGT
ACCGCCATCT CTGACCTGGA AGTGGAAAAC CGCGAGTCCA AAGGCTCGAT GTGGCACATC
CGCTATCCGC TTGCCGACGG CGCGAAAACC GCAGACGGTA AAGATTACCT GGTTGTCGCC
ACCACCCGTC CGGAAACCGT ACTGGGCGAT ACCGGCGTGG CCGTGAACCC GGAAGATCCG
CGTTATAAAG ATCTGATTGG CAAATTCGTT ATTCTGCCGC TGGTTAACCG CCGCATTCCA
ATCGTTGGCG ACGAACACGC CGATATGGAA AAAGGCACCG GCTGCGTAAA AATCACCCCG
GCGCACGACT TTAACGACTA TGAAGTCGGG AAACGTCACG CCCTGCCGAT GATCAACATC
CTGACCTTTG ACGGCGACAT TCGCGAAAGC GCGGAAGTGT TCGATACCAA AGGTGAGGAA
TCCGACGTTT ACTCCAGCGA GATTCCGGCT GAGTTCCAAA AGCTGGAACG CTTTGCTGCC
CGTAAGGCCA TCGTCGCTGC CGTCGACGCG CTGGGCCTGC TGGAAGAAAT TAAACCGCAC
GATCTGACCG TCCCTTACGG CGACCGTGGC GGCGTGGTTA TCGAACCGAT GCTGACCGAC
CAGTGGTACG TCCGTGCTGA CGTGCTGGCG AAACCGGCGG TGGAAGCGGT TGAGAACGGC
GACATTCAGT TCGTGCCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGTGATATC
CAGGACTGGT GTATCTCTCG TCAGCTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC
AACGACGGCA ACGTCTACGT TGGCCGTACC GAAGAAGAAG TGCGTCAGGA AAATAACCTC
GGCGCCGACG TTCAGCTTCG TCAGGACGAA GACGTTCTCG ATACCTGGTT CTCCTCCGCG
CTGTGGACTT TCTCTACCCT CGGCTGGCCG GAAAACACCG ACGCCCTGCG TCAGTTCCAC
CCGACCAGCG TGATGGTTTC CGGCTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA AACGGCAAGC CGCAGGTGCC GTTCCATACC
GTCTACATGA CCGGTCTGAT TCGCGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT
AACGTTATCG ACCCGCTGGA TATGGTGGAC GGCATCTCCC TGCCGGAACT GCTGGAAAAA
CGCACTGGCA ATATGATGCA GCCGCAGATG GCGGAGAAAA TTCGCAAGCG TACCGAGAAG
CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACTGATGCGC TGCGCTTCAC CCTGGCGGCG
CTGGCCTCGA CCGGCCGCGA CATCAACTGG GATATGAAGC GTCTGGAAGG TTACCGTAAC
TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACCGA AGAGCAGGAT
TGCGGCTTCA ACGGCGGCGA AATGACGCTG TCGCTGGCGG ACCGCTGGAT CCTGGCGGAA
TTCAACCAGA CCGTTAAAGC CTACCGCGAC GCGCTGGACA GCTTCCGCTT CGATATCGCC
GCGGGCATTC TGTATGAATT CACCTGGAAC CAGTTCTGCG ACTGGTATCT GGAGCTGACC
AAGCCGGTGA TGACCGGGGG TTCCGAGTCT GAACTGCGCG GTACCCGCAA TACGCTGGTC
ACCGTACTGG AAGGTCTGCT GCGCCTGGCG CACCCGATCA TTCCGTTCAT CACCGAAACC
ATCTGGCAGC GTGTGAAGGT CATTTGCGGT ATTACTGCCG ACACCATCAT GCTGCAGCCG
TTCCCGGAAT ATAACGCCGC ACAGGTGGAT GAAGCCGCGT TGGCCGATAC CGAGTGGCTG
AAGCAGGCGA TCGTTGCGGT ACGTAACATT CGTGCGGAAA TGAACATCGC CCCGGGCAAA
CCGCTGGAAC TGCTGCTGCG CGGCTGTAGT GAAGAAGCCG TTCGTCGTGT TAACGACAAC
CGTAGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGCATCA CCGTGCTGCC AGCCGATGAC
AAAGGCCCGG TTTCCGTGAC CAAAATCATC GACGGCGCTG AACTGCTGAT CCCGATGGCT
GGCCTCATCA ACAAAGACGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC AAAAATCGAA
GGTGAGATTG CCCGTATCGA AGGCAAGCTG TCCAACGAAG GTTTCGTTGC TCGCGCGCCG
GAAGCGGTCA TTGCCAAAGA GCGTGAGAAG CTGGAAGGCT ATGCAGAAGC AAAAGCGAAG
CTGATTGAGC AGCAGGCGGT TATTAGCGCG CTGTAA
 
Protein sequence
MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SKESFCIMIP PPNVTGSLHM GHAFQQTIMD 
TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGRDAF IDKIWQWKAE
SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR
TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETVLGD TGVAVNPEDP
RYKDLIGKFV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI
LTFDGDIRES AEVFDTKGEE SDVYSSEIPA EFQKLERFAA RKAIVAAVDA LGLLEEIKPH
DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD NDGNVYVGRT EEEVRQENNL GADVQLRQDE DVLDTWFSSA
LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT
VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQM AEKIRKRTEK
QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEEQD
CGFNGGEMTL SLADRWILAE FNQTVKAYRD ALDSFRFDIA AGILYEFTWN QFCDWYLELT
KPVMTGGSES ELRGTRNTLV TVLEGLLRLA HPIIPFITET IWQRVKVICG ITADTIMLQP
FPEYNAAQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS EEAVRRVNDN
RSFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKDDELA RLAKEVAKIE
GEIARIEGKL SNEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVISA L