Gene SeHA_C4880 details


Gene Information

Locus tag: SeHA_C4880
Symbol: valS
ID: 6488982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4748734
End bp: 4751589
Gene Length: 2856 bp
Protein Length: 951 aa
Translation table: 11
GC content: 57%
IMG OID: 642744926
Product: valyl-tRNA synthetase
Protein accession: YP_002048499
Protein GI: 194449603
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
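
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumption: coordinates are 1-based inclusive, and the final codon
    of the CDS is a stop codon that is not part of the protein.
    """
    span = end_bp - start_bp + 1                 # inclusive span on the replicon
    codons, remainder = divmod(gene_len_bp, 3)   # whole codons and leftover bases
    return (
        span == gene_len_bp
        and remainder == 0                       # CDS is a whole number of codons
        and codons - 1 == protein_len_aa         # drop the terminal stop codon
    )


# Values taken from the record above.
print(check_cds_lengths(4748734, 4751589, 2856, 951))  # → True
```

Here 4751589 - 4748734 + 1 = 2856 bp, and 2856 / 3 = 952 codons, one of which is the stop codon, leaving 951 amino acids.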


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 86
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAGA CATATAACCC CCAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG 
CAGGGCTATT TCAAGCCTAA CGGCGACGAA AGCAAAGAGT CCTTCTGCAT CATGATCCCG
CCGCCGAACG TCACCGGCAG TTTGCATATG GGACATGCTT TCCAGCAAAC CATCATGGAT
ACCATGATCC GTTACCAGCG TATGCAGGGT AAAAACACCC TGTGGCAGGT CGGCACCGAC
CACGCCGGGA TCGCCACCCA GATGGTGGTT GAGCGCAAGA TTGCCGCTGA AGAAGGTAAA
ACCCGTCACG ACTACGGCCG CGATGCGTTT ATCGACAAAA TCTGGCAGTG GAAAGCGGAA
TCCGGCGGCA CCATTACCCG TCAGATGCGC CGTCTCGGCA ACTCGGTGGA CTGGGAGCGC
GAGCGCTTCA CCATGGACGA AGGCCTTTCC AATGCCGTGA AAGAAGTCTT TGTTCGCCTG
TACGAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTGA ACTGGGACCC GAAACTGCGC
ACCGCCATCT CTGACCTGGA AGTGGAAAAC CGCGAGTCCA AAGGCTCGAT GTGGCACATC
CGCTATAAGC TTGCCGATGG CGCGAAAACC GCAGACGGTA AAGATTACCT GGTCGTCGCC
ACCACCCGTC CGGAAACCGT ACTGGGCGAT ACCGGCGTGG CCGTGAACCC GGAAGATCCG
CGTTATAAAG ATCTGATTGG CAAATTCGTT ATTCTGCCGC TGGTTAACCG CCGCATTCCG
ATTGTGGGCG ACGAACACGC CGATATGGAA AAAGGCACCG GCTGCGTGAA GATCACCCCG
GCGCACGACT TTAACGACTA TGAAGTCGGG AAACGTCACG CCCTGCCGAT GATCAACATC
CTGACCTTTG ACGGCGACAT TCGCGAAAGC GCGGAAGTGT TCGATACCAA AGGTGAAGAG
TCTGACGTTT ATTCCAGCGA GATTCCAGCT GAGTTCCAGA AGCTGGAACG TTTTGCTGCC
CGTAAGGCCA TCGTTGCTGC CGTTGACGCG CTGGGCCTGC TGGAAGAAAT TAAACCGCAC
GATCTGACCG TCCCTTACGG CGACCGTGGC GGCGTGGTTA TCGAACCGAT GCTAACCGAC
CAGTGGTACG TCCGTGCCGA CGTGCTGGCG AAACCGGCGG TGGAAGCGGT TGAAAACGGC
GACATTCAGT TCGTGCCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGTGATATC
CAGGACTGGT GTATCTCTCG TCAGTTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC
AACGACGGCA ACGTCTACGT TGGCCGTACC GAAGACGAAG TGCGTCAGGA AAATAACCTC
GGCGCCGACG TTGCGCTTCG TCAGGACGAA GACGTTCTTG ATACCTGGTT CTCCTCCGCG
CTGTGGACTT TCTCTACCCT CGGCTGGCCG GAAAACACCG ACGCGCTGCG TCAGTTCCAC
CCGACCAGCG TGATGGTTTC CGGCTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA AACGGCAAGC CGCAGGTGCC GTTCCATACC
GTCTACATGA CCGGTCTGAT TCGCGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT
AACGTTATCG ACCCGCTGGA TATGGTGGAC GGCATCTCCC TGCCGGAACT GCTGGAAAAA
CGCACCGGCA ACATGATGCA GCCGCAGATG GCGGAGAAAA TCCGCAAGCG TACCGAGAAG
CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACCGACGCCC TGCGCTTTAC CCTGGCGGCG
CTGGCCTCGA CCGGTCGCGA CATCAACTGG GATATGAAGC GTCTGGAAGG TTACCGTAAC
TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACCGA AGAGCAGGAT
TGCGGCTTCA ACGGCGGTGA AATGACCCTG TCGCTGGCTG ACCGTTGGAT CCTGGCGGAA
TTCAACCAGA CCGTTAAAGC GTACCGTGAC GCGCTGGACA GCTTCCGCTT CGATATCGCC
GCGGGCATCC TGTACGAGTT CACCTGGAAC CAGTTCTGCG ACTGGTATCT GGAGCTGACC
AAGCCGGTAA TGACCGGGGG TTCCGAGTCT GAACTGCGTG GCACCCGCCA TACGCTGGTC
ACCGTACTGG AAGGTCTGCT GCGCCTGGCG CATCCGATCA TTCCGTTCAT CACCGAAACC
ATCTGGCAGC GCGTGAAGGT TATCTGTGGC ATTACCGCCG ATACCATTAT GCTGCAGCCG
TTCCCGGAAT ATAACGCCGC ACAGGTGGAT GAAGCCGCGC TGGCCGATAC CGAGTGGCTG
AAGCAGGCGA TCGTCGCGGT ACGTAACATT CGTGCGGAAA TGAACATCGC CCCGGGCAAA
CCGCTGGAAC TGCTGCTGCG CGGCTGTAGT GAAGAAGCCG TTCGTCGTGT TAACGACAAC
CGTAGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGCATCA CCGTGCTGCC AGCCGATGAC
AAAGGTCCGG TTTCCGTGAC CAAAATCATC GACGGCGCCG AACTGCTGAT CCCGATGGCA
GGCCTCATCA ACAAAGACGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAAATCGAA
GGCGAGATTG CCCGTATCGA AGGCAAACTG TCCAACGAAG GTTTCGTCGC CCGTGCGCCG
GAAGCGGTGA TTGCCAAAGA GCGTGAGAAG CTGGACGGTT ACGCAGAAGC CAAAGCGAAA
CTGATTGAGC AGCAGGCGGT TATTAGCGCG CTGTAA
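
The gene sequence above is displayed in blocks of 10 bases, 60 per line. A minimal sketch of how such a block could be joined into one string and its GC percentage computed, for comparison with the GC content field in the record, is given below. The helper names `clean_sequence` and `gc_percent` are illustrative assumptions, and only the first displayed line is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (sample input only).
first_line = "ATGGAAAAGA CATATAACCC CCAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the complete sequence block to `clean_sequence` and then to `gc_percent` gives a figure for the whole gene rather than for this 60-base sample.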
 
Protein sequence
MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SKESFCIMIP PPNVTGSLHM GHAFQQTIMD 
TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGRDAF IDKIWQWKAE
SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YEEDLIYRGK RLVNWDPKLR
TAISDLEVEN RESKGSMWHI RYKLADGAKT ADGKDYLVVA TTRPETVLGD TGVAVNPEDP
RYKDLIGKFV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI
LTFDGDIRES AEVFDTKGEE SDVYSSEIPA EFQKLERFAA RKAIVAAVDA LGLLEEIKPH
DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD NDGNVYVGRT EDEVRQENNL GADVALRQDE DVLDTWFSSA
LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT
VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQM AEKIRKRTEK
QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEEQD
CGFNGGEMTL SLADRWILAE FNQTVKAYRD ALDSFRFDIA AGILYEFTWN QFCDWYLELT
KPVMTGGSES ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVICG ITADTIMLQP
FPEYNAAQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS EEAVRRVNDN
RSFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKDDELA RLAKEVAKIE
GEIARIEGKL SNEGFVARAP EAVIAKEREK LDGYAEAKAK LIEQQAVISA L
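
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is an illustration under stated assumptions: it builds the codon table from the standard NCBI base ordering (the amino acid assignments of table 11 match the standard code; the tables differ in which start codons are permitted), it does not handle alternative start codons such as GTG or TTG, and the function name `translate` is hypothetical. Only the first 60 bases of the gene are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino acid assignments are those of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the sequence starts in frame with ATG; alternative start
    codons (which table 11 would read as Met) are not handled here.
    """
    cds = "".join(cds.split()).upper()   # remove display spacing
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":                    # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence shown in this record.
first_60 = "ATGGAAAAGA CATATAACCC CCAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG"
print(translate(first_60))  # → MEKTYNPQDIEQPLYEHWEK
```

The output matches the first 20 residues of the protein sequence listed above (MEKTYNPQDI EQPLYEHWEK).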