Gene YPK_3681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_3681 
SymbolvalS 
ID6090505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp4065984 
End bp4068881 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content52% 
IMG OID641598769 
Productvalyl-tRNA synthetase 
Protein accessionYP_001722401 
Protein GI170025896 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.470005 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATA CACCTTCTCA TATCGACAAA ACTGAGCCGT CCCTCGATAA AACATACAGC 
CCGCAGGAAA TTGAGCAGCC GCTGTATGAA CATTGGGAGA AACAGGGTTA TTTCAAACCA
AACGGCGATA CCAGCAAAGA AAGCTACTGC ATCATGATCC CGCCGCCGAA TGTGACCGGC
AGCCTGCATA TGGGTCATGC ATTCCAGCAG ACCATCATGG ATACCTTGAT TCGCTATCAG
CGTATGCAGG GGAAAAATAC CCTATGGCAG GCAGGTACCG ATCATGCAGG TATCGCGACC
CAGATGGTTG TGGAACGCAA GATTGCCGCC GAAGAAGGCA AGACCCGCCA CGATTACGGC
CGTGATGCGT TTATCGATAA AATCTGGGAG TGGAAAGGCG AATCCGGCGG CACCATTACT
CGCCAAATGC GTCGTTTGGG TAACTCCGTG GACTGGGAAC GTGAACGTTT CACCATGGAT
GAAGGCTTGT CCAACGCAGT TAAAGAAGTG TTCGTGCGCC TTCATAAAGA AGATCTGATT
TACCGTGGCA AGCGCCTGGT GAACTGGGAT CCGAAACTGC GCACTGCCAT TTCTGATCTG
GAAGTAGAAA ACCGCGAATC CAAAGGTTCC ATGTGGCACC TGCGTTATCC GCTGGCCGAT
GGTGCCAAGA CTGCGGAAGG CAAAGATTAT CTGGTGGTGG CGACCACCCG TCCAGAAACC
GTTTTGGGTG ATACTGGTGT CGCGGTTAAC CCGGAAGATC CACGTTATAA AGACCTGATC
GGCAAAGAAG TGATCCTGCC GCTGGTTGGC CGCCGCATTC CGATCCTCGG TGACGAACAC
GCCGATATGG AGAAGGGTAC TGGCTGTGTG AAAATCACTC CAGCCCACGA CTTTAATGAC
TATGAAGTCG GTAAGCGCCA TGCCCTGCCA ATGATCAACA TTCTGACCTT CGACGGGGAT
ATCCGCTCAG AAGCCGAGGT ATTTGATACC CACGGTGAAG CGACTGATGC ATTCAGTAAC
GCTATTCCTG CGCAGTTCCA AGGGCTAGAA CGTTTTGCTG CCCGTAAAGC GGTGGTCGCG
GAATTCGAGA AACTCGGTCT GTTGGAAGAG GTTAAACCTC ATGACCTGAC AGTACCTTAT
GGCGACCGTG GCGGCGTGGT TATCGAACCC ATGCTGACCG ATCAATGGTA CGTGCGCACT
GCCCCGCTGG CCAAAGTCGC GATTGAAGCC GTAGAGAACG GCGAGATCCA GTTCGTCCCT
AAACAGTACG AAAACATGTA TTACTCATGG ATGCGCGATA TCCAGGACTG GTGTATCTCA
CGTCAATTGT GGTGGGGCCA CCGTATTCCG GCCTGGTATG ACGAGCAGGG TAATGTGTAT
GTTGGCCGCG ACGAAGCCGA AGTGCGTCGC GACAATAATC TGGGCGCAGA GGTTGCTTTG
CGTCAGGACG AAGATGTGTT GGACACCTGG TTCTCATCCG GCCTGTGGAC ATTCTCTACA
CTGGGCTGGC CTGAACAAAC CGACGCACTG AAAACCTTCC ATCCGACCAG CGTGGTCGTC
AGTGGTTTTG ATATTATTTT CTTCTGGATT GCCCGCATGA TCATGCTGAC CATGCACTTT
ATGAAAGATG AAAATGGTAA ACCACAAGTG CCGTTCAAAA CGGTCTACAT GACCGGTCTG
ATCCGTGATG ACGAAGGGCA GAAAATGTCC AAGTCCAAAG GTAACGTGAT CGATCCACTG
GATATGGTTG ACGGTATCTC TTTGGAAGCG TTGCTGGAAA AACGTACCGG CAATATGATG
CAGCCACAGT TGGCGGAGAA AATTCGCAAG CGCACTGAAA AGCAGTTCCC GAACGGTATC
GAGCCGCACG GCACTGATGC GCTGCGCTTC ACGTTGGCGG CACTGGCCTC AACTGGCCGT
GATATCAACT GGGATATGAA ACGCCTGGAA GGGTATCGCA ACTTCTGTAA TAAGCTGTGG
AATGCCAGCC GTTTCGTGCT GATGAATACC GAAGGGCAGG ATTGTGGGCA GAACGGTGGC
GAAATGGTGT TATCACTGGC TGACCGCTGG ATTTTGGCGG AATTCAACCA GACCATCAAA
GCCTACCGTG AAGCGATGGA CACCTACCGC TTCGATCTGG CGGCCGGTAT TTTGTATGAA
TTTACCTGGA ACCAGTTCTG TGACTGGTAT CTGGAACTGA CCAAACCGGT GATGAACAGT
GGCTCTGAAG CTGAACTGCG AGGCACCCGC CACACGCTAA TTCAGGTGCT GGAAGCCTTG
CTGCGCTTGG CGCACCCCAT CATTCCTTAC ATCACTGAAA CTATCTGGCA GCGGGTGAAA
AACCTGAAAG GCATTACCGC AGACACGATT ATGTTGCAGC CTTTCCCGGA ATATGATGCC
AGCCAAGTCG ATGAACAAGC ACTCAGTGAT TTAGAGTGGA TTAAGCAAAC CATTATCGCG
GTGCGTAATA TCCGGGCGGA AATGAACATT GCACCGGGTA AACCACTTGA GGTCATGCTG
CGGGGTGCCA ACGCACAAGC ACAGCGTCGG GTGCTGGAAA ACCAGAGTTT TATCCAGTCA
TTGGCGCGCT TGTCCTCTCT CACCTTGCTA GCTGAAGGTG ATAAAGGCCC AGTATCGGTC
ACTAAATTGG TTGAAGGTGC TGAAGTGCTG ATCCCAATGG CAGGCCTGAT CGATAAAGCC
ACCGAGTTGG ATCGTCTGGC GAAGGAAGTG GCGAAACTGG ATGCTGAAAT TGAGCGCATC
GAAGGCAAAC TGGGTAACGA AGGTTTTGTG GCGCGTGCGC CAGAAGCGGT AGTTGCCAAA
GAGCGTGAAA GACTGGCCGC TTGTGCTGAA GCCAAACAGA AGTTAATTGA GCAGCAGGCA
ACTATCGCTG CACTATAA
 
Protein sequence
MENTPSHIDK TEPSLDKTYS PQEIEQPLYE HWEKQGYFKP NGDTSKESYC IMIPPPNVTG 
SLHMGHAFQQ TIMDTLIRYQ RMQGKNTLWQ AGTDHAGIAT QMVVERKIAA EEGKTRHDYG
RDAFIDKIWE WKGESGGTIT RQMRRLGNSV DWERERFTMD EGLSNAVKEV FVRLHKEDLI
YRGKRLVNWD PKLRTAISDL EVENRESKGS MWHLRYPLAD GAKTAEGKDY LVVATTRPET
VLGDTGVAVN PEDPRYKDLI GKEVILPLVG RRIPILGDEH ADMEKGTGCV KITPAHDFND
YEVGKRHALP MINILTFDGD IRSEAEVFDT HGEATDAFSN AIPAQFQGLE RFAARKAVVA
EFEKLGLLEE VKPHDLTVPY GDRGGVVIEP MLTDQWYVRT APLAKVAIEA VENGEIQFVP
KQYENMYYSW MRDIQDWCIS RQLWWGHRIP AWYDEQGNVY VGRDEAEVRR DNNLGAEVAL
RQDEDVLDTW FSSGLWTFST LGWPEQTDAL KTFHPTSVVV SGFDIIFFWI ARMIMLTMHF
MKDENGKPQV PFKTVYMTGL IRDDEGQKMS KSKGNVIDPL DMVDGISLEA LLEKRTGNMM
QPQLAEKIRK RTEKQFPNGI EPHGTDALRF TLAALASTGR DINWDMKRLE GYRNFCNKLW
NASRFVLMNT EGQDCGQNGG EMVLSLADRW ILAEFNQTIK AYREAMDTYR FDLAAGILYE
FTWNQFCDWY LELTKPVMNS GSEAELRGTR HTLIQVLEAL LRLAHPIIPY ITETIWQRVK
NLKGITADTI MLQPFPEYDA SQVDEQALSD LEWIKQTIIA VRNIRAEMNI APGKPLEVML
RGANAQAQRR VLENQSFIQS LARLSSLTLL AEGDKGPVSV TKLVEGAEVL IPMAGLIDKA
TELDRLAKEV AKLDAEIERI EGKLGNEGFV ARAPEAVVAK ERERLAACAE AKQKLIEQQA
TIAAL