Gene YpsIP31758_3545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3545 
SymbolvalS 
ID5386484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4005341 
End bp4008238 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content52% 
IMG OID640866561 
Productvalyl-tRNA synthetase 
Protein accessionYP_001402499 
Protein GI153948685 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.562165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATA CACCTTCTCA TATCAACAAA ACTGAGCCGT CCCTCGATAA AACATACAGC 
CCGCAGGAAA TTGAACAGCC GCTGTATGAA CATTGGGAGA AACAGGGTTA TTTCAAACCA
AACGGCGATA CCAGCAAAGA AAGCTATTGC ATCATGATCC CACCGCCGAA CGTGACCGGC
AGCCTGCATA TGGGTCATGC ATTCCAGCAG ACCATCATGG ATACCTTGAT TCGCTATCAG
CGTATGCAGG GGAAAAATAC CCTATGGCAG GCAGGTACCG ATCATGCAGG TATCGCGACC
CAGATGGTTG TGGAACGCAA GATTGCCGCC GAAGAAGGCA AGACCCGCCA CGATTACGGC
CGTGATGCGT TTATCGATAA AATCTGGGAG TGGAAAGGCG AATCCGGCGG CACCATTACT
CGCCAAATGC GTCGTTTGGG TAACTCCGTG GACTGGGAAC GTGAACGTTT CACCATGGAT
GAAGGCTTGT CCAACGCAGT TAAAGAAGTG TTCGTGCGCC TTCATAAAGA AGATCTGATT
TACCGTGGCA AGCGCCTGGT GAACTGGGAT CCGAAACTGC GCACTGCCAT TTCTGATCTG
GAAGTAGAAA ACCGCGAATC CAAAGGTTCC ATGTGGCACC TGCGTTATCC GCTGGCCGAT
GGTGCCAAGA CTGCGGAAGG CAAAGATTAT CTGGTGGTGG CGACCACCCG TCCAGAAACC
GTTCTGGGTG ATACTGGTGT CGCGGTTAAC CCGGAAGATC CACGTTATAA AGATCTGATC
GGCAAAGAAG TGATCCTGCC GCTGGTTGGC CGCCGTATTC CGATCCTCGG TGACGAACAC
GCCGATATGG AGAAGGGTAC TGGCTGTGTG AAAATCACTC CAGCCCACGA CTTTAATGAC
TATGAAGTCG GTAAGCGCCA TGCCCTGCCA ATGATCAACA TTCTGACCTT CGACGGGGAT
ATCCGCTCAG AAGCCGAGGT ATTTGATACC CACGGTGAAG CGACTGATGC ATTTAGTAAC
GCTATTCCTG CGCAGTTCCA AGGGCTAGAA CGTTTTGCTG CCCGTAAAGC GGTGGTCGCG
GAATTCGAGA AACTCGGTCT GTTGGAAGAG GTTAAACCTC ATGACCTGAC AGTACCTTAT
GGCGACCGTG GCGGCGTGGT TATCGAACCC ATGCTGACCG ATCAATGGTA CGTGCGCACT
GCCCCGCTGG CCAAAGTCGC GATTGAAGCC GTAGAGAACG GCGAGATCCA GTTCGTCCCT
AAACAGTACG AAAACATGTA TTACTCATGG ATGCGCGATA TCCAGGACTG GTGTATCTCA
CGTCAATTGT GGTGGGGCCA CCGTATTCCG GCCTGGTATG ACGAGCAGGG TAATGTGTAT
GTTGGCCGCG ACGAAGCCGA AGTGCGCCGC GACAATAATC TGGGCGCCGA GGTTGCTTTG
CGTCAGGACG AAGATGTGTT GGACACCTGG TTCTCATCCG GCCTGTGGAC ATTCTCTACA
CTGGGCTGGC CTGAACAAAC CGACGCACTG AAAACCTTCC ATCCGACCAG CGTGGTCGTC
AGTGGTTTTG ATATTATTTT CTTCTGGATT GCCCGCATGA TCATGCTGAC CATGCACTTT
ATGAAAGATG AAAATGGTAA ACCACAGGTG CCGTTCAAAA CAGTCTACAT GACCGGTCTG
ATCCGTGATG ACGAAGGGCA GAAAATGTCC AAGTCCAAAG GTAACGTGAT CGACCCACTG
GATATGGTTG ACGGTATCTC TTTGGAAGCG TTGCTGGAAA AACGTACCGG CAATATGATG
CAGCCACAGT TGGCGGAGAA AATTCGCAAG CGCACTGAAA AGCAGTTCCC GAACGGTATC
GAGCCGCACG GCACTGATGC ACTGCGCTTC ACGTTGGCGG CACTGGCCTC AACTGGCCGT
GATATCAACT GGGATATGAA ACGCCTGGAA GGGTATCGCA ATTTCTGTAA TAAGCTGTGG
AATGCCAGCC GTTTCGTGCT GATGAATACC GAAGGGCAGG ATTGTGGGCA GAACGGTGGC
GAAATGGTGT TATCACTGGC TGACCGCTGG ATTTTGGCGG AATTCAACCA GACCATCAAA
GCCTACCGTG AAGCGATGGA CACCTACCGC TTCGATCTGG CGGCCGGTAT TCTGTATGAA
TTCACCTGGA ACCAGTTCTG TGACTGGTAT CTGGAACTGA CCAAGCCGGT GATGAACAGT
GGCTCTGAAG CTGAACTGCG AGGCACTCGC CACACGCTGA TTCAGGTGCT GGAAGCCTTG
CTGCGCTTGG CGCACCCCAT CATTCCTTAC ATCACTGAAA CTATCTGGCA GCGGGTGAAA
AACCTGAAAG GCATTACTGC AGACACGATT ATGTTGCAGC CTTTCCCAGA ATATGATGCC
AGCCAAGTCG ATGAACAAGC ACTCAGTGAT TTAGAGTGGA TTAAGCAAAC CATTATCGCG
GTGCGTAATA TCCGGGCGGA AATGAACATT GCACCGGGTA AACCACTTGA GGTCATGCTG
CGGGGTGCCA ACGCACAAGC ACAGCGTCGG GTGCTGGAAA ACCAGAGTTT TATCCAGTCA
TTGGCGCGCT TGTCCTCTCT CACCTTGCTA GCTGAAGGTG ATAAAGGCCC AGTATCGGTC
ACTAAATTGG TTGAAGGTGC TGAAGTGCTG ATCCCAATGG CAGGCCTGAT CGATAAAGCC
ACCGAGTTGG ATCGTCTGGC GAAGGAAGTG GCGAAACTGG ATGCTGAAAT TGAGCGCATC
GAAGGCAAAC TGGGTAACGA AGGTTTTGTG GCGCGGGCGC CAGAAGCGGT AGTTGCCAAA
GAGCGTGAAA GACTGGCCGC TTGTGCTGAA GCCAAACAGA AGTTAATTGA GCAGCAGGCA
ACTATCGCTG CACTATAA
 
Protein sequence
MENTPSHINK TEPSLDKTYS PQEIEQPLYE HWEKQGYFKP NGDTSKESYC IMIPPPNVTG 
SLHMGHAFQQ TIMDTLIRYQ RMQGKNTLWQ AGTDHAGIAT QMVVERKIAA EEGKTRHDYG
RDAFIDKIWE WKGESGGTIT RQMRRLGNSV DWERERFTMD EGLSNAVKEV FVRLHKEDLI
YRGKRLVNWD PKLRTAISDL EVENRESKGS MWHLRYPLAD GAKTAEGKDY LVVATTRPET
VLGDTGVAVN PEDPRYKDLI GKEVILPLVG RRIPILGDEH ADMEKGTGCV KITPAHDFND
YEVGKRHALP MINILTFDGD IRSEAEVFDT HGEATDAFSN AIPAQFQGLE RFAARKAVVA
EFEKLGLLEE VKPHDLTVPY GDRGGVVIEP MLTDQWYVRT APLAKVAIEA VENGEIQFVP
KQYENMYYSW MRDIQDWCIS RQLWWGHRIP AWYDEQGNVY VGRDEAEVRR DNNLGAEVAL
RQDEDVLDTW FSSGLWTFST LGWPEQTDAL KTFHPTSVVV SGFDIIFFWI ARMIMLTMHF
MKDENGKPQV PFKTVYMTGL IRDDEGQKMS KSKGNVIDPL DMVDGISLEA LLEKRTGNMM
QPQLAEKIRK RTEKQFPNGI EPHGTDALRF TLAALASTGR DINWDMKRLE GYRNFCNKLW
NASRFVLMNT EGQDCGQNGG EMVLSLADRW ILAEFNQTIK AYREAMDTYR FDLAAGILYE
FTWNQFCDWY LELTKPVMNS GSEAELRGTR HTLIQVLEAL LRLAHPIIPY ITETIWQRVK
NLKGITADTI MLQPFPEYDA SQVDEQALSD LEWIKQTIIA VRNIRAEMNI APGKPLEVML
RGANAQAQRR VLENQSFIQS LARLSSLTLL AEGDKGPVSV TKLVEGAEVL IPMAGLIDKA
TELDRLAKEV AKLDAEIERI EGKLGNEGFV ARAPEAVVAK ERERLAACAE AKQKLIEQQA
TIAAL