Gene Information |
Locus tag | YpAngola_A4045 |
Symbol | valS |
ID | 5802524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 4301470 |
End bp | 4304367 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641341829 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001608336 |
Protein GI | 162418871 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
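
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4301470
end_bp = 4304367
reported_gene_length_bp = 2898
reported_protein_length_aa = 965

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)        # 2898
print(gene_length_bp % 3)    # 0, i.e. a whole number of codons
print(protein_length_aa)     # 965
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.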
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.230085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAATA CACCTTCTCA TATCAACAAA ACTGAGCCGT CCCTCGATAA AACATACAGC CCGCAGGAAA TTGAGCAGCC GCTGTATGAA CATTGGGAGA AACAGGGTTA TTTCAAACCA AACGGCGATA CCAGCAAAGA AAGCTACTGC ATCATGATCC CGCCGCCGAA CGTGACCGGC AGCCTGCATA TGGGTCATGC ATTCCAGCAG ACAATCATGG ATACCTTGAT TCGCTATCAG CGTATGCAGG GGAAAAATAC CCTCTGGCAG GCAGGTACTG ATCATGCAGG TATCGCGACC CAGATGGTTG TGGAACGCAA GATTGCCGCC GAAGAAGGCA AGACCCGCCA CGATTACGGC CGTGATGCGT TTATCGATAA AATCTGGGAG TGGAAAGGCG AATCCGGCGG CACCATTACT CGCCAAATGC GTCGTTTGGG TAACTCCGTG GACTGGGAAC GTGAACGTTT CACTATGGAT GAAGGCTTGT CCAACGCAGT TAAAGAAGTG TTCGTGCGCC TTCATAAAGA AGATCTGATT TACCGTGGCA AGCGCCTGGT GAACTGGGAT CCGAAACTGC GTACTGCCAT TTCTGATCTG GAAGTAGAAA ACCGCGAATC CAAAGGTTCC ATGTGGCACC TGCGTTATCC GCTGGCCGAT GGTGCCAAGA CTGCGGAAGG CAAAGATTAT CTGGTGGTGG CAACCACCCG TCCAGAAACC GTTCTGGGTG ATACTGGTGT CGCGGTTAAC CCGGAAGATC CACGTTATAA AGATCTGATC GGCAAAGAAG TGATCCTGCC GCTGGTTGGC CGCCGTATTC CGATCCTCGG TGACGAACAC GCCGATATGG AGAAGGGCAC CGGTTGTGTG AAAATCACCC CAGCCCACGA CTTTAATGAC TATGAAGTCG GTAAGCGCCA TGCCCTGCCA ATGATCAACA TTCTGACCTT CGACGGGGAT ATCCGCTCAG AAGCCGAGGT ATTTGATACC CACGGTGAAG CAACTGATGC ATTCAGTAAC GCTATTCCTG CGCAGTTCCA AGGGCTAGAA CGTTTTGCTG CCCGTAAAGC GGTGGTCGCG GAATTCGAGA AACTCGGTCT GTTGGAAGAG GTTAAACCTC ATGACCTGAC AGTACCTTAT GGCGACCGTG GCGGCGTGGT TATCGAACCC ATGCTGACCG ATCAATGGTA CGTGCACACC GCCCCGCTGG CCAAAGTCGC GATTGAAGCC GTAGAGAACG GCGAGATCCA GTTCGTCCCT AAACAGTACG AAAACATGTA TTACTCATGG ATGCGCGATA TCCAGGACTG GTGTATCTCA CGTCAATTGT GGTGGGGCCA CCGTATTCCG GCCTGGTATG ACGAGCAGGG TAATGTGTAT GTTGGCCGCG ACGAAGCCGA AGTGCGTCGC GACAATAATC TGGGCGCAGA GGTTGCTTTG CGTCAGGACG AAGATGTGTT GGACACCTGG TTCTCATCCG GCCTGTGGAC ATTCTCTACA CTGGGCTGGC CTGAACAAAC CGACGCACTG AAAACCTTCC ATCCGACCAG CGTGGTCGTC AGTGGTTTTG ATATTATTTT CTTCTGGATT GCCCGCATGA TCATGCTGAC CATGCACTTT ATGAAAGATG AAAATGGTAA ACCACAGGTG CCGTTCAAAA CGGTCTACAT GACCGGTCTG ATCCGTGATG ACGAAGGGCA GAAAATGTCC AAGTCCAAAG GTAACGTGAT CGATCCACTG GATATGGTTG ACGGTATCTC TTTGGAAGCG TTGCTGGAAA AACGTACCGG CAATATGATG 
CAGCCACAGT TGGCGGAGAA AATTCGCAAG CGCACTGAAA AGCAGTTCCC GAACGGTATC GAGCCGCACG GCACTGATGC ACTGCGCTTC ACGTTGGCGG CACTGGCCTC AACTGGCCGT GATATCAACT GGGATATGAA ACGCCTGGAA GGGTATCGCA ACTTCTGTAA TAAGCTGTGG AATGCCAGCC GTTTCGTGCT GATGAATACC GAAGGGCAGG ATTGTGGGCA GAACGGTGGC GAAATGGTGT TATCACTGGC TGACCGCTGG ATTTTGGCGG AATTCAACCA GACCATCAAA GCCTACCGTG AAGCGATGGA CACCTACCGC TTCGATCTGG CGGCCGGTAT TCTGTATGAA TTCACCTGGA ACCAGTTCTG TGACTGGTAT CTGGAACTGA CCAAGCCGGT GATGAACAGT GGCTCTGAAG CTGAACTGCG AGGCACTCGC CACACGCTGA TTCAGGTGCT GGAAGCCTTG CTACGCTTGG CGCACCCCAT CATTCCTTAC ATCACTGAAA CTATCTGGCA GCGGGTGAAA AACCTGAAAG GCATTACTGC AGACACGATT ATGTTGCAGC CTTTCCCAGA ATATGATGCC AGCCAAGTCG ATGAACAAGC ACTCAGTGAT TTAGAGTGGA TTAAGCAAAC CATTATCGCG GTGCGTAATA TCCGGGCGGA AATGAACATT GCACCGGGTA AACCACTTGA GGTCATGCTG CGGGGTGCCA ACGCACAAGC ACAGCGTCGG GTGCTGGAAA ACCAGAGTTT TATCCAGTCA TTGGCGCGCT TGTCCTCTCT CACCTTGCTA GCTGAAGGTG ATAAAGGCCC AGTATCGGTC ACTAAATTGG TTGAAGGTGC TGAAGTGCTG ATCCCAATGG CAGGCCTGAT CGATAAAGCC ACCGAGTTGG ATCGTCTGGC GAAGGAAGTG GCGAAACTGG ATGCTGAAAT TGAGCGCATC GAAGGCAAAC TGGGTAACGA AGGTTTTGTG GCGCGGGCGC CAGAAGCGGT AGTTGCCAAA GAGCGTGAAA GACTGGCCGC TTGTGCTGAA GCCAAACAGA AGTTAATTGA GCAGCAGGCA ACTATCGCTG CACTATAA
|
Protein sequence | MENTPSHINK TEPSLDKTYS PQEIEQPLYE HWEKQGYFKP NGDTSKESYC IMIPPPNVTG SLHMGHAFQQ TIMDTLIRYQ RMQGKNTLWQ AGTDHAGIAT QMVVERKIAA EEGKTRHDYG RDAFIDKIWE WKGESGGTIT RQMRRLGNSV DWERERFTMD EGLSNAVKEV FVRLHKEDLI YRGKRLVNWD PKLRTAISDL EVENRESKGS MWHLRYPLAD GAKTAEGKDY LVVATTRPET VLGDTGVAVN PEDPRYKDLI GKEVILPLVG RRIPILGDEH ADMEKGTGCV KITPAHDFND YEVGKRHALP MINILTFDGD IRSEAEVFDT HGEATDAFSN AIPAQFQGLE RFAARKAVVA EFEKLGLLEE VKPHDLTVPY GDRGGVVIEP MLTDQWYVHT APLAKVAIEA VENGEIQFVP KQYENMYYSW MRDIQDWCIS RQLWWGHRIP AWYDEQGNVY VGRDEAEVRR DNNLGAEVAL RQDEDVLDTW FSSGLWTFST LGWPEQTDAL KTFHPTSVVV SGFDIIFFWI ARMIMLTMHF MKDENGKPQV PFKTVYMTGL IRDDEGQKMS KSKGNVIDPL DMVDGISLEA LLEKRTGNMM QPQLAEKIRK RTEKQFPNGI EPHGTDALRF TLAALASTGR DINWDMKRLE GYRNFCNKLW NASRFVLMNT EGQDCGQNGG EMVLSLADRW ILAEFNQTIK AYREAMDTYR FDLAAGILYE FTWNQFCDWY LELTKPVMNS GSEAELRGTR HTLIQVLEAL LRLAHPIIPY ITETIWQRVK NLKGITADTI MLQPFPEYDA SQVDEQALSD LEWIKQTIIA VRNIRAEMNI APGKPLEVML RGANAQAQRR VLENQSFIQS LARLSSLTLL AEGDKGPVSV TKLVEGAEVL IPMAGLIDKA TELDRLAKEV AKLDAEIERI EGKLGNEGFV ARAPEAVVAK ERERLAACAE AKQKLIEQQA TIAAL
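
The gene sequence as displayed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the Strand field is "-". As a rough consistency check between the two sequence fields, the sketch below translates the first and last few codons with the codon assignments of translation table 11, which for elongation codons are the same as the standard code. The helper `translate` and the table construction are illustrative, not part of the database, and only short excerpts of the sequence are used.

```python
# NCBI-style compact codon table: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence, copied from the record.
first_30 = "ATGGAAAATA CACCTTCTCA TATCAACAAA"
last_18 = "ACTATCGCTG CACTATAA"

print(translate(first_30))  # MENTPSHINK
print(translate(last_18))   # TIAAL
```

Both excerpts match the corresponding ends of the protein sequence (MENTPSHINK at the start, TIAAL at the end). Pasting the full gene sequence into `translate` would extend the same check to all 965 residues.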