Gene YpAngola_A4045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4045 
SymbolvalS 
ID5802524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4301470 
End bp4304367 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content52% 
IMG OID641341829 
Productvalyl-tRNA synthetase 
Protein accessionYP_001608336 
Protein GI162418871 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.230085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATA CACCTTCTCA TATCAACAAA ACTGAGCCGT CCCTCGATAA AACATACAGC 
CCGCAGGAAA TTGAGCAGCC GCTGTATGAA CATTGGGAGA AACAGGGTTA TTTCAAACCA
AACGGCGATA CCAGCAAAGA AAGCTACTGC ATCATGATCC CGCCGCCGAA CGTGACCGGC
AGCCTGCATA TGGGTCATGC ATTCCAGCAG ACAATCATGG ATACCTTGAT TCGCTATCAG
CGTATGCAGG GGAAAAATAC CCTCTGGCAG GCAGGTACTG ATCATGCAGG TATCGCGACC
CAGATGGTTG TGGAACGCAA GATTGCCGCC GAAGAAGGCA AGACCCGCCA CGATTACGGC
CGTGATGCGT TTATCGATAA AATCTGGGAG TGGAAAGGCG AATCCGGCGG CACCATTACT
CGCCAAATGC GTCGTTTGGG TAACTCCGTG GACTGGGAAC GTGAACGTTT CACTATGGAT
GAAGGCTTGT CCAACGCAGT TAAAGAAGTG TTCGTGCGCC TTCATAAAGA AGATCTGATT
TACCGTGGCA AGCGCCTGGT GAACTGGGAT CCGAAACTGC GTACTGCCAT TTCTGATCTG
GAAGTAGAAA ACCGCGAATC CAAAGGTTCC ATGTGGCACC TGCGTTATCC GCTGGCCGAT
GGTGCCAAGA CTGCGGAAGG CAAAGATTAT CTGGTGGTGG CAACCACCCG TCCAGAAACC
GTTCTGGGTG ATACTGGTGT CGCGGTTAAC CCGGAAGATC CACGTTATAA AGATCTGATC
GGCAAAGAAG TGATCCTGCC GCTGGTTGGC CGCCGTATTC CGATCCTCGG TGACGAACAC
GCCGATATGG AGAAGGGCAC CGGTTGTGTG AAAATCACCC CAGCCCACGA CTTTAATGAC
TATGAAGTCG GTAAGCGCCA TGCCCTGCCA ATGATCAACA TTCTGACCTT CGACGGGGAT
ATCCGCTCAG AAGCCGAGGT ATTTGATACC CACGGTGAAG CAACTGATGC ATTCAGTAAC
GCTATTCCTG CGCAGTTCCA AGGGCTAGAA CGTTTTGCTG CCCGTAAAGC GGTGGTCGCG
GAATTCGAGA AACTCGGTCT GTTGGAAGAG GTTAAACCTC ATGACCTGAC AGTACCTTAT
GGCGACCGTG GCGGCGTGGT TATCGAACCC ATGCTGACCG ATCAATGGTA CGTGCACACC
GCCCCGCTGG CCAAAGTCGC GATTGAAGCC GTAGAGAACG GCGAGATCCA GTTCGTCCCT
AAACAGTACG AAAACATGTA TTACTCATGG ATGCGCGATA TCCAGGACTG GTGTATCTCA
CGTCAATTGT GGTGGGGCCA CCGTATTCCG GCCTGGTATG ACGAGCAGGG TAATGTGTAT
GTTGGCCGCG ACGAAGCCGA AGTGCGTCGC GACAATAATC TGGGCGCAGA GGTTGCTTTG
CGTCAGGACG AAGATGTGTT GGACACCTGG TTCTCATCCG GCCTGTGGAC ATTCTCTACA
CTGGGCTGGC CTGAACAAAC CGACGCACTG AAAACCTTCC ATCCGACCAG CGTGGTCGTC
AGTGGTTTTG ATATTATTTT CTTCTGGATT GCCCGCATGA TCATGCTGAC CATGCACTTT
ATGAAAGATG AAAATGGTAA ACCACAGGTG CCGTTCAAAA CGGTCTACAT GACCGGTCTG
ATCCGTGATG ACGAAGGGCA GAAAATGTCC AAGTCCAAAG GTAACGTGAT CGATCCACTG
GATATGGTTG ACGGTATCTC TTTGGAAGCG TTGCTGGAAA AACGTACCGG CAATATGATG
CAGCCACAGT TGGCGGAGAA AATTCGCAAG CGCACTGAAA AGCAGTTCCC GAACGGTATC
GAGCCGCACG GCACTGATGC ACTGCGCTTC ACGTTGGCGG CACTGGCCTC AACTGGCCGT
GATATCAACT GGGATATGAA ACGCCTGGAA GGGTATCGCA ACTTCTGTAA TAAGCTGTGG
AATGCCAGCC GTTTCGTGCT GATGAATACC GAAGGGCAGG ATTGTGGGCA GAACGGTGGC
GAAATGGTGT TATCACTGGC TGACCGCTGG ATTTTGGCGG AATTCAACCA GACCATCAAA
GCCTACCGTG AAGCGATGGA CACCTACCGC TTCGATCTGG CGGCCGGTAT TCTGTATGAA
TTCACCTGGA ACCAGTTCTG TGACTGGTAT CTGGAACTGA CCAAGCCGGT GATGAACAGT
GGCTCTGAAG CTGAACTGCG AGGCACTCGC CACACGCTGA TTCAGGTGCT GGAAGCCTTG
CTACGCTTGG CGCACCCCAT CATTCCTTAC ATCACTGAAA CTATCTGGCA GCGGGTGAAA
AACCTGAAAG GCATTACTGC AGACACGATT ATGTTGCAGC CTTTCCCAGA ATATGATGCC
AGCCAAGTCG ATGAACAAGC ACTCAGTGAT TTAGAGTGGA TTAAGCAAAC CATTATCGCG
GTGCGTAATA TCCGGGCGGA AATGAACATT GCACCGGGTA AACCACTTGA GGTCATGCTG
CGGGGTGCCA ACGCACAAGC ACAGCGTCGG GTGCTGGAAA ACCAGAGTTT TATCCAGTCA
TTGGCGCGCT TGTCCTCTCT CACCTTGCTA GCTGAAGGTG ATAAAGGCCC AGTATCGGTC
ACTAAATTGG TTGAAGGTGC TGAAGTGCTG ATCCCAATGG CAGGCCTGAT CGATAAAGCC
ACCGAGTTGG ATCGTCTGGC GAAGGAAGTG GCGAAACTGG ATGCTGAAAT TGAGCGCATC
GAAGGCAAAC TGGGTAACGA AGGTTTTGTG GCGCGGGCGC CAGAAGCGGT AGTTGCCAAA
GAGCGTGAAA GACTGGCCGC TTGTGCTGAA GCCAAACAGA AGTTAATTGA GCAGCAGGCA
ACTATCGCTG CACTATAA
 
Protein sequence
MENTPSHINK TEPSLDKTYS PQEIEQPLYE HWEKQGYFKP NGDTSKESYC IMIPPPNVTG 
SLHMGHAFQQ TIMDTLIRYQ RMQGKNTLWQ AGTDHAGIAT QMVVERKIAA EEGKTRHDYG
RDAFIDKIWE WKGESGGTIT RQMRRLGNSV DWERERFTMD EGLSNAVKEV FVRLHKEDLI
YRGKRLVNWD PKLRTAISDL EVENRESKGS MWHLRYPLAD GAKTAEGKDY LVVATTRPET
VLGDTGVAVN PEDPRYKDLI GKEVILPLVG RRIPILGDEH ADMEKGTGCV KITPAHDFND
YEVGKRHALP MINILTFDGD IRSEAEVFDT HGEATDAFSN AIPAQFQGLE RFAARKAVVA
EFEKLGLLEE VKPHDLTVPY GDRGGVVIEP MLTDQWYVHT APLAKVAIEA VENGEIQFVP
KQYENMYYSW MRDIQDWCIS RQLWWGHRIP AWYDEQGNVY VGRDEAEVRR DNNLGAEVAL
RQDEDVLDTW FSSGLWTFST LGWPEQTDAL KTFHPTSVVV SGFDIIFFWI ARMIMLTMHF
MKDENGKPQV PFKTVYMTGL IRDDEGQKMS KSKGNVIDPL DMVDGISLEA LLEKRTGNMM
QPQLAEKIRK RTEKQFPNGI EPHGTDALRF TLAALASTGR DINWDMKRLE GYRNFCNKLW
NASRFVLMNT EGQDCGQNGG EMVLSLADRW ILAEFNQTIK AYREAMDTYR FDLAAGILYE
FTWNQFCDWY LELTKPVMNS GSEAELRGTR HTLIQVLEAL LRLAHPIIPY ITETIWQRVK
NLKGITADTI MLQPFPEYDA SQVDEQALSD LEWIKQTIIA VRNIRAEMNI APGKPLEVML
RGANAQAQRR VLENQSFIQS LARLSSLTLL AEGDKGPVSV TKLVEGAEVL IPMAGLIDKA
TELDRLAKEV AKLDAEIERI EGKLGNEGFV ARAPEAVVAK ERERLAACAE AKQKLIEQQA
TIAAL