Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_3006 |
Symbol | valS |
ID | 5456209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 3207194 |
End bp | 3209923 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640878594 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001414270 |
Protein GI | 154253446 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.405828 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACA AGACCTTCGA CCCGCAGGAT GCCGAAGCCC GGCTTTACAA GCTCTGGGAG GAAAGCGGCG CTTTCCAGGC CGGCGACCCC GCGCGGGTAG AAGCGGGCGC CAAGCCCTAT TGCATCGTCA TCCCGCCGCC CAACGTCACC GGCAGCCTGC ATATGGGCCA TGCGCTCAAC AACACGCTGC AGGACGTGCT GATCCGCTTC GAGCGGATGC GCGGCAAGGA CGTGCTCTGG CAGCCGGGCC TCGACCATGC GGGCATCGCC ACTCAGATGA TCGTCGAGCG CCAGCTTGAA GGCGAGGGCA ATATCCGCCG CCGCGACATG GGCCGCGATG CCTTCATCGA GCGCATCTGG AAATGGAAAG GCGAGAGCGG CGGCACCATC ATCGGCCAGC TCCGCCGCCT CGGCGCCTCC TGCGACTGGA GCCGCGAGCG TTTCACGCTG GACGAAGGTC TGTCGAAAGC CGTGCTGAAA GTTTTCGTGG AGCTTTACCG CGAAGGCCTC ATCTATCGCG ACAAGCGCCT CGTCAACTGG CACCCGGAAC TTCAGACCGC CGTCTCCGAT CTCGAAGTCG AGAATATCGA GGTGAAGGGC CACTTCTGGC ATTTCCGCTA TCCGCTCGCG GATGGCGTCA CCTATGAGTG GCCGCTTTTC GACGAGGAGG GTAATCCCGC CGGCACCGAA ACGCGCGACT ACATCGTCGT CGCCACCACG CGCCCCGAAA CGATGCTGGG CGACACCGGT GTCGCCGTGA ACCCGGAAGA CGAGCGTTAC AAGTCCCTCA TCGGCAAGTT CGTGGAGCTG CCGCTGGTCG GCCGCCGCAT TCCGATCGTC GGCGACGAAC ACGCGGACCC GGAGCAGGGT TCCGGTGCGG TGAAGATCAC GCCCGCACAT GACTTCAACG ACTTCGAGGT CGGTAAGCGG CAGGGACTTG AGCAGATCAA CATTCTCAAC CCCGACGGCA AGCTCAACGA CGAAGTGCCG GAGCCTTATC GCGGCATGGA CCGTTTCAAG GCGCGCAAGC AGGTCGTCGT TGACATCGAG GCGCTCGGTC TCCTCGACCG CATCGAGGAC AAGATCGTCA TGGTTCCGCA TGACGAGAAA TCAAAACTGG TCGTCATCGA GCCCTATCTC ACCGACCAGT GGTATGTCGA CGCGGCGACG CTGGCGAAGC CCGCCATTGA AGCGGTGGAG CGCGGCCAGA CCGTCTTCGT GCCGAAGAAC TGGGAAAAGA CCTATTTCGA GTGGATGCGC AACATCCAGC CCTGGTGCAT CTCGCGCCAG CTCTGGTGGG GGCATCGCAT TCCGGCATGG TATGGGCCGG ACGGTCATGT CTTCGTCGCC TATAGCGATG AAGAGGCCGC CGCCGAAGCG GAAAAGCACT ACGGCAAAAG CGTCGAACTC ACGCGCGACG AAGACGTGCT CGACACATGG TTCTCCTCCG GCCTCTGGCC CTTCTCCACG CTCGGCTGGC CGGACAAGAC GCCGGAACTC GCGCGCTATT ACAAGACCGA CGTGCTCGTC ACCGGCTTCG ACATCATCTT CTTCTGGGTC GCCCGCATGA TGATGATGGG TCTGCACTTC ATGCAGGAAG TGCCGTTCCA CACCGTCTAT ATCCATGCGC TGGTCCGCGA CGAGAAGGGC CAGAAGATGT CGAAGACGAA GGGCAACGTC ATCGACCCGC TGAAACTCAT CGACGAATAC GGCGCCGACG CGCTCCGCTT CACGCTGGCC GCCATGGCCG CGCAGGGCCG CGACATCAAG CTTGCGACAT CCCGCGTCGA AGGTTACCGC AATTTCGGCA CCAAGCTCTG GAACGCCGCG CGCTTCTGCG AGATGAACGA ATGCGCCCGC GTAGAGGGTT TCGATCCCGC GAAGCTCAGC CAGACGGTCA ACAAGTGGAT CGTCGGCGAG GTTGCGCGCA CCGCCGCCGA AATCACGGAA GCCATCGAAG CCTACCGCTT CAACGACGCG GCGAACGCGG CCTACAAATT CGTCTGGAAC GTCTTCTGCG ACTGGTATCT CGAATTCATC AAGCCGCTTT TGATGGGCGA CAATGAGGAA GCGAAAGCCG AAACCCGCGC CACGGCCGCC TGGGTTCTCG ACCAGATATT GCTGATGCTC CATCCCTTCA TGCCCTTCGT CACCGAAGAG CTCTGGCAGC GCACGGGCGA AGTTGGCCCG AAGCGCGAGA CGCTGCTCGT CAAGGCGGCA TGGCCGCTTC ACACAGGGCT CGGCGATGCC GCGGCCGACA AGGAGATGGA TTGGGTGATC CGCTTCATCA CCGAAATCCG TTCCGTCCGT GCCGAGATGA ATGTGCCGGC GGGCGCCAAG ATCGCGCTTC TCATCAAGGA TGCGAGCGAG GAAAGCCTCG CGCGCTTCGA GCGTCACCGC GACCTCATCA TGCGCCTCGC GCGTCTCGAA AGCGCTGTGG TGACGACATC CGTTCCGGAA GGCGCGCTTC AGCTCGTGCT GGACGAAGCG ACGCTGATCC TGCCGCTCGC AAACCTCATC GACGTGGCCG CCGAAACCGC GCGTCTCCGC AAGGAACTCG GCAAGCTCGA AGACGAAGTG AAGAAGATCG ACGCGAAGCT TGGCAACGCC AAATTCTTGG CGGGTGCGCC GGAGCAGGTC GTCGAGGAGC AGCGCGAACG CAAGGCCGAC GCACAAGCGG CGATGGCGAA GTTCAACGAG GCGCTGAAGC GGCTCGCGGG CGCTGCCTGA
|
Protein sequence | MLDKTFDPQD AEARLYKLWE ESGAFQAGDP ARVEAGAKPY CIVIPPPNVT GSLHMGHALN NTLQDVLIRF ERMRGKDVLW QPGLDHAGIA TQMIVERQLE GEGNIRRRDM GRDAFIERIW KWKGESGGTI IGQLRRLGAS CDWSRERFTL DEGLSKAVLK VFVELYREGL IYRDKRLVNW HPELQTAVSD LEVENIEVKG HFWHFRYPLA DGVTYEWPLF DEEGNPAGTE TRDYIVVATT RPETMLGDTG VAVNPEDERY KSLIGKFVEL PLVGRRIPIV GDEHADPEQG SGAVKITPAH DFNDFEVGKR QGLEQINILN PDGKLNDEVP EPYRGMDRFK ARKQVVVDIE ALGLLDRIED KIVMVPHDEK SKLVVIEPYL TDQWYVDAAT LAKPAIEAVE RGQTVFVPKN WEKTYFEWMR NIQPWCISRQ LWWGHRIPAW YGPDGHVFVA YSDEEAAAEA EKHYGKSVEL TRDEDVLDTW FSSGLWPFST LGWPDKTPEL ARYYKTDVLV TGFDIIFFWV ARMMMMGLHF MQEVPFHTVY IHALVRDEKG QKMSKTKGNV IDPLKLIDEY GADALRFTLA AMAAQGRDIK LATSRVEGYR NFGTKLWNAA RFCEMNECAR VEGFDPAKLS QTVNKWIVGE VARTAAEITE AIEAYRFNDA ANAAYKFVWN VFCDWYLEFI KPLLMGDNEE AKAETRATAA WVLDQILLML HPFMPFVTEE LWQRTGEVGP KRETLLVKAA WPLHTGLGDA AADKEMDWVI RFITEIRSVR AEMNVPAGAK IALLIKDASE ESLARFERHR DLIMRLARLE SAVVTTSVPE GALQLVLDEA TLILPLANLI DVAAETARLR KELGKLEDEV KKIDAKLGNA KFLAGAPEQV VEEQRERKAD AQAAMAKFNE ALKRLAGAA
|
| |