Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0684 |
Symbol | valS |
ID | 4710313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 766491 |
End bp | 769244 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855147 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001002268 |
Protein GI | 121997481 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.980667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAAGA CGTACGACCC GAGCGCAATC GAGTCCCGAC TCTACGAGCT CTGGGAGCAG GGCGGCCACT TCGCCCCCTC CGGCGAGGGC AGCCCGTACT GCATCATGAT CCCGCCGCCG AACGTCACCG GCACCCTGCA CATGGGGCAC GCCTTCCAGG ACACGGTGAT GGACGCCCTG GTGCGCTACC ACCGCATGGA CGGCCACAAC ACCCTCTGGC AGCCGGGCAC CGACCACGCC GGGATCGCCA CGCAGATGGT CGTCGAGCGC CAGCTCGAGG CCGAGGGGCT GAGCCGCCAC GACCTGGGCC GCGAGCGCTT CCTGGAGCGA GTCTGGCAGT GGAAGGCCGA ATCCGGCGGC ACCATCCAGA ACCAGCTGCG GCGCATGGGC GCCTCGGTGG ACTGGCGCCG CGAGCGCTTC ACCATGGACG AGGGCCTCTC CGAGGCGGTT ACCGAGGTCT TCGTACGCCT CTACGAGGAG GATCTGCTCT ACCGCGGCGA GCGCCTGGTC AACTGGGATC CGGTGCTGCA CACCGCCGTC TCGGACCTGG AGGTGGTCTC CGCCGAGGAG CAGGGCCACA TCTGGCACAT GGTCTACCCG CTGGCCGACG GCAGCGGCTC GGTGGTGGTG GCCACCACCC GGCCGGAGAC CATGCTCGGC GACACCGCCG TGGCCGTTAA CCCGGAGGAC GAGCGCTACA CCCATCTGAT CGGCAAGAGC GTGCGCCTGC CCCTGGTCGG CCGCGAGATC CCGATCATCG GCGACGACTA CGTCGACCCG AGCTTCGGCA GCGGCTGCCT GAAGATCACC CCGGCCCACG ACTTCAACGA CTTCGCCGTC GGCGAGCGCC ACGACCTGCC GCGGATCAAC GTCCTCACCG AGGACGCCCG CATCAACACC AACGCCCCGG AGCGCTACCA GGGGCTGGAC CGCTACGAGG CCCGCAAGCA GATCGTCGAG GACCTGCGCA CCGAGGGCCT TCTGCAGCAG GTCGAGGACC ACAAACTGAT GGTCCCGCGC GGCGACCGCA GCGGCGCCGT CATTGAGCCG TACCTGACCT GGCAGTGGTT CGTGCGCGCC GAGCCGCTGG CCCGCCCGGC CATCGAGGCC GTCGAGGACG GGCGCATCCG CTTCATCCCC GGCAACTGGG ACAAGACCTA CTACGAGTGG CTGCGCAACA TCGAGGACTG GTGCATCTCG CGGCAGATCT GGTGGGGCCA CCGCATCCCG GCCTGGTACG ACGAGCAGGG CCACGTCTAC GTGGGCCGCT CCGAGGCCGA GGTCCGCGAG CGCCACGGCC TGGGCGACGC GCCGCTGACC CGCGACGACG ATGTCCTCGA CACCTGGTTC TCGTCGGCCC TGTGGCCCTT CTCCACCCTG GGCTGGCCCG AGCAGACCGA TGCCCTGCGC ACCTTCTACC CGACCTCGGT GCTGGTCACC GGCTTCGACA TCATCTTCTT CTGGGTCGCC CGGATGATCA TGTTCGGGCT GCACTTCATG GGCGAGGTAC CGTTCCGCGA GATCTACATC CACGGCCTGG TGCGCGACCC CGAAGGGCAG AAGATGTCCA AATCCAAGGG CAACGTCCTC GACCCGCTGG ACATCGTCGA CGGCATCGAT CTGGAATCCC TCGTCACCAA ACGCACCGCC GATATGATGC AGCCGCAGCT GGCTGAGCGC ATCGAGGCCA TGACCCGCCG CCACTACCCG GACGGCATCA AGGCCCACGG CACCGACGCC CTGCGCTTTA CCTTCGCCTC GCTGGCGACC ACCGGGCGGG ACGTGGTCTT CGACCTCGGC CGGGTCGAGG GCTACCGCAA CTTCTGCAAC AAGCTCTGGA ACGCCGCCCG CTACGTGCTG ATGAACACCG AGGGCGCCGA CTGCGGCAGC GGCGGGCCGG TGGAGCTGGG CGCCGCCGAG CGCTGGATCC GCTCGCGACT GGATCGCACC GTCGCCGAGG TCCGCCAGGC CTTCGCCGAC TACCGCCTGG ACCAGGCGGC GCAGGCCATC TACGAGTTCA CCTGGGACGA GTACTGCGAC TGGTACCTGG AGCTCTCCAA GCCGGCCCTG CGCGAGGGCA CCGCGGCCCA GGCCCGCGGC ACCCGGCAGA CGCTGATCCA GGTCCTCGAG GAGCTGCTGC GCCTGACCCA CCCGCTGATG CCGTTCATCA CCGAGGCGAT CTGGCAGCGC GTCGCCCCGG TCGCCGGTGC CCAGGGCGAG ACGATCATGC GCCAGCCCTT CCCGGCGGCG GATGCGCAGC GGCGCGACGC CACCGCCGAG GCCGAGATCG ACTGGGTCCA GCGGGTCATC CTCGGCGTGC GGCGCATCCG CGGCGAGATG GACATCTCGC CGAGCCAGCC GGTGCCGGTG CTGCTGCGCC ACGCCGGCGA ACAGGACCGG GCGCGACTCG CCGAGCACCA GGGCTTCGTC ACCACCCTCG CCCGGCTGGA GTCGATCCGG GTGCTGGAGG CCGACGAGCA GCCGCCGGAG TCGGCCCTCG CCCTGGCCGG CGAGATGGAG GTCCTGGTGC CCATGGCCGG CCTGGTCGAC AAGGAGGCCG AGCTCGCCCG CCTGGCCAAG GAGCGGACGC GGCTCGAGGG CGAGATCGAG CGCGCCGAGA AGAAACTCGG CAACGAGAGC TTCGTCGCCA AGGCCCCGGC CGAGGTGGTG GACAAGGAGC GCGCGAAGCT CAACGAGGCG CAGCAGGCCC TGGAGAAGGT CGCCGAACAA GAGCGGCGGG TGGAGGCGCT CTAA
|
Protein sequence | MEKTYDPSAI ESRLYELWEQ GGHFAPSGEG SPYCIMIPPP NVTGTLHMGH AFQDTVMDAL VRYHRMDGHN TLWQPGTDHA GIATQMVVER QLEAEGLSRH DLGRERFLER VWQWKAESGG TIQNQLRRMG ASVDWRRERF TMDEGLSEAV TEVFVRLYEE DLLYRGERLV NWDPVLHTAV SDLEVVSAEE QGHIWHMVYP LADGSGSVVV ATTRPETMLG DTAVAVNPED ERYTHLIGKS VRLPLVGREI PIIGDDYVDP SFGSGCLKIT PAHDFNDFAV GERHDLPRIN VLTEDARINT NAPERYQGLD RYEARKQIVE DLRTEGLLQQ VEDHKLMVPR GDRSGAVIEP YLTWQWFVRA EPLARPAIEA VEDGRIRFIP GNWDKTYYEW LRNIEDWCIS RQIWWGHRIP AWYDEQGHVY VGRSEAEVRE RHGLGDAPLT RDDDVLDTWF SSALWPFSTL GWPEQTDALR TFYPTSVLVT GFDIIFFWVA RMIMFGLHFM GEVPFREIYI HGLVRDPEGQ KMSKSKGNVL DPLDIVDGID LESLVTKRTA DMMQPQLAER IEAMTRRHYP DGIKAHGTDA LRFTFASLAT TGRDVVFDLG RVEGYRNFCN KLWNAARYVL MNTEGADCGS GGPVELGAAE RWIRSRLDRT VAEVRQAFAD YRLDQAAQAI YEFTWDEYCD WYLELSKPAL REGTAAQARG TRQTLIQVLE ELLRLTHPLM PFITEAIWQR VAPVAGAQGE TIMRQPFPAA DAQRRDATAE AEIDWVQRVI LGVRRIRGEM DISPSQPVPV LLRHAGEQDR ARLAEHQGFV TTLARLESIR VLEADEQPPE SALALAGEME VLVPMAGLVD KEAELARLAK ERTRLEGEIE RAEKKLGNES FVAKAPAEVV DKERAKLNEA QQALEKVAEQ ERRVEAL
|
| |