Gene Information |
Locus tag | PCC8801_0678 |
Symbol | valS |
ID | 7105111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 703008 |
End bp | 705737 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643473776 |
Product | valyl-tRNA synthetase |
Protein accession | YP_002370919 |
Protein GI | 218245548 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
| |
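The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Those assumptions are not stated in the record, but they agree with its numbers. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for PCC8801_0678 (valS).
length_bp = gene_length(703008, 705737)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values match the Gene Length (2730 bp) and Protein Length (909 aa) fields.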
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCAA ACGTACCTGA ATTACCCACC CAATACGACC CCAAAACCAC CGAAGCTAAA TGGCAAGAAG CGTGGGAAAC CCACCAAGTC TTTAAAGCAG ACACCAACCA CCCAGGAACC CCCTACAGCA TCGTTATCCC CCCTCCCAAC GTCACGGGAA GCTTGCACAT GGGTCATGCC TTTGAAGACT GTTTAATGGA TGTTCTGATG CGCTATCATC GGATGTGCGG ACACAATACC CTCTGTCTCC CTGGAACTGA TCACGCGAGT ATCGCGGTTC ATACCATCCT CGATCGCCAA CTCAAAGCAG AAGGCAAAAC CCGTTATGAT ATGGGTCGGG AAAAATTCCT CGAAAAAGCC TGGCAATGGA AAGAAGAATC AGGCGGTACA ATTGTTAATC AATTAAAGCG AATGGGACTG TCTGCGGACT GGTCACGGGA ACGATTTACC CTCGATGAAG GACTGTCTAA AGCGGTTAGA AAAGCCTTTA TTCAACTCTA TGAAGCAGGG TTAATTTATA GAGGAAATTA CCTAGTTAAT TGGTGTCCTG CGTCTCTTTC TGCTGTCTCA GATTTAGAAG TAGAAAGCAA AGAAGTAGAC GGACATCTTT GGCATTTTCG CTACCCCTTA ACCGATGGAA CGGGGTATAT AGAAGTGGCA ACCACACGAC CCGAAACCAT GTTAGGAGAT ACGGGAGTGG CAGTTAATCC TAATGATAAA CGCTACCAAA GTTTGATTGG AAAAACCCTA ACCTTACCCC TGGTTGGTCG AGAAATTCCT ATCTTTGCCG ATGAGTTAGT AGACCCCGAA TTTGGAACAG GATGCGTTAA AGTTACCCCC GCGCATGATC CCAATGACTT CGAGATGGGA AACCGTCATA ACTTGCCCTT TATTAATATT ATGAATAAAG ACGGCACACT CAATGAAAAT GCCGGAATTT TTCAAGGACA AGATCGCTAC GTTGCACGGA AAAATGTTGT TAAAAAACTC GAAGAAGAAG GCTATTTAAT TAAAGTTGAA GAATACCGTC ATGCGGTTCC CTATAGCGAT CGCGGAAAAG TCCCCGTTGA ACCCCTATTA TCTACCCAAT GGTTCGTTAA AATTGAACCC CTCTCTAAAA AAGCCTTAAC CTTTTTAGAT GAGTATAATT CCCCTCGTTT TGTCCCTGAT CGCTGGACCA AAGTCTATCG AGATTGGTTA ATAAAATTAA AGGACTGGTG TATCTCCCGT CAATTGTGGT GGGGTCATCA AATTCCCGCT TGGTATGTAA TTAGTGAAAC TAATAACGAA ATTACCGATC ATACCCCATT TATTGTGGCA GACAACGAAG ACGAAGCCAT CAAAAAAGCC CAAGAACAAT ACGGAGAGAA CATCACATTA GAACAAGATC CTGATGTTTT AGATACTTGG TTTTCTTCTG GTTTATGGCC GTTTTCTACG CTAGGTTGGC CAGAAAATAC TGAAGACTTA ACCCGCTACT ATCCCACTAG CACGCTAGTT ACCGGGTTCG ATATTATTTT CTTTTGGGTT GCCCGTATGA CCATGATGGC AGGGTATTTT ACCGATCAAA TTCCCTTCAA AGATGTCTAT ATTCATGGGT TAGTTAGAGA CGAAAACGGT AAAAAAATGT CTAAGTCGGC TAACAATGGC ATCGATCCTT TAATCCTAAT TAAAAACTAT GGAACTGATG CCCTACGGTA TACCCTAATT AAAGAAGTTG CTGGTGCAGG TCAAGATATT AGCCTACAAT ATAACCGAAA AACCGACGAA TCAGAATCCG TTGAAGCGTC TCGAAATTTT 
GCCAATAAAC TCTGGAATGC TGCGCGGTTC GTGATGATGA ACTTGCAAGG AAACACCCCG AAAACCTTGG GAGTTCCTGA TAGGGAAAAA TTAGAATTAT GCGATCGCTG GATCTTATCC CGTTTCCATC AATTAGTCCA ACAAACCCGC AACTACGTCG AAAACTACGG ACTTGGAGAA GCAGCCAAAG GACTTTATGA GTTCATTTGG GGTGATTTTT GCGACTGGTA TATCGAATTA GTCAAAACCC GTCTTTGGAA AGACTCTAAC ACAGAGTCTC GCTTAGTTGC TCAGCAAACC CTAGCCTACG TCCTCGATAA TACCTTAAGG CTACTCCATC CCTTTATGCC CCATATTACC GAAGAAATTT GGCAAACCCT AACCCAAACA AGCGATCAAT TCCTCGCTCT ACAAGCTTAT CCCATTGCTG ACACCCAAGC CATTGATCCT CAACTAGAAA CGTCATTTGA TCTAATCATT GCAACCATTC GTACCTTAAG AAACTTAAGG GCGGAAGCGG AAATCAAACC TGGGGTGAAA GTTTCCGTAA TTTTACAGAG TGAAAATGAG CACGAACGGG CTATTCTAGA ATCTGCTCAA CCTTATATTA AGGATCTCTC AAAAGTGGAA CAACTAACGA TTACTCCCCA ATTAGAAGCG GAGATAGGAC AAGTTATCGC GGGAGTTGTC GGTACCGTTC AGGCTTTAAT TCCCCTATCT GGTATTATTG ATATTGCCAA TCTTCGAGGA AAGCTAGAGA AAAATTTAGC TAAGATAGAA GCTGAGATCA AATCTTTGAG CAGTCGGCTG AGTAACCCTG GTTTTGTCAA CAAAGCCCCT CAAGAAATCA TTCAAGGAGC TCAAGAATCC CTCGCTGAAG CCCAAAAACA AGCAGAAATT CTTCGAGAAC GTCTCCATCG TCTGAAATAA
|
Protein sequence | MSANVPELPT QYDPKTTEAK WQEAWETHQV FKADTNHPGT PYSIVIPPPN VTGSLHMGHA FEDCLMDVLM RYHRMCGHNT LCLPGTDHAS IAVHTILDRQ LKAEGKTRYD MGREKFLEKA WQWKEESGGT IVNQLKRMGL SADWSRERFT LDEGLSKAVR KAFIQLYEAG LIYRGNYLVN WCPASLSAVS DLEVESKEVD GHLWHFRYPL TDGTGYIEVA TTRPETMLGD TGVAVNPNDK RYQSLIGKTL TLPLVGREIP IFADELVDPE FGTGCVKVTP AHDPNDFEMG NRHNLPFINI MNKDGTLNEN AGIFQGQDRY VARKNVVKKL EEEGYLIKVE EYRHAVPYSD RGKVPVEPLL STQWFVKIEP LSKKALTFLD EYNSPRFVPD RWTKVYRDWL IKLKDWCISR QLWWGHQIPA WYVISETNNE ITDHTPFIVA DNEDEAIKKA QEQYGENITL EQDPDVLDTW FSSGLWPFST LGWPENTEDL TRYYPTSTLV TGFDIIFFWV ARMTMMAGYF TDQIPFKDVY IHGLVRDENG KKMSKSANNG IDPLILIKNY GTDALRYTLI KEVAGAGQDI SLQYNRKTDE SESVEASRNF ANKLWNAARF VMMNLQGNTP KTLGVPDREK LELCDRWILS RFHQLVQQTR NYVENYGLGE AAKGLYEFIW GDFCDWYIEL VKTRLWKDSN TESRLVAQQT LAYVLDNTLR LLHPFMPHIT EEIWQTLTQT SDQFLALQAY PIADTQAIDP QLETSFDLII ATIRTLRNLR AEAEIKPGVK VSVILQSENE HERAILESAQ PYIKDLSKVE QLTITPQLEA EIGQVIAGVV GTVQALIPLS GIIDIANLRG KLEKNLAKIE AEIKSLSSRL SNPGFVNKAP QEIIQGAQES LAEAQKQAEI LRERLHRLK
|
| |
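The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for elongation codons, and it strips the spaces that separate the 10-base blocks in the record. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    # Remove the block-separating whitespace used in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAGCGCAA ACGTACCTGA ATTACCCACC"))
print(translate("CGTCTGAAATAA"))
```

Applied to the first three 10-base blocks of the gene sequence, this yields the first ten residues of the protein sequence (MSANVPELPT). Applied to the final 12 bases, it yields the last three residues (RLK) and then stops at the TAA codon.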