Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3441 |
Symbol | valS |
ID | 4243384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 5260963 |
End bp | 5263698 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638108417 |
Product | valyl-tRNA synthetase |
Protein accession | YP_723007 |
Protein GI | 113476946 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGAA GCATACCAGA ACTACCAACA AAATACGACC CCTTTGTCAC TGAAGCTAAA TGGCAGAGAT ATTGGGAAGA AAACAATACT TTCAAAGCAG ACCCTAAGCA AGCAGGCGAC CCCTACTGCA TAGTCATTCC TCCACCAAAT GTTACAGGTA GTCTCCATGC AGGCCATGCT TTCAATAATT CTCTTATTGA TGCCCTTATC CGTTACCATC GGATGCGTGG TGACAATACC CTCTACCTCC CAGGCACAGA TCACGCTAGT ATCGCAGTTC AAACTATTCT GGAACGCAAA CTCAAAGCGG ATGGCCAAAC TCGCTACGAT GTCGGGCGGG AAAAATTTCT CAAACTTGCC TGGGAATGGA AAGCAGAGTC TGGTGGCACT ATTGTTAATC AACTGCGTCG CTTAGGAGTT TCTACAGACT GGACACGGGA ACGTTTTACC CTGGATGAAG GCTTATCAAA AGCTGTCATT GAGGCATTCA TGCGTCTCTA CGAACAAGGT CTCATCTATC GTGGTAATTA TTTGGTCAAC TGGTGTCCGG CTAGTCAGTC TGCTGTTTCT GACTTAGAGG TAGAAAATAA AGAAGTTAAT GGGAACCTCT GGCATTTCCG TTATCCTCTC ACTGATGGCA GTGGTTATGT GGAAGTCGCT ACTACTCGCC CAGAAACAAT GTTGGGAGAT ACTGGTGTTG CTGTTAACCC TAATGACGAT CGCTACAAAG ATATTATCGG CAAAACTGTT ATGCTACCAA TAATGAAACG GGAAATACCT ATTATTGCTG ATGAGTTGGT TGATCCTGAG TTTGGTACTG GTTGTGTTAA AGTTACTCCC GCTCACGACC CTAATGATTT TGAAATGGGT AAACGTCATA ACTTGCCGTT AATCAATATT ATGAATAAAG ATGGTAGCTT GAATGAAAAT GCGGGAGTGT TTGTTGAGCA AGACCGTTTT GAGGCGCGGA AAAATGTTGT CCAACGGTTA AAGGAAGAAG GGGTTTTAGT AAAAGTAGAA GATTATAGTC ATAGTGTTCC CTATAGTGAT CGTGGCAAAG TACCTGTAGA ACCACTGCTC TCAACTCAGT GGTTTGTTAA AATTAAACCC TTGGCGGAGA AAACACTGGA ATTATTGGAT AGGCACAACC AACCCAAATT TGTTCCTGAA CGTTGGACGA AAGTTTATCG CGACTGGTTG GTAAAAATTC AAGACTGGTG TATTTCCCGA CAACTCTGGT GGGGTCATCA AATTCCGGCT TGGTATGTTA TTAGTGAAAC TAATGGAGAA ATTGCCGATA ATACTCCTTT TGTTGTGGCT CGCTTTGAAG CGGAAGCGTT AGAAGTAGCA AAGAAAAAAT TTGGCGACCA TGTCAAAATT CAACAAGACC CAGATGTTCT TGATACTTGG TTTTCATCTA GTTTATGGCC GTTTTCCACA TTGGGCTGGC CGGAAAATAC TCCAGACTTA GAAAAATATT ATCCTACTAG CACTCTTTCT ACTGGTTTTG ATATCATCTT TTTTTGGGTA GCTAGAATGA CAATGATGGG AGTACATTTG ACTGGAAAAA TGCCATTTGA AACTGTTTAT ATTCACGGTT TAATGTTAGA TGAAAACGGT AAAAAACAAT CTAAATCTGC TGGAAATGGT ATTAACCCAT TATTATTAAT TGAAAAGTAT GGTACCGATG CTTTACGTTA CACTTTGATG CGAGAAACTG CTGGAGCAGG ACAAGATGTG AGGATGGATT ATAATAGAGA AACTGATGAG TCTGCATCAG TAGAAGCTTC TCGAAATTTT ACTAATAAAA TTTGGAATGC TGCTAGGTTT GTCATGATGA AATTAGAGGG TAAAACTCCT GAAGAATTAG GGAAACCAGA GGTAGATAAA TTAGAATTAG TTGACCGTTG GATTTTGTCT CACTTCTATC AGACTGTGCA ACAAACTTGC GATTATTTAG ATAATTATGG TTTGGGAGAA GCTGCTAAAG GTCTTTATGA ATTTATTTGG GGATATTTTT GTGATTGGTA TCTTGAATTA GTTAAGTCTC GTTTACAGGG TGAAGATGAA AATTCTCGGT TGGTGGCACA GCAAACTTTA GCTTATGTTT TGGATGGAAT TTTAAGGTTA TTACATCCTT TTATGCCTCA TATTACTGAA GAAATTTGGC ATACTTTGAA TCAAGTTGGA GAAGAAAATT GTTTGGCTTT ACAGTCTTAT CCAAAGTTAG ATAAATCTTT AATTAATTCT GATTTGGAGC ATGAGTTTGA GTTATTAATT GGGGTAATTC GGAGTCTCAG AAATTTACGG GCGGAGGTTG ATATTAAGCC AAAGGTAAAA ATTACGGCAA TTTTGCAGAG TCAGAATGAA CAGGAACGTG AAATTTTGAG GAAAGGAGAG AATTATATTC AGGATTTAAT TAAAATTGAA AAGTTGAATA TTACTTCTAG TGTTGATTTA AAGGTTGGTC AAACTATTGC TGGTGTTATT GGTACTGTAC AAGTTTTAAT TCCATTATCA GGAGTAGTAG ATATTGAAGC TTTATCTGCT AGACTGAACA AAAAGTTAGA GAAGTTGGAA AAAGAGATAG GATCTACTAA AAAACGGTTG AGTAAACCAG ATTTTGTGAA GAAGGCAGAC CCTAAATTTG TGGAGGAAAC TAGAAATAAT TTGGCAGAGG CAGAAAAGCA GGCAGAATTT TTGCGCGATC GCTTGGATAA GTTCCAGTCA AATTAA
|
Protein sequence | MTRSIPELPT KYDPFVTEAK WQRYWEENNT FKADPKQAGD PYCIVIPPPN VTGSLHAGHA FNNSLIDALI RYHRMRGDNT LYLPGTDHAS IAVQTILERK LKADGQTRYD VGREKFLKLA WEWKAESGGT IVNQLRRLGV STDWTRERFT LDEGLSKAVI EAFMRLYEQG LIYRGNYLVN WCPASQSAVS DLEVENKEVN GNLWHFRYPL TDGSGYVEVA TTRPETMLGD TGVAVNPNDD RYKDIIGKTV MLPIMKREIP IIADELVDPE FGTGCVKVTP AHDPNDFEMG KRHNLPLINI MNKDGSLNEN AGVFVEQDRF EARKNVVQRL KEEGVLVKVE DYSHSVPYSD RGKVPVEPLL STQWFVKIKP LAEKTLELLD RHNQPKFVPE RWTKVYRDWL VKIQDWCISR QLWWGHQIPA WYVISETNGE IADNTPFVVA RFEAEALEVA KKKFGDHVKI QQDPDVLDTW FSSSLWPFST LGWPENTPDL EKYYPTSTLS TGFDIIFFWV ARMTMMGVHL TGKMPFETVY IHGLMLDENG KKQSKSAGNG INPLLLIEKY GTDALRYTLM RETAGAGQDV RMDYNRETDE SASVEASRNF TNKIWNAARF VMMKLEGKTP EELGKPEVDK LELVDRWILS HFYQTVQQTC DYLDNYGLGE AAKGLYEFIW GYFCDWYLEL VKSRLQGEDE NSRLVAQQTL AYVLDGILRL LHPFMPHITE EIWHTLNQVG EENCLALQSY PKLDKSLINS DLEHEFELLI GVIRSLRNLR AEVDIKPKVK ITAILQSQNE QEREILRKGE NYIQDLIKIE KLNITSSVDL KVGQTIAGVI GTVQVLIPLS GVVDIEALSA RLNKKLEKLE KEIGSTKKRL SKPDFVKKAD PKFVEETRNN LAEAEKQAEF LRDRLDKFQS N
|
| |