Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_29141 |
Symbol | valS |
ID | 4777818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2577888 |
End bp | 2580731 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640088437 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001018909 |
Protein GI | 124024602 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCGC CGCGGCCTTT TGCGAGCATG GTTCGAATGC CGTCCACTTC CATCGTGACT GAGCTGAGCG CCGCAGACCC AGCCTTTGTT CAGGCGGCTG ATGCCCTTGC TAAGACCTAC GACCCAGCTG GGACGGAGAG TCGCTGGCAG TGCGCATGGG AGGAGAGTGG TGCTTTTCAC CCTGACCCAC AGGCTGCAGG CGAGCCTTTT TCGGTGGTGA TCCCCCCCCC AAACGTGACT GGCAGTTTGC ATATGGGGCA TGCCTTCAAT ACGGCTCTGA TCGACACAAT CGTGCGTTTT CAGCGCTTGC AAGGGAAAAA TGTGCTTTGT CTGCCTGGGA CCGATCACGC ATCGATTGCT GTGCAGACCA TTCTTGAAAA GCAGCTCAAG GCGGAAGCGA TCAGTCGGTA TGACTTAGGC CGGGAGGCCT TTCTTGAACG TGCCTGGGCC TGGAAAGAGG AAAGCGGTGG GCGGATTGTT GATCAGCTTC GCCGCTTGGG TTACTCGGTT GACTGGCAGC GTCAGCGCTT CACGTTGGAT GAAGGTCTCA GTGCGGCTGT CCGTGAAGCT TTTGTTCGCT TACATGAGCA GGGCTTGATC TATCGAGGTG AGTACCTCGT GAATTGGTGC CCGGCTTCCG GATCGGCAGT GAGCGACTTG GAGGTTGAGA TGAAGGAAGT CGATGGTCAT CTGTGGCACT TGCGCTATCC CCTAACGGGT GGCCCGGCCG CTGATGGCAC TACTCATCTT GAAGTGGCCA CAACTCGCCC TGAAACCATG CTTGGCGATG TGGCTGTTGC GGTGAATCCG GCCGACGAGC GTTATCGCCA TTTGGTGGGT CAAACCCTCA CATTGCCCCT GCTGGGGCGA GAGATTCCCG TGATTGCTGA TGACCATGTG GATCAGGATT TCGGGACTGG TTGCGTCAAG GTGACACCCG CCCATGACCC CAACGATTTT GCGATTGGAC GGCGGCACGA CTTGCCTCAG ATCACGGTGA TGAACAAAAA CGGAAGCATG AATTGTCATG CCGGTCGTTT TGAGGGGCTG GATCGCTTTG AGGCCCGCAA GGCAGTTGTG GCGGCCTTGC AGGAAGAGGG CCTATTGGTG AAAGTGGAGC CCCATCGCCA TAGCGTTCCT TATTCCGACC GAGGCAAGGT GCCGGTGGAG CCTTTGCTTT CCACTCAGTG GTTCGTGCGT ATGGAACCTC TAGCGGCACG TTGCCATGAG TGTCTTGATC ATGGAGCACC CCGCTTCGTA CCCAATCGTT GGCAAAAGGT CTATCGCGAT TGGCTCACTG ACATTCGTGA TTGGTGCATC AGCCGTCAGC TGTGGTGGGG CCATCGCATT CCTGCTTGGT TTGTTGTTAG TGAGACTGAC GATCAGTTGA CCGATGCCAC TCCTTATCTG GTGGCTCGCT CGGAGGAGGA GGCATGGCAG CAGGCTCGTG ATCAGTTTGG AGAGGCTGTG GTCATCCAGC AGGATGAAGA TGTGCTTGAT ACCTGGTTTT CCAGTGGTCT TTGGCCCTTC TCCACCATGG GCTGGCCTGA TCAAGAGAGT GCAGACCTTG AATGTTGGTA TCCCACCAGC ACTTTGGTCA CAGGTTTCGA CATCATCTTT TTCTGGGTGG CGAGGATGAC GATGATGGCT GGTGCCTTCA CCGGGCGCAT GCCGTTCGCA GACGTCTATA TCCATGGCTT GGTGAGGGAT GAGCAGAATC GCAAGATGAG CAAAAGCGCC GGCAATGGTA TTGATCCGTT GTTGCTCATC GAGCGGTATG GCACCGATGC TCTGCGCTTT GCCCTGGTGC GTGAAGTTGC TGGAGCTGGT CAAGACATCC GCCTGGATTA CGACCGCAAG AGCGACACCT CTGCGACGGT GGAGGCGGCC CGGAACTTCG CTAATAAGCT CTGGAATGCC ACTCGTTTTG CCCTGATGAA TCTGGGTGGA GAGACGCCGG CATCGCTGGG CGAGCCTGAT CCTGCGAGCT TGCAGCTCGC GGATCGTTGG ATTCTTTCGC GCCTAGCTCG CATGACTCGC GATGTTGCTG AGCGCTACGA CAGTTATCGC CTTGGTGAGG CGGCTAAATG CCTTTATGAG TTCGCTTGGA ACGATATTTG CGACTGGTAT TTAGAGCTGA GTAAACGACG GCTACATCCG GGTGAAGATC CCAGTGATGA AGTTTTGGCG GATCAGTGCA CAGCTCGTCA GGTGTTGGCC AAGGTGCTTG CTGATCTATT GGTGATGCTT CACCCATTAA TGCCTCATTT GAGCGAGGAA CTTTGGCATG GGTTAACTGG TGCTCCCAAA GATACTTTTC TGGCTTTGCA AAGCTGGCCA GCCAGCAACA AATCATCTCT TGATGAGGCT CTTGAACTTT CGTTTACTGA GCTGATTGAG GCCATCCGGG TGGTGCGCAA CTTGCGTGCA GTTGCTGGTC TGAAGCCAGC TCAGACGGTG CCGGTTCAAT TCATTACAGG CCGCCGTGAG CTGGCCGCTT TGTTAGAGCA GGCGACTGCG GATATCACAG CTCTCACGCG TGCTGAGAGC GTGGTGGTGG CGACCAGTGC TGATCTGAGG CAGCGCTGCT TAGCCGGAGT TAGTGGGGAG TTGCAAGTGC TGCTGCCCAT CGATGGATTG GTGGATCTGG ATGCTCTTAG GGGTCGCTTG GAGAAGGATT TAGCTAAGGC AGAGAAAGAG ATTGCTGGTC TGGCGGGTCG CTTGGCCAAT CCGAATTTCG CAATCAAAGC TCCGCCGAAC GTCGTTGAAG AATGCCAATC CAACCTTGCT GAGGCTGAGG CTCAGGCTGA GCTTGCGCGT CAGCGGTTGT CCGATTTGGG TTAA
|
Protein sequence | MDAPRPFASM VRMPSTSIVT ELSAADPAFV QAADALAKTY DPAGTESRWQ CAWEESGAFH PDPQAAGEPF SVVIPPPNVT GSLHMGHAFN TALIDTIVRF QRLQGKNVLC LPGTDHASIA VQTILEKQLK AEAISRYDLG REAFLERAWA WKEESGGRIV DQLRRLGYSV DWQRQRFTLD EGLSAAVREA FVRLHEQGLI YRGEYLVNWC PASGSAVSDL EVEMKEVDGH LWHLRYPLTG GPAADGTTHL EVATTRPETM LGDVAVAVNP ADERYRHLVG QTLTLPLLGR EIPVIADDHV DQDFGTGCVK VTPAHDPNDF AIGRRHDLPQ ITVMNKNGSM NCHAGRFEGL DRFEARKAVV AALQEEGLLV KVEPHRHSVP YSDRGKVPVE PLLSTQWFVR MEPLAARCHE CLDHGAPRFV PNRWQKVYRD WLTDIRDWCI SRQLWWGHRI PAWFVVSETD DQLTDATPYL VARSEEEAWQ QARDQFGEAV VIQQDEDVLD TWFSSGLWPF STMGWPDQES ADLECWYPTS TLVTGFDIIF FWVARMTMMA GAFTGRMPFA DVYIHGLVRD EQNRKMSKSA GNGIDPLLLI ERYGTDALRF ALVREVAGAG QDIRLDYDRK SDTSATVEAA RNFANKLWNA TRFALMNLGG ETPASLGEPD PASLQLADRW ILSRLARMTR DVAERYDSYR LGEAAKCLYE FAWNDICDWY LELSKRRLHP GEDPSDEVLA DQCTARQVLA KVLADLLVML HPLMPHLSEE LWHGLTGAPK DTFLALQSWP ASNKSSLDEA LELSFTELIE AIRVVRNLRA VAGLKPAQTV PVQFITGRRE LAALLEQATA DITALTRAES VVVATSADLR QRCLAGVSGE LQVLLPIDGL VDLDALRGRL EKDLAKAEKE IAGLAGRLAN PNFAIKAPPN VVEECQSNLA EAEAQAELAR QRLSDLG
|
| |