Gene P9303_29141 details

Gene Information

Locus tag: P9303_29141
Symbol: valS
ID: 4777818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2577888
End bp: 2580731
Gene Length: 2844 bp
Protein Length: 947 aa
Translation table: 11
GC content: 56%
IMG OID: 640088437
Product: valyl-tRNA synthetase
Protein accession: YP_001018909
Protein GI: 124024602
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
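
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative only and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2577888, 2580731)
print(length)                     # 2844
print(protein_length_aa(length))  # 947
```

Both results match the Gene Length and Protein Length fields in the record.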


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCGC CGCGGCCTTT TGCGAGCATG GTTCGAATGC CGTCCACTTC CATCGTGACT 
GAGCTGAGCG CCGCAGACCC AGCCTTTGTT CAGGCGGCTG ATGCCCTTGC TAAGACCTAC
GACCCAGCTG GGACGGAGAG TCGCTGGCAG TGCGCATGGG AGGAGAGTGG TGCTTTTCAC
CCTGACCCAC AGGCTGCAGG CGAGCCTTTT TCGGTGGTGA TCCCCCCCCC AAACGTGACT
GGCAGTTTGC ATATGGGGCA TGCCTTCAAT ACGGCTCTGA TCGACACAAT CGTGCGTTTT
CAGCGCTTGC AAGGGAAAAA TGTGCTTTGT CTGCCTGGGA CCGATCACGC ATCGATTGCT
GTGCAGACCA TTCTTGAAAA GCAGCTCAAG GCGGAAGCGA TCAGTCGGTA TGACTTAGGC
CGGGAGGCCT TTCTTGAACG TGCCTGGGCC TGGAAAGAGG AAAGCGGTGG GCGGATTGTT
GATCAGCTTC GCCGCTTGGG TTACTCGGTT GACTGGCAGC GTCAGCGCTT CACGTTGGAT
GAAGGTCTCA GTGCGGCTGT CCGTGAAGCT TTTGTTCGCT TACATGAGCA GGGCTTGATC
TATCGAGGTG AGTACCTCGT GAATTGGTGC CCGGCTTCCG GATCGGCAGT GAGCGACTTG
GAGGTTGAGA TGAAGGAAGT CGATGGTCAT CTGTGGCACT TGCGCTATCC CCTAACGGGT
GGCCCGGCCG CTGATGGCAC TACTCATCTT GAAGTGGCCA CAACTCGCCC TGAAACCATG
CTTGGCGATG TGGCTGTTGC GGTGAATCCG GCCGACGAGC GTTATCGCCA TTTGGTGGGT
CAAACCCTCA CATTGCCCCT GCTGGGGCGA GAGATTCCCG TGATTGCTGA TGACCATGTG
GATCAGGATT TCGGGACTGG TTGCGTCAAG GTGACACCCG CCCATGACCC CAACGATTTT
GCGATTGGAC GGCGGCACGA CTTGCCTCAG ATCACGGTGA TGAACAAAAA CGGAAGCATG
AATTGTCATG CCGGTCGTTT TGAGGGGCTG GATCGCTTTG AGGCCCGCAA GGCAGTTGTG
GCGGCCTTGC AGGAAGAGGG CCTATTGGTG AAAGTGGAGC CCCATCGCCA TAGCGTTCCT
TATTCCGACC GAGGCAAGGT GCCGGTGGAG CCTTTGCTTT CCACTCAGTG GTTCGTGCGT
ATGGAACCTC TAGCGGCACG TTGCCATGAG TGTCTTGATC ATGGAGCACC CCGCTTCGTA
CCCAATCGTT GGCAAAAGGT CTATCGCGAT TGGCTCACTG ACATTCGTGA TTGGTGCATC
AGCCGTCAGC TGTGGTGGGG CCATCGCATT CCTGCTTGGT TTGTTGTTAG TGAGACTGAC
GATCAGTTGA CCGATGCCAC TCCTTATCTG GTGGCTCGCT CGGAGGAGGA GGCATGGCAG
CAGGCTCGTG ATCAGTTTGG AGAGGCTGTG GTCATCCAGC AGGATGAAGA TGTGCTTGAT
ACCTGGTTTT CCAGTGGTCT TTGGCCCTTC TCCACCATGG GCTGGCCTGA TCAAGAGAGT
GCAGACCTTG AATGTTGGTA TCCCACCAGC ACTTTGGTCA CAGGTTTCGA CATCATCTTT
TTCTGGGTGG CGAGGATGAC GATGATGGCT GGTGCCTTCA CCGGGCGCAT GCCGTTCGCA
GACGTCTATA TCCATGGCTT GGTGAGGGAT GAGCAGAATC GCAAGATGAG CAAAAGCGCC
GGCAATGGTA TTGATCCGTT GTTGCTCATC GAGCGGTATG GCACCGATGC TCTGCGCTTT
GCCCTGGTGC GTGAAGTTGC TGGAGCTGGT CAAGACATCC GCCTGGATTA CGACCGCAAG
AGCGACACCT CTGCGACGGT GGAGGCGGCC CGGAACTTCG CTAATAAGCT CTGGAATGCC
ACTCGTTTTG CCCTGATGAA TCTGGGTGGA GAGACGCCGG CATCGCTGGG CGAGCCTGAT
CCTGCGAGCT TGCAGCTCGC GGATCGTTGG ATTCTTTCGC GCCTAGCTCG CATGACTCGC
GATGTTGCTG AGCGCTACGA CAGTTATCGC CTTGGTGAGG CGGCTAAATG CCTTTATGAG
TTCGCTTGGA ACGATATTTG CGACTGGTAT TTAGAGCTGA GTAAACGACG GCTACATCCG
GGTGAAGATC CCAGTGATGA AGTTTTGGCG GATCAGTGCA CAGCTCGTCA GGTGTTGGCC
AAGGTGCTTG CTGATCTATT GGTGATGCTT CACCCATTAA TGCCTCATTT GAGCGAGGAA
CTTTGGCATG GGTTAACTGG TGCTCCCAAA GATACTTTTC TGGCTTTGCA AAGCTGGCCA
GCCAGCAACA AATCATCTCT TGATGAGGCT CTTGAACTTT CGTTTACTGA GCTGATTGAG
GCCATCCGGG TGGTGCGCAA CTTGCGTGCA GTTGCTGGTC TGAAGCCAGC TCAGACGGTG
CCGGTTCAAT TCATTACAGG CCGCCGTGAG CTGGCCGCTT TGTTAGAGCA GGCGACTGCG
GATATCACAG CTCTCACGCG TGCTGAGAGC GTGGTGGTGG CGACCAGTGC TGATCTGAGG
CAGCGCTGCT TAGCCGGAGT TAGTGGGGAG TTGCAAGTGC TGCTGCCCAT CGATGGATTG
GTGGATCTGG ATGCTCTTAG GGGTCGCTTG GAGAAGGATT TAGCTAAGGC AGAGAAAGAG
ATTGCTGGTC TGGCGGGTCG CTTGGCCAAT CCGAATTTCG CAATCAAAGC TCCGCCGAAC
GTCGTTGAAG AATGCCAATC CAACCTTGCT GAGGCTGAGG CTCAGGCTGA GCTTGCGCGT
CAGCGGTTGT CCGATTTGGG TTAA
 
Protein sequence
MDAPRPFASM VRMPSTSIVT ELSAADPAFV QAADALAKTY DPAGTESRWQ CAWEESGAFH 
PDPQAAGEPF SVVIPPPNVT GSLHMGHAFN TALIDTIVRF QRLQGKNVLC LPGTDHASIA
VQTILEKQLK AEAISRYDLG REAFLERAWA WKEESGGRIV DQLRRLGYSV DWQRQRFTLD
EGLSAAVREA FVRLHEQGLI YRGEYLVNWC PASGSAVSDL EVEMKEVDGH LWHLRYPLTG
GPAADGTTHL EVATTRPETM LGDVAVAVNP ADERYRHLVG QTLTLPLLGR EIPVIADDHV
DQDFGTGCVK VTPAHDPNDF AIGRRHDLPQ ITVMNKNGSM NCHAGRFEGL DRFEARKAVV
AALQEEGLLV KVEPHRHSVP YSDRGKVPVE PLLSTQWFVR MEPLAARCHE CLDHGAPRFV
PNRWQKVYRD WLTDIRDWCI SRQLWWGHRI PAWFVVSETD DQLTDATPYL VARSEEEAWQ
QARDQFGEAV VIQQDEDVLD TWFSSGLWPF STMGWPDQES ADLECWYPTS TLVTGFDIIF
FWVARMTMMA GAFTGRMPFA DVYIHGLVRD EQNRKMSKSA GNGIDPLLLI ERYGTDALRF
ALVREVAGAG QDIRLDYDRK SDTSATVEAA RNFANKLWNA TRFALMNLGG ETPASLGEPD
PASLQLADRW ILSRLARMTR DVAERYDSYR LGEAAKCLYE FAWNDICDWY LELSKRRLHP
GEDPSDEVLA DQCTARQVLA KVLADLLVML HPLMPHLSEE LWHGLTGAPK DTFLALQSWP
ASNKSSLDEA LELSFTELIE AIRVVRNLRA VAGLKPAQTV PVQFITGRRE LAALLEQATA
DITALTRAES VVVATSADLR QRCLAGVSGE LQVLLPIDGL VDLDALRGRL EKDLAKAEKE
IAGLAGRLAN PNFAIKAPPN VVEECQSNLA EAEAQAELAR QRLSDLG
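
The gene and protein sequences are printed in blocks of ten characters, so the whitespace has to be stripped before they can be analysed. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It uses only the Python standard library. The helper names are hypothetical, and the codon table is the standard amino-acid assignment shared by NCBI translation tables 1 and 11; the alternative start codons of table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = ("ATGGATGCGC CGCGGCCTTT TGCGAGCATG "
              "GTTCGAATGC CGTCCACTTC CATCGTGACT")
print(translate(first_line))  # MDAPRPFASMVRMPSTSIVT
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.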