Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_1281 |
Symbol | valS |
ID | 3606675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 1788038 |
End bp | 1790839 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637688157 |
Product | valyl-tRNA synthetase |
Protein accession | YP_292474 |
Protein GI | 72383119 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.202707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATAGAGC GGGTAAAAAC TACAAAATTA TCTGAAGCCT CAGGGCTTCC TAAAACATAT GATCCAGTAG GTACTGAAAA TCGCTGGCAG AAAGCTTGGG AAGAAAAAGG AGCTTTTAAA CCTGATCCAT CAGCCCCTGG AGACCCATTC TCCGTAGTTA TTCCTCCTCC AAATGTCACA GGTAGTTTGC ATATGGGGCA TGCTTTTAAT ACTGCCTTAA TCGATACAGT TGTCAGGTAT AAGAGATTAA AAGGAAATAA TGTTCTTTGT CTTCCAGGAA CAGACCATGC TTCAATTGCG GTTCAAACTA TTCTCGAGCG ACAACTTAAG GAAGAAGGCA AAAATCGTCG TGATCTTGGT AGAGCTTCTT TTTTGGAAAA AGCTTGGGAG TGGAAAGAAA AAAGTGGTGG AAGAATTGTT GATCAATTAA AGCGTTTGGG ATATTCCGTA GACTGGAGTA GAGAGAGATT TACATTGGAT GAAGGACTGA GTAAAGCTGT TTCTGAGGCA TTTGTTCGTT TACATGAAAA GGGATTGATA TATCGAGGAG AATATTTAGT GAATTGGTGT CCTGCCTCTG GTTCGGCTGT GAGTGATTTA GAAGTTGAAA TGAAAGAAGT AGATGGACAT CTATGGCATT TTCGATATCC CCTAGTCACA TCATCTGTAT CAAGTGCGAA ACAAATTAGT TACTTAGAAG TAGCGACTAC ACGTCCTGAG ACGATGCTTG GGGATGTCGC AGTTGCTGTG AACCCATCAG ATGAAAGGTA TAAAGATCTC ATAGGGGAGA AACTTACTTT GCCTTTAGTT GGCAGAACCA TTCCAATTAT TGGAGACCCT CATGTAGATA AAGATTTTGG AACTGGATGT GTCAAAGTCA CTCCAGCTCA TGATCCTAAT GATTTTGAGA TAGGTCAGAG ACATGACTTA CCTCAAATAA CAGTCATGAC TAAAAAAGGA ACGATGAATC ACAATGCAGG TCAATTTGAA GGCTTGGATC GTTTTGAAGC TCGTGAAGCT GTTATTGATT CTTTAAAGGA GATTGGTCTT TTAACCAAAA TAGAAGCTTA TAAACATAGT GTGCCTTTCT CTGACCGAGG GAAAGTACCA GTAGAACCAT TGCTGTCAAC TCAGTGGTTT GTGAAAATGG ATCCTCTCTC TAGTAGTTGT TCTGAATTTT TTGAGAAAGG ACAACCTAAA TTTATTCCTA ATAGATGGTC TAAAGTTTAT CGTGATTGGT TAACTGATAT AAGAGATTGG TGTATTAGTA GACAACTTTG GTGGGGACAT CGCATTCCGG CTTGGTTTGT AATTAGTCAA ACAGATAATA AAGTTGTTAA TGAAACCCCG TACATTGTTG CTCGAACAGA AGATGAAGCG AAGAAATTAG CACGAGAAAA ATATGGAGAT TCAGTTAAAA TTGAGCAAGA TGAAGATGTG CTAGATACAT GGTTTTCCAG CGGATTATGG CCTTTTTCTA CATTAGGTTG GCCTGATGAA ACCCATCCTG ATTTTCAACG TTGGTATCCC ACGAATACTT TGGTTACTGG CTTTGACATT ATTTTCTTTT GGGTAGCAAG GATGACAATG ATGGCTGGTG TCTTTACGGA GCGGATGCCA TTTGCTGACG TCTATATTCA CGGACTTGTT AGAGATGAAC AGAACAGAAA GATGAGTAAA AGTGCTGGAA ATGGCATTGA TCCTTTATTA CTAATAGAAA GATATGGAAC AGATGCTTTG AGGTTTGCTC TTGTTCGTGA AGTTGCAGGT GCTGGCCAAG ATATACGTCT TGACTTTGAT CGTAAAAATC AAACATCAGC AACGGTTGAG GCATCTAGAA ATTTTGCTAA TAAGCTTTGG AATGCAACTA GGTTTGCTCT TATTAATCTT GAAGACCAGG ATTATGAAAA CTTGGAGTCA TACGATTCTT CTAAGTTGCA ATTATCAGAC AGGTGGATTT TATCAAGACT TGCACGAGTC AATCATGAGA CTGCTAATCG ATATGAAAAT TATGCTCTAG GAGAGGCCGC TAAGGGACTA TATGAATTTG CTTGGAATGA TTTTTGTGAT TGGTATTTAG AATTAATTAA ACGTCGATTG AATAATTCAG AAAATCTTTC TTCCGATGAA TTATTAGATC GAAAAATAGC GAAAAGTGTT TTATACAAAG TTCTAAGTGA TCTATTGATT ATGCTTCATC CTCTAATGCC TCATTTGACA GAGGAGCTTT GGCATGGATT AACAGGTTTA GATGAGGATC AATTTTTAGC TTTGCAGCCT TGGCCCAAAT CAAATGAACA AGACTTGAAT CTAGATTTAG AAAGTTCTTT CTCTGATTTA TTTGCATCTA TCAGATTGAT TCGCAATCTA AGAGCAGTTG CTGGGTTGAA ACCCTCTCAA AAAGTTCCTG TCATGTTGGT TTCTGGTAAA GAGGTCTTAC AAAAAACACT AACAACATCA ATCAATGATA TTGCTGTTTT GACCAAGGCT AAGGAAGTAC AGATATTATC TCCAGAGCAA GCAAAGTCAT TGCCTTCAAT GAAAGCTCTA GCAGGCGTAA GTGGAGAGCT TGAGGTAGTG TTGCCTATTG AAGGGTTAAT AGATATAGCT TCATTAAGAT CTAGGCTAGA AAAAGATTTA AATAAAGCAC AAAAAGAAAT TGAAAGTCTT TCTGGACGTT TAGCGAATAA GAATTTTGTT GATAAAGCTC CCAAAGATGT TGTTGAAGAA TGCAGAGCAA ACTTAACGGA GTCAGAAGCT CAAGTCCGTC TAGTCAAAGA GCGTCTCATG GGATTGGATT GA
|
Protein sequence | MIERVKTTKL SEASGLPKTY DPVGTENRWQ KAWEEKGAFK PDPSAPGDPF SVVIPPPNVT GSLHMGHAFN TALIDTVVRY KRLKGNNVLC LPGTDHASIA VQTILERQLK EEGKNRRDLG RASFLEKAWE WKEKSGGRIV DQLKRLGYSV DWSRERFTLD EGLSKAVSEA FVRLHEKGLI YRGEYLVNWC PASGSAVSDL EVEMKEVDGH LWHFRYPLVT SSVSSAKQIS YLEVATTRPE TMLGDVAVAV NPSDERYKDL IGEKLTLPLV GRTIPIIGDP HVDKDFGTGC VKVTPAHDPN DFEIGQRHDL PQITVMTKKG TMNHNAGQFE GLDRFEAREA VIDSLKEIGL LTKIEAYKHS VPFSDRGKVP VEPLLSTQWF VKMDPLSSSC SEFFEKGQPK FIPNRWSKVY RDWLTDIRDW CISRQLWWGH RIPAWFVISQ TDNKVVNETP YIVARTEDEA KKLAREKYGD SVKIEQDEDV LDTWFSSGLW PFSTLGWPDE THPDFQRWYP TNTLVTGFDI IFFWVARMTM MAGVFTERMP FADVYIHGLV RDEQNRKMSK SAGNGIDPLL LIERYGTDAL RFALVREVAG AGQDIRLDFD RKNQTSATVE ASRNFANKLW NATRFALINL EDQDYENLES YDSSKLQLSD RWILSRLARV NHETANRYEN YALGEAAKGL YEFAWNDFCD WYLELIKRRL NNSENLSSDE LLDRKIAKSV LYKVLSDLLI MLHPLMPHLT EELWHGLTGL DEDQFLALQP WPKSNEQDLN LDLESSFSDL FASIRLIRNL RAVAGLKPSQ KVPVMLVSGK EVLQKTLTTS INDIAVLTKA KEVQILSPEQ AKSLPSMKAL AGVSGELEVV LPIEGLIDIA SLRSRLEKDL NKAQKEIESL SGRLANKNFV DKAPKDVVEE CRANLTESEA QVRLVKERLM GLD
|
| |