Gene NATL1_21531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21531 
SymbolvalS 
ID4780641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1809895 
End bp1812696 
Gene Length2802 bp 
Protein Length933 aa 
Translation table11 
GC content38% 
IMG OID640085450 
Productvalyl-tRNA synthetase 
Protein accessionYP_001015973 
Protein GI124026858 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0606355 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.859244 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATAGAGC GGGCAAAAAC TACAAAATTA TCTGAAGCCT CAGGGCTTCC TAAAACATAT 
GATCCAGTAG GTACTGAAAA TCGCTGGCAG AAAGCTTGGG AAGAAAAAGG AGCTTTTAAA
CCTGATCCAT CAGCCCCTGG AGACCCATTC TCCGTAGTTA TTCCTCCTCC AAATGTCACA
GGTAGTTTGC ATATGGGGCA TGCTTTTAAT ACTGCCTTAA TCGATACAGT TGTCAGGTAT
AAGAGATTAA AAGGAAATAA TGTTCTTTGT CTTCCAGGAA CAGACCATGC TTCAATTGCG
GTTCAAACTA TTCTTGAGCG ACAACTTAAG GAAGAAGGCA AAAATCGTCG TGATCTTGGT
AGAGCTTCTT TTTTGGAAAA AGCTTGGGAG TGGAAAGAAA AAAGTGGTGG AAGAATTGTT
GATCAATTAA AGCGTTTGGG ATATTCCGTA GACTGGAGTA GAGAGAGATT TACATTGGAT
GAAGGACTGA GTAAAGCTGT TTCTGAGGCA TTTGTTCGTT TACATGAAAA GGGATTGATA
TATCGAGGAG AATATTTAGT GAATTGGTGC CCTGCCTCTG GTTCGGCTGT GAGTGATTTA
GAAGTTGAAA TGAAAGAAGT AGATGGACAT CTATGGCATT TTCGATATCC CCTAGTCACA
TCATCTGTAT CAAGTGCGAA ACAAATTAGT TACTTAGAAG TAGCGACTAC ACGTCCTGAG
ACGATGCTTG GGGATGTCGC AGTTGCTGTG AACCCATCAG ATGAAAGGTA TAAAGATCTC
ATAGGGGAGA AACTTACTTT GCCTTTAGTT GGCAGAACCA TTCCAATTAT TGGAGACCCT
CATGTAGATA AAGATTTTGG AACTGGATGT GTCAAAGTCA CTCCAGCTCA TGATCCTAAT
GATTTTGAGA TAGGTCAAAG ACATGACTTA CCTCAAATAA CAGTCATGAC TAAAAAAGGA
ACGATGAATC ACAATGCAGG TCAATTTGAA GGCTTGGATC GTTTTGAAGC TCGTGAAGCT
GTTATTGATT CTTTAAAGGA GATTGGTCTT TTAACCAAAA TAGAAGCTTA TAAACATAGT
GTGCCTTTCT CTGACCGAGG GAAAGTACCA GTAGAACCAT TGCTGTCAAC TCAGTGGTTT
GTGAAAATGG ATCCTCTCTC TAGTAGTTGT TCTGAATTTT TTGAGAAAGG ACAACCTAAA
TTTATTCCTA ATAGATGGTC TAAAGTTTAT CGTGATTGGT TAACTGATAT AAGAGATTGG
TGTATTAGTA GACAACTTTG GTGGGGACAT CGCATTCCGG CTTGGTTTGT AATTAGTCAA
ACAGATAATA AAGTTGTTAA TGAAACCCCG TACATTGTTG CTCGAACAGA AGATGAAGCG
AAGAAATTAG CACGAGAAAA ATATGGAGAT TCAGTTAAAA TTGAGCAAGA TGAAGATGTG
CTAGATACAT GGTTTTCCAG CGGATTATGG CCTTTTTCTA CATTAGGTTG GCCTGATGAA
ACCCATCCTG ATTTTCAACG TTGGTATCCC ACGAATACTT TGGTTACTGG CTTTGACATT
ATTTTCTTTT GGGTAGCAAG GATGACAATG ATGGCTGGTG TCTTTACGGA GCGGATGCCA
TTTGCTGACG TCTATATTCA CGGACTTGTT AGAGATGAAC AGAACAGAAA GATGAGTAAA
AGTGCTGGAA ATGGCATTGA TCCTTTATTA CTAATAGAAA GATATGGAAC AGATGCTTTG
AGGTTTGCTC TTGTTCGTGA AGTTGCAGGT GCTGGCCAAG ATATACGTCT TGACTTTGAT
CGTAAAAATC AAACATCAGC AACGGTTGAG GCATCTAGAA ATTTTGCTAA TAAGCTTTGG
AATGCAACTA GGTTTGCTCT TATTAATCTT GAAGATCAGG ATTATGAAAA CTTGGAGTCA
TACGATTCTT CTAAGTTGCA ATTATCAGAC AGGTGGATTT TATCAAGACT TGCACGAGTC
AATCATGAGA CTGCTAATCG ATATGAAAAT TATGCTCTAG GAGAGGCCGC TAAGGGACTA
TATGAATTTG CTTGGAATGA TTTTTGTGAT TGGTATTTAG AATTAATTAA ACGTCGATTG
AATAATTCAG AAAATCTTTC TTCCGATGAA TTATTAGATC GAAAAATAGC GAAAAGTGTT
TTATACAAAG TTCTAAGTGA TCTCTTGATT ATGCTTCATC CTCTAATGCC TCATTTGACA
GAGGAGCTTT GGCATGGATT AACAGGTTTA GATGAGGATC AATTTTTAGC TTTGCAGCCT
TGGCCCAAAT CAAATGAACA AGACTTGAAT CTAGATTTAG AAAGTTCTTT CTCTGATTTA
TTTGCATCTA TTAGATTGAT TCGCAATCTA AGAGCAGTTG CTGGGTTGAA ACCCTCTCAA
AAAGTTCCTG TCATGTTGGT TTCTGGTAAA GAGGTCTTAC AAAAAACACT AACAACATCA
ATCAATGATA TTGCTGTTTT GACCAAGGCT AAGGAAGTAC AGATATTATC TCCAGAGCAA
GCAAAGTCAT TGCCTTCAAT GAAAGCTCTA GCAGGCGTAA GTGGAGAGCT TGAGGTAGTG
TTGCCTATTG AAGGATTAAT AGATATAGCT TCATTAAGAT CTAGGCTAGA AAAAGATTTA
AATAAAGCAC AAAAAGAAAT TGAAGGTCTT TCTGGACGTT TAGCGAATAA GAATTTTGTT
GATAAAGCTC CCAAAGATGT TGTTGAAGAA TGCAGAGCAA ACTTAACGGA GTCAGAAGCT
CAAGTCCGTC TAGTCAAAGA GCGTCTAATG GGATTGGATT GA
 
Protein sequence
MIERAKTTKL SEASGLPKTY DPVGTENRWQ KAWEEKGAFK PDPSAPGDPF SVVIPPPNVT 
GSLHMGHAFN TALIDTVVRY KRLKGNNVLC LPGTDHASIA VQTILERQLK EEGKNRRDLG
RASFLEKAWE WKEKSGGRIV DQLKRLGYSV DWSRERFTLD EGLSKAVSEA FVRLHEKGLI
YRGEYLVNWC PASGSAVSDL EVEMKEVDGH LWHFRYPLVT SSVSSAKQIS YLEVATTRPE
TMLGDVAVAV NPSDERYKDL IGEKLTLPLV GRTIPIIGDP HVDKDFGTGC VKVTPAHDPN
DFEIGQRHDL PQITVMTKKG TMNHNAGQFE GLDRFEAREA VIDSLKEIGL LTKIEAYKHS
VPFSDRGKVP VEPLLSTQWF VKMDPLSSSC SEFFEKGQPK FIPNRWSKVY RDWLTDIRDW
CISRQLWWGH RIPAWFVISQ TDNKVVNETP YIVARTEDEA KKLAREKYGD SVKIEQDEDV
LDTWFSSGLW PFSTLGWPDE THPDFQRWYP TNTLVTGFDI IFFWVARMTM MAGVFTERMP
FADVYIHGLV RDEQNRKMSK SAGNGIDPLL LIERYGTDAL RFALVREVAG AGQDIRLDFD
RKNQTSATVE ASRNFANKLW NATRFALINL EDQDYENLES YDSSKLQLSD RWILSRLARV
NHETANRYEN YALGEAAKGL YEFAWNDFCD WYLELIKRRL NNSENLSSDE LLDRKIAKSV
LYKVLSDLLI MLHPLMPHLT EELWHGLTGL DEDQFLALQP WPKSNEQDLN LDLESSFSDL
FASIRLIRNL RAVAGLKPSQ KVPVMLVSGK EVLQKTLTTS INDIAVLTKA KEVQILSPEQ
AKSLPSMKAL AGVSGELEVV LPIEGLIDIA SLRSRLEKDL NKAQKEIEGL SGRLANKNFV
DKAPKDVVEE CRANLTESEA QVRLVKERLM GLD