Gene NATL1_20701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20701 
SymbollysS 
ID4780175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1713935 
End bp1715476 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content38% 
IMG OID640085366 
Productlysyl-tRNA synthetase 
Protein accessionYP_001015890 
Protein GI124026775 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.290889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTGTATG TACCCATTGC CTTGTCTGAA TTAAGAGATA CCCGCCTCGA GAAGGCAAAA 
GCACTAAAAA CCCTTGGGAA AGGTCCCTAT GGTTTGAATT TTCGACCTAC TGACTCTGCC
GCTGTTTTGC AGGGAAAATA TAAAGACTTG CCAAATGGCG AAGAAAAAAA AGACGAAGTA
TCTATTGCAG GGCGAGTAAC TTCTAGAAGA GTGATGGGGA AACTTGCTTT TTTTACTCTT
TCCGATGAAA CAGGCATAAT TCAACTCTTT TTGGAAAAGG CGACCTTAAA TCGGAATGAA
GATAGTGAGA ATCCAAAGAA TAATTTTGAG AACATTACCT CTCTGGTTGA CTCTGGTGAT
TGGATAGGAG TCAGTGGAAT ATTAAGGAGA ACTGATAGAG GTGAACTTTC TATAAAGGTT
TTCGAATGGT CAATGCTATC AAAATCTTTA CAACCCCTAC CAGATAAATG GCATGGACTT
GCCGATGTAG AAAAGCGCTA TAGACAAAGA TATTTAGATT TAATTGTAAA TCCACAGTCA
AGAAAAACTT TTCGAACTAG GGCTTTATTA GTTAGTTCAA TTCGACGTTG GTTAGATGAA
AAAGACTTTC TTGAAATCGA GACTCCAGTT TTACAATCAG AAGCTGGAGG AGCGGATGCA
AGGCCTTTCA TAACTCATCA CAACACGCTC GATTTACCTT TGTATCTGAG GATTGCAACA
GAATTGCATC TTAAAAGACT TGTGGTGGGT GGATTTCAAA GGGTCTATGA ACTTGGAAGA
ATTTTTAGAA ATGAAGGGAT TAGCACTAGG CATAATCCTG AATTTACTTC TGTTGAAATT
TATGAGGCCT TTGCAGATTA TTTCGACATG ATGGATTTAA CAGAAAAATT ACTTTCTTCA
GTTTGCGAAA AGATTTGTGG ATCGACCAAA ATTAATTATC AAGAGCAAGA AATAGATTTG
CAACCTCCTT GGAGAAGGGC CACGATGCAT GACTTAGTTA AGGAATTTAC TGGAATAGAT
TTTGAATTAT TTGGAGACAA TGCTGATGAT GCTAAGGCTG AAATGAGTCG AGAGGGGCTT
CAAGTGCCCG ATAAGGCTGA TACTGTTGGA ATCCTATTAA ATGAAGCTTT TGAACAAGCA
GTAGAGCCTG AGCTGATTCA GCCTACATTT GTGATGGATT ATCCAATTGA GATTTCTCCA
TTGGCTAGAA AACACAGAAC TAAAAAAGGT TTAGTGGAAA GATTTGAACT TTTTATTGTT
GGTAGAGAAA CTGCCAATGC TTTTAGCGAA TTAATTGACC CTATTGATCA AAGAGAACGT
TTACTTTTAC AGCAAGCGAA AAAAGAAGCA GGTGATCTTG AGGCTCAAAG CTTGGATGAG
GATTTTATCA ATGCTCTTGA AGTCGGAATG CCTCCAACAG GAGGTCTCGG AATAGGAATA
GATAGGTTTG TAATGTTGTT AACAGATAGT CCTTCCATAA GAGATGTAAT AGCTTTTCCA
CTTTTACGAC CAGAAGCAAA TTTAAAACAA ACTGGAAAGT GA
 
Protein sequence
MLYVPIALSE LRDTRLEKAK ALKTLGKGPY GLNFRPTDSA AVLQGKYKDL PNGEEKKDEV 
SIAGRVTSRR VMGKLAFFTL SDETGIIQLF LEKATLNRNE DSENPKNNFE NITSLVDSGD
WIGVSGILRR TDRGELSIKV FEWSMLSKSL QPLPDKWHGL ADVEKRYRQR YLDLIVNPQS
RKTFRTRALL VSSIRRWLDE KDFLEIETPV LQSEAGGADA RPFITHHNTL DLPLYLRIAT
ELHLKRLVVG GFQRVYELGR IFRNEGISTR HNPEFTSVEI YEAFADYFDM MDLTEKLLSS
VCEKICGSTK INYQEQEIDL QPPWRRATMH DLVKEFTGID FELFGDNADD AKAEMSREGL
QVPDKADTVG ILLNEAFEQA VEPELIQPTF VMDYPIEISP LARKHRTKKG LVERFELFIV
GRETANAFSE LIDPIDQRER LLLQQAKKEA GDLEAQSLDE DFINALEVGM PPTGGLGIGI
DRFVMLLTDS PSIRDVIAFP LLRPEANLKQ TGK