Gene A9601_18281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_18281 
SymbollysS 
ID4718565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1559871 
End bp1561337 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content32% 
IMG OID640079561 
Productlysyl-tRNA synthetase 
Protein accessionYP_001010218 
Protein GI123969360 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.505849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAAAG GATTTGCTTC TTACGCACAA AGCTTTAAGG TATCACATAC TACCAGTTTT 
CTTATTCAAA AATTTGATCA TCTAGAAAAT GGTCAAGAGG AAGACTTCAG TGTTTCTATT
GCTGGTAGAG TTCTGGCAAA AAGGGTAATG GGCAAAATTG CCTTTTTCAC AATAAGCGAT
CAAGAAGGTC AGATTCAGCT TTATCTAGAT AAAAGGATTA TTAATTTCAA TTTAGAAAAA
CAAAAATTAC TTTCTTTTGA AGATCTCAAA GAAATAGTAG ATATTGGTGA TTGGATAGGA
GTCTATGGAA CTATTAAAAA AACTAATAAA GGTGAGCTTT CAATTAAAGT AGAAAAATGG
GAAATGTTAT CCAAATCATT ACAACCTCTC CCAGATAAAT GGCATGGATT GACTGATATT
GAAAAAAGAT ATAGACAACG TTATTTAGAT TTAATAGTTA ATCCTCACTC TAAAAATGTA
TTTAAAACCA GAGCGAAATG TATAAGTTTT ATAAGAAAAT GGCTAGATAA TAGAAATTTT
TTAGAGATAG AGACTCCAAT TCTGCAATCT GAAGCTGGTG GTGCTGAAGC AAGACCATTT
ATAACTCATC ACAATACATT AGATATTCCG TTGTATTTAA GAATAGCTAC AGAATTACAT
TTAAAGCGAA TGGTTGTTGG AGGTTTTGAG AAAGTCTATG AATTGGGAAG AATCTTCCGT
AATGAGGGGA TAAGTACAAG GCATAATCCA GAATTCACCT CAGTGGAAAT TTATGAAGCT
TATTCTGATT ATGTAGATAT GATGAATTTA ACTGAAGAAT TGATTAAAGA TATCGTAGCT
GATGCATGTG GGTCCTTAAT TATAAATTAT CAAAATAAAG AAATTGATTT TTCTAAGCCT
TGGTCAAGAA TATCCATGAA AGCTATTGTC AAAAAATATA CAGGGATTGA TTTTGATTCT
TTCAGTGGAG ACTTTCTAGC AGCAAAACAA GCCGTTAAAA ATATCAATGT TGATTGTTCT
AATAAAGTAA ATACTATGGG AAGACTTTTA AATGAGGTCT TCGAGCAAAA AGTAGAATCA
AAACTTGTAG AACCCACTTT TGTTATTGAT TATCCTGTTG AAATTTCTCC TTTAGCTAGG
CCTCATCATG ATAATAAACA AATAGTTCAG AGATTTGAAT TATTCATTGT TGGTAGAGAA
CTGGCAAATG CGTTTAGTGA GTTGATAGAT CCAGTAGATC AAAGAGAAAG AATGCAATTA
CAGCAATCTC TTAGAGACGA AGGAGATCTT GAGGCTCACT GTATAGATGA AGATTTTTTA
AATGCTTTAG AGATTGGCAT GCCGCCTACG GGAGGATTAG GTATAGGCAT TGATAGGCTA
ATTATGTTAA TTACTAATAG CGCATCGATT AGAGATGTAA TCCCTTTCCC ATTGTTAAAA
CCAGAAATAA CTTCCAAAAA AAGTTAA
 
Protein sequence
MNKGFASYAQ SFKVSHTTSF LIQKFDHLEN GQEEDFSVSI AGRVLAKRVM GKIAFFTISD 
QEGQIQLYLD KRIINFNLEK QKLLSFEDLK EIVDIGDWIG VYGTIKKTNK GELSIKVEKW
EMLSKSLQPL PDKWHGLTDI EKRYRQRYLD LIVNPHSKNV FKTRAKCISF IRKWLDNRNF
LEIETPILQS EAGGAEARPF ITHHNTLDIP LYLRIATELH LKRMVVGGFE KVYELGRIFR
NEGISTRHNP EFTSVEIYEA YSDYVDMMNL TEELIKDIVA DACGSLIINY QNKEIDFSKP
WSRISMKAIV KKYTGIDFDS FSGDFLAAKQ AVKNINVDCS NKVNTMGRLL NEVFEQKVES
KLVEPTFVID YPVEISPLAR PHHDNKQIVQ RFELFIVGRE LANAFSELID PVDQRERMQL
QQSLRDEGDL EAHCIDEDFL NALEIGMPPT GGLGIGIDRL IMLITNSASI RDVIPFPLLK
PEITSKKS