Gene NATL1_16621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_16621 
SymbolserS 
ID4779877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1353713 
End bp1354993 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content36% 
IMG OID640084945 
Productseryl-tRNA synthetase 
Protein accessionYP_001015484 
Protein GI124026368 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0172] Seryl-tRNA synthetase 
TIGRFAM ID[TIGR00414] seryl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.897711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATAGATC AGAGACTTCT AAGAGAAAAT CCTAATTTAA TTTCGGAAGG GCTTAAATCA 
AGAGGAATGG ATGTTGACTT AGGACCTCTA CAAAAATTCT GCAAAGATCT TAAAGACCTA
GAAGAAAAGA GAAATTCCCT ACAAGCACAA GGGAATTCAA TCGGGAAAGA GGTTGGACAA
AAAATAAAAC AAGGTCTTCC TCATGATTCT GAGGAGATTT CAAATCTTCG TGTTAAGGGC
AATCAAATCA AAAAACAAGT TGGAATAATT GAAGAGGAAG AGAAATCAAT TTCAAATAAA
TTAAATGAAC AAATCCTTTG CTTACCTAAT CTTCCAGAGA AGAATTCTCT GGAAGGGAAA
AACGAAAAAG ATAATAAGGA ACTAAGAAGA TGGGGAGAAC CTATTTCAGG AAATACTTTA
AAAGAGCATT GGGAAATTGC TAATCAATTG AACCTCTGGG ATAGTGAGAG ATCTTCTGTA
ATAGCAAAAA GTCGATTTGT AACCCTTTTC AAGCACGCTG CAAAACTTGA GAGATCACTT
ATAAATTTCA TGCTTGATTT ACATATTAAA AAAGGATATT TAGAAGTTCT TCCCCCAGCT
CTTGTTAACA CAGCCAGTCT TACTGGTTCT GGACAGTTAC CAAAATTTGC AGAAGAAAGT
TTTCGATGTG CTGATGATGA TTTATGGCTG ACTCCTACTG CTGAAGTTCC AATAACATCT
CTCCATCGTG GAGAGATCAT CCCTAGAGAT TTGTTACCAT TAAAGTACGT TGCTTATAGC
CCTTGTTTCA GAAGAGAAGC GGGAAGTTAC GGAAGGGATA CTAGAGGTCT AATCAGACTT
CATCAATTCA ATAAAGTTGA ACTATATTGG TTTTCTACTC CAGAAACATC TGAAGATGCT
CTAGAACAAA TCACATCTGA TGCAGAGTCT GTGTTGCAAG AACTTGAACT ACCGTATCGA
GTAATTCAAC TTTGCACAGG CGACTTAGGT TTCTCAGCAA AAAAAACTTA TGATTTGGAG
GTTTGGCTTC CAGGTGCTAA CACTTTTAGA GAAATATCTA GTTGCAGTAA TTGTGGAGAT
TTTCAAGCTA GACGTTCATC AATACGAACA AAAGATAACA ATAAAAAAAA CATACTTTTG
CATACATTAA ATGGAAGTGG ATTGGCTATT GGTCGCACTA TGGCTGCCAT TTTGGAAAAT
GGTCAACAAA GCGATGGAAG TATCAATTTG CCAAAAGCCT TAATACCTTA CTTCGGTTCA
AACAAATTAC AGCCAGAATA A
 
Protein sequence
MIDQRLLREN PNLISEGLKS RGMDVDLGPL QKFCKDLKDL EEKRNSLQAQ GNSIGKEVGQ 
KIKQGLPHDS EEISNLRVKG NQIKKQVGII EEEEKSISNK LNEQILCLPN LPEKNSLEGK
NEKDNKELRR WGEPISGNTL KEHWEIANQL NLWDSERSSV IAKSRFVTLF KHAAKLERSL
INFMLDLHIK KGYLEVLPPA LVNTASLTGS GQLPKFAEES FRCADDDLWL TPTAEVPITS
LHRGEIIPRD LLPLKYVAYS PCFRREAGSY GRDTRGLIRL HQFNKVELYW FSTPETSEDA
LEQITSDAES VLQELELPYR VIQLCTGDLG FSAKKTYDLE VWLPGANTFR EISSCSNCGD
FQARRSSIRT KDNNKKNILL HTLNGSGLAI GRTMAAILEN GQQSDGSINL PKALIPYFGS
NKLQPE