Gene NATL1_16891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_16891 
SymbolpheS 
ID4779468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1376972 
End bp1377979 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content39% 
IMG OID640084973 
Productphenylalanyl-tRNA synthetase subunit alpha 
Protein accessionYP_001015509 
Protein GI124026394 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTTCAA CATTATCCCT AAAACAGCTC ATCGGTGAGC TTGAGATTCT AGAAAGCGAG 
GCGGCAAAAG AAATTGCTTC TGCTGAAAAT TCTGAATCAA TAGAGAAATT AAGGTTGAGT
TTCCTTGGGA AGAAAGGAAA ACTCTCACTT CTATTGGGAG GGATGAAAAA TCTTTCTAAT
GAAGAAAGAC CTTTAATTGG TCAAAGAGCG AACGTTTTAA AAACTCAATT GCAAGAATTA
ATCAAGGAAA AGCTTGAAAT TTTAAAAACT CAAGCTTTAA GTCAGATATT AATAAAGGAA
ACTATAGATG TTACAGCGCC TCCAACAGGT ATTCCTCAAG GACATCGCCA CCCCTTAATA
ACGACTACTG AGCAAATAAT TGATCTTTTC TTGGGTCTTG GATACCAAGT TTCTGAAGGT
CCTGAGATAG AGAATGATTA CTACAATTTC GAGGCACTGA ATATTCCACC TGATCATCCT
GCAAGAGATA TGCAAGATAC TTTTTATCTG GGAGGTGAAT ACCTTTTGAG AACTCATACA
TCGCCTGTTC AGATTCGTTG CCTTGAAAGC AAAAAGCCAC CTGTAAGAAT TGTTTCACCC
GGTCGGGTTT ATCGAAGAGA TGCAGTTGAT GCAACTCATT CGCCTGTGTT CCACCAGGTT
GAGGTCTTAG CAATTGATGA AAAGCTTGAC TTTAGTCATT TAAGAGGAAC AGTAATGGCC
TTTTTAAAAG CATTTTTTGG AGATCTTCCT ATTCGATTCA GGGCTAGCTA TTTCCCATTT
ACGGAGCCAT CAGCAGAAGT TGATGTCCAA TGGAGAGGTA AGTGGTTAGA AGTTATGGGT
TGCGGGATGG TCGATCCTGC TGTCTTAGAG GAATTAGGGA TTGATCCAGA AAAATATAGT
GGATTTGCTG CTGGACTAGG GGTTGAAAGA TTTTGCATGG TTCGTCATGG CCTAGATGAT
ATTAGAAAGT TATATACAAG TGATCTCAGA TTTTTAGAAC AATTTTAA
 
Protein sequence
MSSTLSLKQL IGELEILESE AAKEIASAEN SESIEKLRLS FLGKKGKLSL LLGGMKNLSN 
EERPLIGQRA NVLKTQLQEL IKEKLEILKT QALSQILIKE TIDVTAPPTG IPQGHRHPLI
TTTEQIIDLF LGLGYQVSEG PEIENDYYNF EALNIPPDHP ARDMQDTFYL GGEYLLRTHT
SPVQIRCLES KKPPVRIVSP GRVYRRDAVD ATHSPVFHQV EVLAIDEKLD FSHLRGTVMA
FLKAFFGDLP IRFRASYFPF TEPSAEVDVQ WRGKWLEVMG CGMVDPAVLE ELGIDPEKYS
GFAAGLGVER FCMVRHGLDD IRKLYTSDLR FLEQF