Gene NATL1_02071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02071 
SymbolplsX 
ID4780601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp193062 
End bp194387 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content39% 
IMG OID640083472 
Productputative glycerol-3-phosphate acyltransferase PlsX 
Protein accessionYP_001014036 
Protein GI124024920 
COG category[I] Lipid transport and metabolism 
COG ID[COG0416] Fatty acid/phospholipid biosynthesis enzyme 
TIGRFAM ID[TIGR00182] fatty acid/phospholipid synthesis protein PlsX 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.564797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAAAA ATCACCTAAA TAATAAAACT AATCGCTCTA AAGCAATTAG AAGATTGGTC 
ATTTGGTATC GCCGAAACTC AGCTGTAACA AGTCTTGTTG ACACTGCAAC AAGCTCAGCC
ACAGCAGCTA GTAATGTCGC AGGAACAGTT GTTTCTAACG CTGGTTCCGT TGTTACTAAT
GCTGGATCAA TTGCTAGAAG TACTTTAGAG CCATTTGTGT TTGATCCCCT TAGAAGACTC
CAAGGTGGAG AAAGTACGGG TGAAAAAAAT ACAATTGAAG ATTCTGACAG AATTTGGGTC
GCTGTCGATG GAATGGGAGG AGATTATGCA CCTGGAGCAA TTCTTGATGG GTGTTTGAAA
TCTTTGTCTC TACTTCCATT GAAAATTAAA TTTGTAGGTG AAGTTGAGAA AGTAGAAAAA
GCAGCGATTG AATTTGGCTT AAAAGAATCT CTAGACAAAG CTATGGAAGA TGGAAAATTT
CAATTAATTT CTAGTGGTCT TTCAGTTGGC ATGGATGAAG AAGCCACTGC AGTGCGTAAA
AAAAAGGATG CGAGCATAAA TATTGCAATG AAATTGGTTA GAGAAGGAAA AGCTATGGGT
GTCTATTCAG CTGGGAATTC TGGAGCAATG ATGGCCTCAG CCATTTTTAA ATTGGGACGT
TTAAAAGGGA TTGATCGTCC AGCAATTGGA GCATTATTCC CAACTAAAGA CCCTGGGCAA
CCTGTATTGG TTTTAGATGT TGGAGCGAAT ATGGATTGCA AACCAACCTA TTTGCATCAA
TTTGCCCTCC TTGGAAACAT CTACAGTCGA GATGTTTTGC AGGTAGACAA GCCAAGAATA
GGATTATTGA ATATTGGTGA AGAATCTTGT AAGGGTAATG ATCTTTCTCT AGCAACTTAC
AAACTTTTAA ACGAGGAAGA ACGTTTTTGC TTTTCTGGCA ATTGTGAAGG GCGAGATGTA
TTATCAGGCG ATTTCGATGT TGTGGTTTGT GATGGATTTA CAGGAAACGT TTTGCTTAAA
TTTTTAGAAT CAGTAGGAAG CGTTCTTTTG GGAGTTTTGA GAGCTGAGTT GCCTAGAGGA
AGAAGAGGCA AAGTTGGTTC TGCTTTTTTA AGAAATAATT TAAAACGAAT AAAGAAACGC
TTAGATCATG CAGAACATGG TGGTGCTTTA CTTCTAGGAA TAAATGGAAT TTGTGTGATT
GGTCACGGAG GAAGTAAAGC TCTATCTGTT TTAAGTGCTT TAAGAGTTAT GCATTCAGCT
GCAAGCCACG GAGTAATGGA TGATTTAGCG GATTTAAATA AACCAGATGT CTTAAGGTCT
GATTAG
 
Protein sequence
MEKNHLNNKT NRSKAIRRLV IWYRRNSAVT SLVDTATSSA TAASNVAGTV VSNAGSVVTN 
AGSIARSTLE PFVFDPLRRL QGGESTGEKN TIEDSDRIWV AVDGMGGDYA PGAILDGCLK
SLSLLPLKIK FVGEVEKVEK AAIEFGLKES LDKAMEDGKF QLISSGLSVG MDEEATAVRK
KKDASINIAM KLVREGKAMG VYSAGNSGAM MASAIFKLGR LKGIDRPAIG ALFPTKDPGQ
PVLVLDVGAN MDCKPTYLHQ FALLGNIYSR DVLQVDKPRI GLLNIGEESC KGNDLSLATY
KLLNEEERFC FSGNCEGRDV LSGDFDVVVC DGFTGNVLLK FLESVGSVLL GVLRAELPRG
RRGKVGSAFL RNNLKRIKKR LDHAEHGGAL LLGINGICVI GHGGSKALSV LSALRVMHSA
ASHGVMDDLA DLNKPDVLRS D