Gene EcSMS35_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2037 
SymbolplsX 
ID6144324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2057836 
End bp2058906 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content52% 
IMG OID641616913 
Productputative glycerol-3-phosphate acyltransferase PlsX 
Protein accessionYP_001744089 
Protein GI170679989 
COG category[I] Lipid transport and metabolism 
COG ID[COG0416] Fatty acid/phospholipid biosynthesis enzyme 
TIGRFAM ID[TIGR00182] fatty acid/phospholipid synthesis protein PlsX 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.0879e-09 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value2.25509e-07 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
TTGACACGTC TAACCCTGGC GTTAGATGTC ATGGGAGGGG ATTTTGGCCC TTCCGTGACA 
GTGCCTGCAG CATTGCAGGC ACTGAATTCT AATTCGCAAC TCACTCTTCT TTTAGTCGGC
AATCCCGACG CCATCACGCC ATTACTTGCT AAAGCTGACT TTGAACAACG TTCGCGTCTG
CAGATTATTC CTGCGCAGTC AGTTATCGCC AGTGATGCCC GGCCTTCGCA AGCTATCCGC
GCCAGTCGTG GGAGTTCAAT GCGCGTGGCC CTGGAGCTGG TGAAAGAGGG TCGAGCGCAA
GCCTGTGTCA GTGCCGGTAA TACCGGGGCG CTGATGGGGC TGGCAAAATT ATTACTCAAG
CCCCTGGAGG GGATTGAGCG TCCGGCGCTG GTGACGGTAT TACCGCATCA GCAAAAGGGC
AAAACGGTGG TCCTCGATTT AGGGGCCAAC GTCGATTGTG ACAGTACAAT GTTGGTGCAA
TTTGCCATTA TGGGCTCAGT TCTGGCTGAA GAGGTGGTGG AAATTCCTAA TCCTCGCGTG
GCGTTGCTCA ATATTGGTGA AGAAGAAGTA AAGGGTCTCG ACAGTATTCG GGATGCCTCA
GCGGTGCTTA AAACAATCCC TTCTATCAAT TATATCGGCT ATCTTGAAGC CAATGAGTTG
TTAACTGGCA AGACAGATGT GCTGGTTTGT GATGGCTTTA CAGGAAATGT CACATTAAAG
ACGATGGAAG GTGTTGTCAG GATGTTCCTT TCTCTGCTGA AATCTCAGGG TGAAGGGAAA
AAACGGTCGT GGTGGCTACT GTTATTAAAG CGTTGGCTAC AAAAGAGCCT GACGAGGCGA
TTCAGTCACC TCAACCCCGA CCAGTATAAC GGCGCCTGTC TGTTAGGATT GCGCGGCACG
GTGATAAAAA GTCATGGTGC AGCCAATCAG CGAGCTTTTG CGGTCGCGAT TGAACAGGCA
GTGCAGGCGG TGCAGCGACA AGTTCCTCAG CGAATTGCCG CTCGCCTGGA ATCTGTATAC
CCAGCTGGTT TTGAGCTGCT GGACGGTGGC AAAAGCGGAA CTCTGCGGTA G
 
Protein sequence
MTRLTLALDV MGGDFGPSVT VPAALQALNS NSQLTLLLVG NPDAITPLLA KADFEQRSRL 
QIIPAQSVIA SDARPSQAIR ASRGSSMRVA LELVKEGRAQ ACVSAGNTGA LMGLAKLLLK
PLEGIERPAL VTVLPHQQKG KTVVLDLGAN VDCDSTMLVQ FAIMGSVLAE EVVEIPNPRV
ALLNIGEEEV KGLDSIRDAS AVLKTIPSIN YIGYLEANEL LTGKTDVLVC DGFTGNVTLK
TMEGVVRMFL SLLKSQGEGK KRSWWLLLLK RWLQKSLTRR FSHLNPDQYN GACLLGLRGT
VIKSHGAANQ RAFAVAIEQA VQAVQRQVPQ RIAARLESVY PAGFELLDGG KSGTLR