Gene ECH74115_1469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1469 
SymbolplsX 
ID6969004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1450301 
End bp1451371 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content52% 
IMG OID643385440 
Productputative glycerol-3-phosphate acyltransferase PlsX 
Protein accessionYP_002269934 
Protein GI209397257 
COG category[I] Lipid transport and metabolism 
COG ID[COG0416] Fatty acid/phospholipid biosynthesis enzyme 
TIGRFAM ID[TIGR00182] fatty acid/phospholipid synthesis protein PlsX 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000150327 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00000000000717792 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGACACGTC TAACCCTGGC GTTAGATGTC ATGGGAGGGG ATTTTGGCCC TTCCGTGACA 
GTGCCTGCAG CATTGCAGGC ACTGAATTCT AATTCGCAAC TCACTCTTCT TTTAGTCGGC
AATCCCGACG CCATCACGCC ATTACTTGCT AAAGCTGACT TTGAACAACG TTCGCGTCTG
CAGATTATTC CTGCGCAGTC AGTTATCGCC AGTGATGCCC GGCCTTCGCA AGCTATCCGC
GCCAGTCGTG GGAGTTCAAT GCGCGTGGCC CTGGAGCTGG TGAAAGAAGG TCGAGCGCAA
GCCTGTGTCA GTGCCGGTAA TACCGGGGCA CTGATGGGGC TGGCAAAATT ATTACTCAAG
CCCCTGGAGG GGATTGAGCG TCCGGCGCTG GTGACGGTAT TACCACATCA GCAAAAGGGC
AAAACGGTGG TCCTTGACTT AGGGGCCAAC GTCGATTGTG ACAGCACAAT GTTGGTGCAA
TTTGCCATTA TGGGCTCAGT CCTGGCTGAA GAGGTGGTGG AAATTCCCAA TCCTCGCGTG
GCGTTGCTCA ATATTGGTGA AGAAGAAGTA AAGGGTCTCG ATAGTATTCG GGATGCCTCA
GCGGTGCTTA AAACAATCCC TTCTATCAAT TATATCGGCT ATCTTGAAGC CAATGAGTTG
TTAACTGGCA AGACAGATGT GCTGGTTTGT GACGGCTTTA CAGGAAATGT CACATTAAAG
ACGATGGAAG GTGTTGTCAG GATGTTCCTT TCTCTGCTGA AATCTCAGGG TGAAGGGAAA
AAACGGTCGT GGTGGCTACT GTTATTAAAG CGTTGGCTAC AAAAGAGCCT GACGAGGCGA
TTCAGTCACC TCAACCCCGA CCAGTATAAC GGCGCCTGTC TGTTAGGATT GCGCGGCACG
GTGATAAAAA GTCATGGTGC AGCCAATCAG CGAGCTTTTG CGGTCGCGAT TGAACAGGCA
GTGCAGGCGG TGCAGCGACA AGTTCCTCAG CGAATTGCCG CTCGCCTGGA ATCTGTATAC
CCAGCTGGTT TTGAGCTGCT GGACGGTGGC AAAAGCGGAA CTCTGCGGTA G
 
Protein sequence
MTRLTLALDV MGGDFGPSVT VPAALQALNS NSQLTLLLVG NPDAITPLLA KADFEQRSRL 
QIIPAQSVIA SDARPSQAIR ASRGSSMRVA LELVKEGRAQ ACVSAGNTGA LMGLAKLLLK
PLEGIERPAL VTVLPHQQKG KTVVLDLGAN VDCDSTMLVQ FAIMGSVLAE EVVEIPNPRV
ALLNIGEEEV KGLDSIRDAS AVLKTIPSIN YIGYLEANEL LTGKTDVLVC DGFTGNVTLK
TMEGVVRMFL SLLKSQGEGK KRSWWLLLLK RWLQKSLTRR FSHLNPDQYN GACLLGLRGT
VIKSHGAANQ RAFAVAIEQA VQAVQRQVPQ RIAARLESVY PAGFELLDGG KSGTLR