Gene ECH74115_1433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1433 
SymbollpxL 
ID6966943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1418389 
End bp1419309 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content53% 
IMG OID643385406 
Productlipid A biosynthesis lauroyl acyltransferase 
Protein accessionYP_002269900 
Protein GI209397977 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1560] Lauroyl/myristoyl acyltransferase 
TIGRFAM ID[TIGR02207] lipid A biosynthesis lauroyl (or palmitoleoyl) acyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.728097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.116938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATC TACCCAAGTT CTCCACCGCA CTGCTTCATC CGCGTTATTG GTTAACCTGG 
TTGGGTATTG GCGTACTTTG GTTAGTCGTG CAATTGCCCT ACCCGGTTAT CTACCGCCTC
GGTTGTGGAT TAGGAAAACT GGCGTTACGT TTTATGAAAC GACGCGCAAA AATTGTGCAT
CGCAACCTGG AACTGTGCTT CCCGGAAATG AGCGAACAAG AACGCCGTAA AATGGTGGTG
AAGAATTTCG AATCCGTTGG CATGGGACTG ATGGAAACTG GCATGGCGTG GTTCTGGCCG
GACCGCCGAA TCGCTCGCTG GACGGAAGTG ATCGGCATGG AACACATTCG TGACGTGCAG
GCGCAAAAAC GCGGCATCCT GTTAGTTGGC ATCCATTTTC TGACACTGGA GCTGGGTGCG
CGGCAGTTTG GTATGCAGGA ACCGGGGATT GGCGTTTACC GCCCGAACGA TAATCCACTG
ATTGACTGGC TACAAACCTG GGGCCGTTTG CGCTCAAATA AATCGATGCT CGACCGCAAA
GATTTAAAAG GCATGATTAA AGCCCTGAAA AAAGGCGAAG TGGTCTGGTA CGCACCGGAT
CATGATTACG GCCCGCGCTC AAGCGTTTTC GTCCCTTTAT TTGCCGTTGA GCAGGCTGCG
ACCACGACCG GAACCTGGAT GCTGGCACGG ATGTCCGGCG CATGTCTGGT GCCCTTCGTT
CCACGCCGTA AGCCAGATGG CAAAGGGTAT CAATTGATTA TGCTGCCGCC AGAGTGTTCT
CCGCCGCTGG ATGATGCCGA AACTACCGCC GCGTGGATGA ACAAAGTGGT CGAAAAATGC
ATCATGATGG CACCAGAGCA GTATATGTGG TTACACCGTC GCTTTAAAAC ACGCCCGGAA
GGCGTTCCTT CACGCTATTA A
 
Protein sequence
MTNLPKFSTA LLHPRYWLTW LGIGVLWLVV QLPYPVIYRL GCGLGKLALR FMKRRAKIVH 
RNLELCFPEM SEQERRKMVV KNFESVGMGL METGMAWFWP DRRIARWTEV IGMEHIRDVQ
AQKRGILLVG IHFLTLELGA RQFGMQEPGI GVYRPNDNPL IDWLQTWGRL RSNKSMLDRK
DLKGMIKALK KGEVVWYAPD HDYGPRSSVF VPLFAVEQAA TTTGTWMLAR MSGACLVPFV
PRRKPDGKGY QLIMLPPECS PPLDDAETTA AWMNKVVEKC IMMAPEQYMW LHRRFKTRPE
GVPSRY