Gene EcSMS35_2076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2076 
SymbollpxL 
ID6144969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2090715 
End bp2091635 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content53% 
IMG OID641616952 
Productlipid A biosynthesis lauroyl acyltransferase 
Protein accessionYP_001744128 
Protein GI170681218 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1560] Lauroyl/myristoyl acyltransferase 
TIGRFAM ID[TIGR02207] lipid A biosynthesis lauroyl (or palmitoleoyl) acyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.836501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.965956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATC TACCCAAGTT CTCCACCGCA CTGCTTCATC CGCGTTATTG GTTAACCTGG 
TTGGGTATTG GCGTACTTTG GTTAGTCGTG CAATTGCCCT ACCCGGTTAT CTACCGCCTC
GGTTGTGGAT TAGGAAAACT TGCGTTACGT TTTATGAAAC GACGCGCAAA AATTGTGCAT
CGCAACCTGG AACTGTGCTT CCCGGAAATG AGCGAACAAG AACGCCGTAA AATGGTGGTG
AAGAATTTCG AATCCGTTGG CATGGGACTG ATGGAAACTG GCATGGCGTG GTTCTGGTCG
GACCGCAGAA TCGCCCGCTG GACGGAAGTG ATCGGCATGG AACACATTCG TGACGTGCAG
GCGCAAAAAC GCGGCATCCT GTTAGTTGGC ATCCATTTTC TGACACTGGA GCTTGGTGCG
CGGCAGTTTG GTATGCAGGA ACCGGGGATT GGCGTTTACC GCCCGAACGA TAATCCACTG
ATTGACTGGC TACAAACCTG GGGCCGTTTA CGCTCAAATA AATCGATGCT CGACCGCAAA
GATTTAAAAG GCATGATTAA AGCCCTGAAA AAAGGCGAAG TGGTCTGGTA CGCACCGGAT
CATGATTACG GCCCGCGCTC AAGCGTTTTC GTCCCGTTGT TTGCCGTTGA GCAGGCTGCG
ACCACGACCG GAACCTGGAT GCTGGCACGG ATGTCCGGCG CATGTCTGGT GCCCTTCGTT
CCATGCCGTA AGCCAGATGG CAAAGGGTAT CAGTTGATTA TACTGCCGCC AGAGTGTTCT
CCGCCGCTGG ATGATGCCGA AACCACCGCC GCGTGGATGA ACAAAGTGGT CGAAAAATGC
ATCATGATGG CACCAGAGCA GTATATGTGG TTACACCGTC GCTTTAAAAC ACGCCCGGAA
GGCGTTCCTT CACGCTATTA A
 
Protein sequence
MTNLPKFSTA LLHPRYWLTW LGIGVLWLVV QLPYPVIYRL GCGLGKLALR FMKRRAKIVH 
RNLELCFPEM SEQERRKMVV KNFESVGMGL METGMAWFWS DRRIARWTEV IGMEHIRDVQ
AQKRGILLVG IHFLTLELGA RQFGMQEPGI GVYRPNDNPL IDWLQTWGRL RSNKSMLDRK
DLKGMIKALK KGEVVWYAPD HDYGPRSSVF VPLFAVEQAA TTTGTWMLAR MSGACLVPFV
PCRKPDGKGY QLIILPPECS PPLDDAETTA AWMNKVVEKC IMMAPEQYMW LHRRFKTRPE
GVPSRY