Gene EcSMS35_2085 details


Gene Information

Locus tag: EcSMS35_2085
Symbol:
ID: 6143510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2099521
End bp: 2100942
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 51%
IMG OID: 641616961
Product: phospholipase D
Protein accession: YP_001744137
Protein GI: 170682775
COG category: [I] Lipid transport and metabolism
COG ID: [COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes
TIGRFAM ID:
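
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2099521, 2100942)
print(length_bp)                  # 1422
print(protein_length(length_bp))  # 473
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.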


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.169577
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.374109
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCCCGGC TGGCGAGCGC GGTGCTGCCA CTGTGTTCGC AACATCCCGG TCAGTGTGGC 
CTTTTTCCTC TGGAGAAAAG TCTGGATGCG TTTGCCGCCC GGTATCGTCT GGCCGAAATG
GCAGAGCATA CGCTGGATGT TCAGTATTAC ATCTGGCAGG ACGATATGTC GGGTCGGTTA
CTGTTTTCCG CCCTGTTAGC CGCAGCAAAG CGTGGCGTTC GCGTCCGTTT GTTGCTGGAC
GACAACAATA CGCCCGGACT TGACGACATT TTACGCTTGC TTGACAGTCA TCCACGCATT
GAAGTCCGGC TTTTTAATCC TTTCTCGTTT CGCTTGCTAC GTCCGCTTGG TTATATCACC
GACTTTTCCC GTCTCAATCG CCGTATGCAC AATAAAAGTT TCACTGTCGA TGGCGTGGTT
ACCCTGGTGG GAGGACGAAA TATTGGTGAT GCCTATTTTG GGGCAGGGGA AGAGCCACTT
TTTTCGGATT TAGATGTCAT GGCAATAGGA CCGGTGGTAG AGGACGTTGC CGATGATTTC
GCCCGCTACT GGTATTGCAA ATCGGTTTCA CCCTTACAGC AGGTGCTGGA TGTCACGGAA
GGTGAAATGG CGGATCGCAT CGAGTTACCC GCCTCCTGGC ATAACGATGC CATGACGCAT
CGTTATTTAC GCAAAATGGA ATCCAGTCCG TTTATAAATC ATCTGGTTGA TGGAACGTTG
CCGCTTATCT GGGCGAAGAC ACGTTTATTA AGTGATGATC CGGCGAAAGG GGAGGGCAAG
GCAAAACGGC ATTCACTGTT ACCGCAGCGC CTGTTCGATA TCATGGGCTC ACCCAGTGAA
CGCATCGATA TTATCTCTTC CTATTTTGTA CCGACACGCG CAGGTGTGGC GCAACTCTTA
CGTATGGTGA GAAAAGGGGT AAAGATTGCG ATCCTAACCA ATTCTCTTGC CGCTAACGAT
GTTGCTGTCG TCCATGCCGG ATACGCACGC TGGCGCAAAA AATTGCTCCG CTACGGCGTG
GAATTATATG AACTCAAGCC GACGCGTGAA CAAAGTAGTA CGTTACACGA TCGCGGCATC
ACCGGTAATT CCGGTGCCAG CCTGCATGCT AAAACCTTTA GCATCGATGG TAAAACGGTG
TTTATCGGCT CTTTCAATTT CGATCCGCGT TCGACATTGC TCAATACTGA AATGGGCTTC
GTGATAGAGA GCGAAACGCT GGCACAGTTA ATTGATAAAC GCTTTATTCA GAGCCAGTAT
GATGCGGCCT GGCAGCTCCG TCTGGACAGG TGGGGACGGA TCAACTGGGT TGATCGTCAT
GCAAAGAAAG AGATTGTTCT CAAAAAAGAA CCCGCCACCA GTTTCTGGAA GCGGGTTATG
GTCAGACTGG CGTCGATATT GCCCGTGGAA TGGTTATTAT AA
 
Protein sequence
MPRLASAVLP LCSQHPGQCG LFPLEKSLDA FAARYRLAEM AEHTLDVQYY IWQDDMSGRL 
LFSALLAAAK RGVRVRLLLD DNNTPGLDDI LRLLDSHPRI EVRLFNPFSF RLLRPLGYIT
DFSRLNRRMH NKSFTVDGVV TLVGGRNIGD AYFGAGEEPL FSDLDVMAIG PVVEDVADDF
ARYWYCKSVS PLQQVLDVTE GEMADRIELP ASWHNDAMTH RYLRKMESSP FINHLVDGTL
PLIWAKTRLL SDDPAKGEGK AKRHSLLPQR LFDIMGSPSE RIDIISSYFV PTRAGVAQLL
RMVRKGVKIA ILTNSLAAND VAVVHAGYAR WRKKLLRYGV ELYELKPTRE QSSTLHDRGI
TGNSGASLHA KTFSIDGKTV FIGSFNFDPR STLLNTEMGF VIESETLAQL IDKRFIQSQY
DAAWQLRLDR WGRINWVDRH AKKEIVLKKE PATSFWKRVM VRLASILPVE WLL
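
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used to build the record: it assumes translation table 11 shares the standard codon-to-amino-acid assignments, and that an alternative start codon (here TTG) is read as methionine at the first position. The set of start codons and all function names are assumptions made for illustration.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Codon table in TCAG order; '*' marks a stop codon.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiation codons for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("TTGCCCCGGC TGGCGAGCGC GGTGCTGCCA"))  # MPRLASAVLP
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.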