Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2085 |
Symbol | |
ID | 6143510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2099521 |
End bp | 2100942 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616961 |
Product | phopholipase D |
Protein accession | YP_001744137 |
Protein GI | 170682775 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.169577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.374109 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCCGGC TGGCGAGCGC GGTGCTGCCA CTGTGTTCGC AACATCCCGG TCAGTGTGGC CTTTTTCCTC TGGAGAAAAG TCTGGATGCG TTTGCCGCCC GGTATCGTCT GGCCGAAATG GCAGAGCATA CGCTGGATGT TCAGTATTAC ATCTGGCAGG ACGATATGTC GGGTCGGTTA CTGTTTTCCG CCCTGTTAGC CGCAGCAAAG CGTGGCGTTC GCGTCCGTTT GTTGCTGGAC GACAACAATA CGCCCGGACT TGACGACATT TTACGCTTGC TTGACAGTCA TCCACGCATT GAAGTCCGGC TTTTTAATCC TTTCTCGTTT CGCTTGCTAC GTCCGCTTGG TTATATCACC GACTTTTCCC GTCTCAATCG CCGTATGCAC AATAAAAGTT TCACTGTCGA TGGCGTGGTT ACCCTGGTGG GAGGACGAAA TATTGGTGAT GCCTATTTTG GGGCAGGGGA AGAGCCACTT TTTTCGGATT TAGATGTCAT GGCAATAGGA CCGGTGGTAG AGGACGTTGC CGATGATTTC GCCCGCTACT GGTATTGCAA ATCGGTTTCA CCCTTACAGC AGGTGCTGGA TGTCACGGAA GGTGAAATGG CGGATCGCAT CGAGTTACCC GCCTCCTGGC ATAACGATGC CATGACGCAT CGTTATTTAC GCAAAATGGA ATCCAGTCCG TTTATAAATC ATCTGGTTGA TGGAACGTTG CCGCTTATCT GGGCGAAGAC ACGTTTATTA AGTGATGATC CGGCGAAAGG GGAGGGCAAG GCAAAACGGC ATTCACTGTT ACCGCAGCGC CTGTTCGATA TCATGGGCTC ACCCAGTGAA CGCATCGATA TTATCTCTTC CTATTTTGTA CCGACACGCG CAGGTGTGGC GCAACTCTTA CGTATGGTGA GAAAAGGGGT AAAGATTGCG ATCCTAACCA ATTCTCTTGC CGCTAACGAT GTTGCTGTCG TCCATGCCGG ATACGCACGC TGGCGCAAAA AATTGCTCCG CTACGGCGTG GAATTATATG AACTCAAGCC GACGCGTGAA CAAAGTAGTA CGTTACACGA TCGCGGCATC ACCGGTAATT CCGGTGCCAG CCTGCATGCT AAAACCTTTA GCATCGATGG TAAAACGGTG TTTATCGGCT CTTTCAATTT CGATCCGCGT TCGACATTGC TCAATACTGA AATGGGCTTC GTGATAGAGA GCGAAACGCT GGCACAGTTA ATTGATAAAC GCTTTATTCA GAGCCAGTAT GATGCGGCCT GGCAGCTCCG TCTGGACAGG TGGGGACGGA TCAACTGGGT TGATCGTCAT GCAAAGAAAG AGATTGTTCT CAAAAAAGAA CCCGCCACCA GTTTCTGGAA GCGGGTTATG GTCAGACTGG CGTCGATATT GCCCGTGGAA TGGTTATTAT AA
|
Protein sequence | MPRLASAVLP LCSQHPGQCG LFPLEKSLDA FAARYRLAEM AEHTLDVQYY IWQDDMSGRL LFSALLAAAK RGVRVRLLLD DNNTPGLDDI LRLLDSHPRI EVRLFNPFSF RLLRPLGYIT DFSRLNRRMH NKSFTVDGVV TLVGGRNIGD AYFGAGEEPL FSDLDVMAIG PVVEDVADDF ARYWYCKSVS PLQQVLDVTE GEMADRIELP ASWHNDAMTH RYLRKMESSP FINHLVDGTL PLIWAKTRLL SDDPAKGEGK AKRHSLLPQR LFDIMGSPSE RIDIISSYFV PTRAGVAQLL RMVRKGVKIA ILTNSLAAND VAVVHAGYAR WRKKLLRYGV ELYELKPTRE QSSTLHDRGI TGNSGASLHA KTFSIDGKTV FIGSFNFDPR STLLNTEMGF VIESETLAQL IDKRFIQSQY DAAWQLRLDR WGRINWVDRH AKKEIVLKKE PATSFWKRVM VRLASILPVE WLL
|
| |