Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1426 |
Symbol | |
ID | 6967143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1409083 |
End bp | 1410504 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385400 |
Product | phopholipase D |
Protein accession | YP_002269894 |
Protein GI | 209399856 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.33637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.352917 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCCGGC TGGCGAGCGC GGTGCTGCCA CTGTGTTCGC AACATCCCGG TCAGTGTGGT CTTTTCCCTC TGGAGAAAAG TCTGGATGCG TTTGCCGCCC GGTATCGTCT GGCCGAAATG TCAGAGCATA CGCTGGATGT TCAGTATTAC ATCTGGCAGG ACGATATGTC GGGTCGGTTA CTGTTTTCCG CCCTGTTAGC CGCAGCAAAG CGTGGCGTTC GCGTCCGTTT GTTGCTGGAC GACAACAATA CGCCCGGACT TGACGACATT TTACGCTTGC TTGACAGCCA TCCCCGCATT GAAGTCCGGC TTTTTAATCC TTTCTCGTTT CGCTTGCTGC GTCCGCTTGG TTATATCACC GACTTTTCCC GTCTCAATCG CCGTATGCAC AATAAAAGTT TCACTGTCGA TGGCGTGGTG ACGTTGGTAG GGGGGCGAAA TATTGGTGAT GCCTATTTTG GGGCAGGGGA AGAGCCACTT TTTTCGGATT TAGATGTCAT GGCAATAGGA CCCGTAGTAG AGGACGTTGC CGATGATTTC GCCCGCTACT GGTATTGCAA ATCGGTTTCA CCCTTACAGC AGGTGCTGGA TGTCCCGGAA GGTGAAATGG CGGATCGCAT CGAGTTACCC GCCTCCTGGC ATAACGATGC CATGACGCAT CGTTACTTAC GCAAAATGGA ATCCAGTCCG TTTATAAATC ATCTGGTTGA TGGAACGTTG CCGCTTATCT GGGCGAAAAC ACGTTTATTA AGTGATGATC CGGCGAAAGG GGAGGGCAAG GCAAAACGGC ATTCACTGTT ACCGCAGCGC CTGTTCGATA TCATGGGCTC ACCCAGTGAA CGCATCGATA TTATCTCTTC CTATTTTGTA CCGACACGCG CAGGTGTGGC GCAACTCTTA CGGATGGTGA GAAAAGGGGT AAAGATTGCG ATCCTAACCA ATTCTCTTGC CGCTAACGAT GTTGCTGTCG TCCATGCCGG ATACGCGCGC TGGCGCAAAA AATTGCTCCG CTATGGCGTG GAATTATATG AACTCAAGCC GACGCGTGAA CAAAGTAGTA CGTTACACGA TCGCGGCATA ACCGGTAATT CCGGTGCCAG CCTGCATGCT AAAACCTTTA GCATCGATGG TAAAACGGTG TTTATCGGTT CTTTCAATTT CGATCCGCGT TCAACATTGC TCAATACTGA AATGGGCTTC GTGATAGAGA GCGAAACGCT GGCTCAGTTA ATTGATAAAC GCTTTATTCA GAGCCAGTAT GATGCGGCCT GGCAGCTCCG TCTGGACAGG TGGGGACGGA TCAACTGGGT TGATCGTCAT GCAAAGAAAG AGATTGTTCT CAAAAAAGAA CCCGCTACCA GTTTCTGGAA GCGGGTTATG GTCAGACTGG CGTCGATATT GCCCGTGGAA TGGTTATTGT AA
|
Protein sequence | MPRLASAVLP LCSQHPGQCG LFPLEKSLDA FAARYRLAEM SEHTLDVQYY IWQDDMSGRL LFSALLAAAK RGVRVRLLLD DNNTPGLDDI LRLLDSHPRI EVRLFNPFSF RLLRPLGYIT DFSRLNRRMH NKSFTVDGVV TLVGGRNIGD AYFGAGEEPL FSDLDVMAIG PVVEDVADDF ARYWYCKSVS PLQQVLDVPE GEMADRIELP ASWHNDAMTH RYLRKMESSP FINHLVDGTL PLIWAKTRLL SDDPAKGEGK AKRHSLLPQR LFDIMGSPSE RIDIISSYFV PTRAGVAQLL RMVRKGVKIA ILTNSLAAND VAVVHAGYAR WRKKLLRYGV ELYELKPTRE QSSTLHDRGI TGNSGASLHA KTFSIDGKTV FIGSFNFDPR STLLNTEMGF VIESETLAQL IDKRFIQSQY DAAWQLRLDR WGRINWVDRH AKKEIVLKKE PATSFWKRVM VRLASILPVE WLL
|
| |