Gene EcHS_A1167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1167 
Symbol 
ID5591566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1170019 
End bp1171449 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content50% 
IMG OID640920326 
Productphopholipase D 
Protein accessionYP_001457889 
Protein GI157160571 
COG category[I] Lipid transport and metabolism 
COG ID[COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value0.383725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATT TGCCCCGGCT GGCGAGCGCG GTGCTGCCAC TGTGTTCGCA ACATCCCGGT 
CAGTGTGGCC TTTTTCCTCT GGAGAAAAGT CTGGATGCGT TTGCAGCCCG GTATCGTCTG
GCCGAAATGG CAGAGCATAC GCTGGATGTT CAGTATTACA TCTGGCAGGA CGATATGTCG
GGTCGGTTAC TGTTTTCCGC CCTGTTAGCC GCAGCAAAGC GTGGCGTTCG CGTCCGTTTG
TTGCTGGACG ACAACAATAC GCCCGGACTT GACGATATTT TACGCTTGCT TGACAGCCAT
CCCCGCATTG AAGTCCGGCT TTTTAATCCT TTCTCGTTTC GCTTGCTGCG TCCGCTTGGT
TATATCACCG ACTTTTCCCG TCTCAATCGC CGTATGCACA ATAAAAGTTT CACTGTCGAT
GGCGTGGTTA CCCTGGTGGG AGGACGAAAT ATTGGTGATG CCTATTTTGG GGCAGGGGAA
GAGCCACTTT TTTCGGATTT AGATGTTATG GCAATAGGAC CCGTGGTAGA GGACGTTGCC
GATGATTTCG CCCGCTACTG GTATTGCAAA TCGGTTTCAC CCTTACAGCA GGTGCTGGAT
GTCCCGGAAG GTGAAATGGC GGATCGCATC GAGTTACCCG CCTCCTGGCA TAACGATGCC
ATGACGCATC GTTACTTACG CCAAATGGAA TCCAGTCCGT TTATAAATCA TCTGGTTGAT
GGAACGTTGC CGCTTATCTG GGCGAAAACA CGTTTATTAA GTGATGATCC GGCGAAAGGG
GAGGGCAAGG CAAAACGGCA TTCACTGTTA CCGCAGCGCC TGTTCGATAT CATGGGCTCA
CCCAGTGAAC GCATCGATAT TATCTCTTCC TATTTTGTAC CGACACGCGC AGGTGTGGCG
CAACTCTTAC GGATGGTGAG AAAAGGGGTA AAGATTGCGA TCCTAACCAA TTCTCTTGCC
GCTAACGATG TTGCTGTCGT CCATGCCGGA TACGCGCGCT GGCGCAAAAA ATTGCTCCGC
TATGGCGTGG AATTATATGA ACTCAAGCCG ACGCGTGAAC AAAGTAGTAC GTTACACGAT
CGCGGCATAA CCGGTAATTC CGGAGCTAGC CTGCATGCTA AAACTTTTAG CATCGATGGT
AAAACGGTGT TTATCGGCTC TTTCAATTTC GATCCGCGTT CAACATTGCT CAATACTGAA
ATGGGCTTCG TGATAGAGAG CGAAACGCTG GCACAGTTAA TTGATAAACG CTTTATTCAG
AGCCAGTATG ATGCGGCCTG GCAGCTCCGT CTGGACAGGT GGGGACGGAT CAACTGGGTT
GATCGTCATG CAAAGAAAGA GATTATTCTC AAAAAAGAAC CCGCCACCAG TTTCTGGAAG
CGGGTTATGG TCAGACTGGC GTCGATATTG CCCGTGGAAT GGTTATTGTA A
 
Protein sequence
MNDLPRLASA VLPLCSQHPG QCGLFPLEKS LDAFAARYRL AEMAEHTLDV QYYIWQDDMS 
GRLLFSALLA AAKRGVRVRL LLDDNNTPGL DDILRLLDSH PRIEVRLFNP FSFRLLRPLG
YITDFSRLNR RMHNKSFTVD GVVTLVGGRN IGDAYFGAGE EPLFSDLDVM AIGPVVEDVA
DDFARYWYCK SVSPLQQVLD VPEGEMADRI ELPASWHNDA MTHRYLRQME SSPFINHLVD
GTLPLIWAKT RLLSDDPAKG EGKAKRHSLL PQRLFDIMGS PSERIDIISS YFVPTRAGVA
QLLRMVRKGV KIAILTNSLA ANDVAVVHAG YARWRKKLLR YGVELYELKP TREQSSTLHD
RGITGNSGAS LHAKTFSIDG KTVFIGSFNF DPRSTLLNTE MGFVIESETL AQLIDKRFIQ
SQYDAAWQLR LDRWGRINWV DRHAKKEIIL KKEPATSFWK RVMVRLASIL PVEWLL