Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1167 |
Symbol | |
ID | 5591566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1170019 |
End bp | 1171449 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640920326 |
Product | phopholipase D |
Protein accession | YP_001457889 |
Protein GI | 157160571 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 0.383725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGATT TGCCCCGGCT GGCGAGCGCG GTGCTGCCAC TGTGTTCGCA ACATCCCGGT CAGTGTGGCC TTTTTCCTCT GGAGAAAAGT CTGGATGCGT TTGCAGCCCG GTATCGTCTG GCCGAAATGG CAGAGCATAC GCTGGATGTT CAGTATTACA TCTGGCAGGA CGATATGTCG GGTCGGTTAC TGTTTTCCGC CCTGTTAGCC GCAGCAAAGC GTGGCGTTCG CGTCCGTTTG TTGCTGGACG ACAACAATAC GCCCGGACTT GACGATATTT TACGCTTGCT TGACAGCCAT CCCCGCATTG AAGTCCGGCT TTTTAATCCT TTCTCGTTTC GCTTGCTGCG TCCGCTTGGT TATATCACCG ACTTTTCCCG TCTCAATCGC CGTATGCACA ATAAAAGTTT CACTGTCGAT GGCGTGGTTA CCCTGGTGGG AGGACGAAAT ATTGGTGATG CCTATTTTGG GGCAGGGGAA GAGCCACTTT TTTCGGATTT AGATGTTATG GCAATAGGAC CCGTGGTAGA GGACGTTGCC GATGATTTCG CCCGCTACTG GTATTGCAAA TCGGTTTCAC CCTTACAGCA GGTGCTGGAT GTCCCGGAAG GTGAAATGGC GGATCGCATC GAGTTACCCG CCTCCTGGCA TAACGATGCC ATGACGCATC GTTACTTACG CCAAATGGAA TCCAGTCCGT TTATAAATCA TCTGGTTGAT GGAACGTTGC CGCTTATCTG GGCGAAAACA CGTTTATTAA GTGATGATCC GGCGAAAGGG GAGGGCAAGG CAAAACGGCA TTCACTGTTA CCGCAGCGCC TGTTCGATAT CATGGGCTCA CCCAGTGAAC GCATCGATAT TATCTCTTCC TATTTTGTAC CGACACGCGC AGGTGTGGCG CAACTCTTAC GGATGGTGAG AAAAGGGGTA AAGATTGCGA TCCTAACCAA TTCTCTTGCC GCTAACGATG TTGCTGTCGT CCATGCCGGA TACGCGCGCT GGCGCAAAAA ATTGCTCCGC TATGGCGTGG AATTATATGA ACTCAAGCCG ACGCGTGAAC AAAGTAGTAC GTTACACGAT CGCGGCATAA CCGGTAATTC CGGAGCTAGC CTGCATGCTA AAACTTTTAG CATCGATGGT AAAACGGTGT TTATCGGCTC TTTCAATTTC GATCCGCGTT CAACATTGCT CAATACTGAA ATGGGCTTCG TGATAGAGAG CGAAACGCTG GCACAGTTAA TTGATAAACG CTTTATTCAG AGCCAGTATG ATGCGGCCTG GCAGCTCCGT CTGGACAGGT GGGGACGGAT CAACTGGGTT GATCGTCATG CAAAGAAAGA GATTATTCTC AAAAAAGAAC CCGCCACCAG TTTCTGGAAG CGGGTTATGG TCAGACTGGC GTCGATATTG CCCGTGGAAT GGTTATTGTA A
|
Protein sequence | MNDLPRLASA VLPLCSQHPG QCGLFPLEKS LDAFAARYRL AEMAEHTLDV QYYIWQDDMS GRLLFSALLA AAKRGVRVRL LLDDNNTPGL DDILRLLDSH PRIEVRLFNP FSFRLLRPLG YITDFSRLNR RMHNKSFTVD GVVTLVGGRN IGDAYFGAGE EPLFSDLDVM AIGPVVEDVA DDFARYWYCK SVSPLQQVLD VPEGEMADRI ELPASWHNDA MTHRYLRQME SSPFINHLVD GTLPLIWAKT RLLSDDPAKG EGKAKRHSLL PQRLFDIMGS PSERIDIISS YFVPTRAGVA QLLRMVRKGV KIAILTNSLA ANDVAVVHAG YARWRKKLLR YGVELYELKP TREQSSTLHD RGITGNSGAS LHAKTFSIDG KTVFIGSFNF DPRSTLLNTE MGFVIESETL AQLIDKRFIQ SQYDAAWQLR LDRWGRINWV DRHAKKEIIL KKEPATSFWK RVMVRLASIL PVEWLL
|
| |