Gene Information |
Locus tag | ECH74115_3993 |
Symbol | nlpD |
ID | 6966757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3692582 |
End bp | 3693673 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387762 |
Product | lipoprotein NlpD |
Protein accession | YP_002272205 |
Protein GI | 209400494 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
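The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 3692582
END_BP = 3693673
LAST_CODON = "TAA"  # final three bases of the gene sequence in this record

# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, last_codon):
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    if last_codon not in STOP_CODONS:
        raise ValueError("CDS does not end in a stop codon")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp, LAST_CODON)
print(length_bp, length_aa)  # 1092 363
```

Under these assumptions the computed values agree with the Gene Length (1092 bp) and Protein Length (363 aa) fields.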
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCGCGG GAAGCCCAAA ATTCACCGTT CGCCGCATTG CGGCTTTGTC ACTGGTTTCG CTATGGCTGG CAGGCTGTTC TGACACTTCA AATCCACCGG CACCGGTCAG CTCCGTTAAT GGCAATGCGC CTGCACATAC CAATTCTGGT ATGTTGATTA CGCCGCCGCC GAAAATGGGG ACGACGTCTA CAGCGCAGCA ACCGCAAATC CAGCCAGTGC AGCCAGTAGC TCAGCAGCCG GTACAGATGG AAAACGGACG CATCGTCTAT AACCGTCAGT ATGGGAACAT TCCGAAAGGC AGTTATAGCG GCAGTACCTA TACCGTGAAA AAAGGCGACA CGCTTTTCTA TATCGCCTGG ATTACTGGCA ACGATTTCCG TGACCTTGCT CAGCGCAACA ATATTCAGGC ACCATACGCG CTGAACGTCG GTCAGACCTT ACAGGTGGGT AATGCTTCCG GTACGCCAAT CACTGGCGGA AATGCCATTA CCCAGGCCGA CGCAGCAGAG CAAGGAGTTG TGATCAAGCC TGCACAAAAT TCCACCGTTG CTGTTGCGTC GCAACCGACA ATTACGTATT CTGAGTCTTC GGGTGAACAG AGTGCTAACA AAATGTTGCC GAACAACAAG CCAACTGCGA CCACGGTCAC AGCGCCTGTA ACGGTACCAA CAGCAAGCAC AACCGAGCCG ACTGTCAGCA GTACATCAAC CAGTACGCCT ATCTCCACCT GGCGCTGGCC GACTGAGGGC AAAGTGATCG AAACCTTTGG CGCTTCTGAG GGGGGCAACA AGGGGATTGA TATCGCAGGC AGCAAAGGAC AGGCAATTAT CGCGACCGCA GATGGCCGTG TTGTTTATGC CGGTAACGCG CTGCGCGGCT ACGGTAATCT GATTATCATC AAACATAATG ATGATTACCT GAGTGCCTAC GCCCATAACG ACACAATGCT GGTCCGGGAA CAACAAGAAG TGAAGGCGGG GCAAAAAATA GCAACCATGG GTAGCACCGG AACCAGTTCA ACACGCTTGC ATTTTGAAAT TCGTTACAAG GGGAAATCCG TAAACCCGCT GCGTTATTTG CCGCAGCGAT AA
|
Protein sequence | MSAGSPKFTV RRIAALSLVS LWLAGCSDTS NPPAPVSSVN GNAPAHTNSG MLITPPPKMG TTSTAQQPQI QPVQPVAQQP VQMENGRIVY NRQYGNIPKG SYSGSTYTVK KGDTLFYIAW ITGNDFRDLA QRNNIQAPYA LNVGQTLQVG NASGTPITGG NAITQADAAE QGVVIKPAQN STVAVASQPT ITYSESSGEQ SANKMLPNNK PTATTVTAPV TVPTASTTEP TVSSTSTSTP ISTWRWPTEG KVIETFGASE GGNKGIDIAG SKGQAIIATA DGRVVYAGNA LRGYGNLIII KHNDDYLSAY AHNDTMLVRE QQEVKAGQKI ATMGSTGTSS TRLHFEIRYK GKSVNPLRYL PQR