Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2867 |
Symbol | nlpD |
ID | 6143460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2940812 |
End bp | 2941951 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617736 |
Product | lipoprotein NlpD |
Protein accession | YP_001744891 |
Protein GI | 170679671 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG GAAGCCCAAA ATTCACCGTT CGCCGCATTG CGGCTTTGTC ACTGGTTTCG CTATGGCTGG CAGGCTGTTC TGACACTTCA AATCCACCGG CCCCGGTCAG CTCCGTTAAT GGCAATGCGC CTGCAAATAC CAATTCTGGT ATGTTGATTA CGCCGCCGCC GAAAATGGGG ACGACGTCTA CAGCGCAGCA ACCGCAAATT CAGCCGGTAC AGCAGCCACA AATTCAGGCC ACTCAACAAC CGCAAATCCA GCCGGTGCAG CCAGTAGCTC AGCAGCCGGT ACAGATGGAA AACGGACGCA TCGTCTATAA CCGTCAGTAT GGGAACATTC CGAAAGGCAG TTATAGCGGC AGTACCTATA CAGTGAAAAA AGGCGACACA CTTTTCTATA TCGCCTGGAT TACTGGCAAC GATTTCCGTG ACCTTGCTCA GCGCAACAAT ATTCAGGCAC CATATGCGCT GAACGTCGGT CAGACCTTAC AGGTGGGGAA TGCTTCCGGT ACGCCAATCA CTGGCGGAAA TGCCATTACC CAGGCCGACG CAGCAGAGCA AGGAGTTGTG ATCAAGCCTG CACAAAATTC CACCGTTGCT GTTGCTTCGC AACCGACAAT TACGTATTCT GAGTCTTCGG GTGAACAGAG TGCTAACAAA ATGTTGCCGA ACAACAAGCC AACTGCGACC ACGGTCACAG CGCCTGTAAC GGTACCAACA GCAAGCACAA CCGAGCCGAC TGTCAGCAGT ACATCAACCA GTACGCCTAT CTCCACCTGG CGCTGGCCGA CTGAGGGCAA AGTGATCGAA ACCTTTGGCG CTTCTGAGGG GGGCAACAAG GGGATTGATA TCGCAGGCAG CAAAGGACAG GCAATTATCG CGACTGCAGA TGGCCGCGTT GTTTATGCCG GTAACGCGCT GCGCGGCTAC GGTAATCTGA TTATCATCAA ACATAATGAT GATTACCTGA GTGCCTACGC CCATAACGAC ACAATGCTGG TCCGGGAACA ACAAGAAGTG AAGGCGGGGC AAAAAATAGC AACCATGGGT AGCACCGGAA CCAGTTCAAC ACGCTTGCAT TTTGAAATTC GTTACAAGGG GAAATCCGTA AACCCGCTGC GTTATTTGCC GCAGCGATAA
|
Protein sequence | MSAGSPKFTV RRIAALSLVS LWLAGCSDTS NPPAPVSSVN GNAPANTNSG MLITPPPKMG TTSTAQQPQI QPVQQPQIQA TQQPQIQPVQ PVAQQPVQME NGRIVYNRQY GNIPKGSYSG STYTVKKGDT LFYIAWITGN DFRDLAQRNN IQAPYALNVG QTLQVGNASG TPITGGNAIT QADAAEQGVV IKPAQNSTVA VASQPTITYS ESSGEQSANK MLPNNKPTAT TVTAPVTVPT ASTTEPTVSS TSTSTPISTW RWPTEGKVIE TFGASEGGNK GIDIAGSKGQ AIIATADGRV VYAGNALRGY GNLIIIKHND DYLSAYAHND TMLVREQQEV KAGQKIATMG STGTSSTRLH FEIRYKGKSV NPLRYLPQR
|
| |