Gene EcSMS35_2867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2867 
SymbolnlpD 
ID6143460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2940812 
End bp2941951 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content53% 
IMG OID641617736 
Productlipoprotein NlpD 
Protein accessionYP_001744891 
Protein GI170679671 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGG GAAGCCCAAA ATTCACCGTT CGCCGCATTG CGGCTTTGTC ACTGGTTTCG 
CTATGGCTGG CAGGCTGTTC TGACACTTCA AATCCACCGG CCCCGGTCAG CTCCGTTAAT
GGCAATGCGC CTGCAAATAC CAATTCTGGT ATGTTGATTA CGCCGCCGCC GAAAATGGGG
ACGACGTCTA CAGCGCAGCA ACCGCAAATT CAGCCGGTAC AGCAGCCACA AATTCAGGCC
ACTCAACAAC CGCAAATCCA GCCGGTGCAG CCAGTAGCTC AGCAGCCGGT ACAGATGGAA
AACGGACGCA TCGTCTATAA CCGTCAGTAT GGGAACATTC CGAAAGGCAG TTATAGCGGC
AGTACCTATA CAGTGAAAAA AGGCGACACA CTTTTCTATA TCGCCTGGAT TACTGGCAAC
GATTTCCGTG ACCTTGCTCA GCGCAACAAT ATTCAGGCAC CATATGCGCT GAACGTCGGT
CAGACCTTAC AGGTGGGGAA TGCTTCCGGT ACGCCAATCA CTGGCGGAAA TGCCATTACC
CAGGCCGACG CAGCAGAGCA AGGAGTTGTG ATCAAGCCTG CACAAAATTC CACCGTTGCT
GTTGCTTCGC AACCGACAAT TACGTATTCT GAGTCTTCGG GTGAACAGAG TGCTAACAAA
ATGTTGCCGA ACAACAAGCC AACTGCGACC ACGGTCACAG CGCCTGTAAC GGTACCAACA
GCAAGCACAA CCGAGCCGAC TGTCAGCAGT ACATCAACCA GTACGCCTAT CTCCACCTGG
CGCTGGCCGA CTGAGGGCAA AGTGATCGAA ACCTTTGGCG CTTCTGAGGG GGGCAACAAG
GGGATTGATA TCGCAGGCAG CAAAGGACAG GCAATTATCG CGACTGCAGA TGGCCGCGTT
GTTTATGCCG GTAACGCGCT GCGCGGCTAC GGTAATCTGA TTATCATCAA ACATAATGAT
GATTACCTGA GTGCCTACGC CCATAACGAC ACAATGCTGG TCCGGGAACA ACAAGAAGTG
AAGGCGGGGC AAAAAATAGC AACCATGGGT AGCACCGGAA CCAGTTCAAC ACGCTTGCAT
TTTGAAATTC GTTACAAGGG GAAATCCGTA AACCCGCTGC GTTATTTGCC GCAGCGATAA
 
Protein sequence
MSAGSPKFTV RRIAALSLVS LWLAGCSDTS NPPAPVSSVN GNAPANTNSG MLITPPPKMG 
TTSTAQQPQI QPVQQPQIQA TQQPQIQPVQ PVAQQPVQME NGRIVYNRQY GNIPKGSYSG
STYTVKKGDT LFYIAWITGN DFRDLAQRNN IQAPYALNVG QTLQVGNASG TPITGGNAIT
QADAAEQGVV IKPAQNSTVA VASQPTITYS ESSGEQSANK MLPNNKPTAT TVTAPVTVPT
ASTTEPTVSS TSTSTPISTW RWPTEGKVIE TFGASEGGNK GIDIAGSKGQ AIIATADGRV
VYAGNALRGY GNLIIIKHND DYLSAYAHND TMLVREQQEV KAGQKIATMG STGTSSTRLH
FEIRYKGKSV NPLRYLPQR