Gene EcolC_0970 details


Gene Information

Locus tag: EcolC_0970
Symbol: nlpD
ID: 6068029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1055131
End bp: 1056270
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 52%
IMG OID: 641600378
Product: lipoprotein NlpD
Protein accession: YP_001723966
Protein GI: 170019012
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
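
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1055131
end_bp = 1056270

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1140 379
```

Under those assumptions the computed values agree with the listed Gene Length (1140 bp) and Protein Length (379 aa).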


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.957286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGG GAAGCCCAAA ATTCACCGTT CGCCGCATTG CGGCTTTGTC ACTGGTTTCG 
CTATGGCTGG CAGGCTGTTC TGACACTTCA AATCCACCGG CACCGGTCAG CTCCGTTAAT
GGCAATGCGC CTGCAAATAC TAATTCTGGT ATGTTGATTA CGCCGCCGCC GAAAATGGGG
ACGACGTCTA CAGCGCAGCA ACCGCAAATT CAGCCGGTAC AGCAGCCACA AATTCAGGCT
ACTCAACAAC CGCAAATCCA GCCAGTGCAG CCAGTAGCTC AGCAGCCGGT ACAGATGGAA
AACGGACGCA TCGTCTATAA CCGTCAGTAT GGGAACATTC CGAAAGGCAG TTATAGCGGC
AGTACCTATA CCGTGAAAAA AGGCGACACA CTTTTCTATA TCGCCTGGAT TACTGGCAAC
GATTTCCGTG ACCTTGCTCA GCGCAACAAT ATTCAGGCAC CATACGCGCT GAACGTTGGT
CAGACCTTGC AGGTGGGTAA TGCTTCCGGT ACGCCAATCA CTGGCGGAAA TGCCATTACC
CAGGCCGACG CAGCAGAGCA AGGAGTTGTG ATCAAGCCTG CACAAAATTC CACCGTTGCT
GTTGCGTCGC AACCGACAAT TACGTATTCT GAATCTTCGG GTGAACAGAG TGCTAACAAA
ATGTTGCCGA ACAACAAGCC AGCTGCGACC ACGGTCACAG CGCCTGTAAC GGTACCAACA
GCAAGCACAA CCGAGCCAAC TGTCAGCAGT ACATCAACCA GTACGCCTAT CTCCACCTGG
CGCTGGCCGA CTGAGGGCAA AGTGATCGAA ACCTTTGGCG CTTCTGAGGG GGGCAACAAG
GGGATTGATA TCGCAGGCAG CAAAGGACAG GCAATTATCG CGACCGCAGA TGGCCGCGTT
GTTTATGCTG GTAACGCGCT GCGCGGCTAC GGTAATCTGA TTATCATCAA ACATAATGAT
GATTACCTGA GTGCCTACGC CCATAACGAC ACAATGCTGG TCCGGGAACA ACAAGAAGTG
AAGGCGGGGC AAAAAATAGC AACCATGGGT AGCACCGGAA CCAGTTCAAC ACGCTTGCAT
TTTGAAATTC GTTACAAGGG GAAATCCGTA AACCCGCTGC GTTATTTGCC GCAGCGATAA
 
Protein sequence
MSAGSPKFTV RRIAALSLVS LWLAGCSDTS NPPAPVSSVN GNAPANTNSG MLITPPPKMG 
TTSTAQQPQI QPVQQPQIQA TQQPQIQPVQ PVAQQPVQME NGRIVYNRQY GNIPKGSYSG
STYTVKKGDT LFYIAWITGN DFRDLAQRNN IQAPYALNVG QTLQVGNASG TPITGGNAIT
QADAAEQGVV IKPAQNSTVA VASQPTITYS ESSGEQSANK MLPNNKPAAT TVTAPVTVPT
ASTTEPTVSS TSTSTPISTW RWPTEGKVIE TFGASEGGNK GIDIAGSKGQ AIIATADGRV
VYAGNALRGY GNLIIIKHND DYLSAYAHND TMLVREQQEV KAGQKIATMG STGTSSTRLH
FEIRYKGKSV NPLRYLPQR
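
Both sequences above are printed in blocks of ten residues, so they need cleaning before any downstream use. The following is a minimal sketch of that cleanup plus two simple checks. It is not taken from the source database. The function names are hypothetical, and the CDS check is deliberately simplified: it accepts only ATG as a start codon, although translation table 11 also allows alternative starts.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(dna: str) -> bool:
    """Simplified sanity check: whole codons, ATG start, stop codon at the end."""
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in STOP_CODONS


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAGCGCGG GAAGCCCAAA ATTCACCGTT CGCCGCATTG CGGCTTTGTC ACTGGTTTCG"
last_row = "TTTGAAATTC GTTACAAGGG GAAATCCGTA AACCCGCTGC GTTATTTGCC GCAGCGATAA"

print(len(clean_sequence(first_row)))             # 60 bases per full row
print(clean_sequence(last_row)[-3:] in STOP_CODONS)  # the gene ends in TAA
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare with the GC content field of the record.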