Gene EcSMS35_1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1051 
SymboldacD 
ID6147385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1070106 
End bp1071278 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content49% 
IMG OID641615938 
ProductD-alanyl-D-alanine carboxypeptidase 
Protein accessionYP_001743130 
Protein GI170683658 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.168288 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGTTGA AACGCCGTCT TATTATTGCT GCTTCTTTGT TCGTTTTTAA TTTATCGTCT 
GGTTTTGCGG CGGAAAACAT TCCTTTTTCA CCTCAGCCTC CAGAGATTCA TGCCGGGTCC
TGGGTATTGA TGGATTACAC CACCGGTCAG ATTCTCACCG CGGGTAATGA GCATCAACAG
CGCAATCCCG CCAGCCTGAC AAAGCTGATG ACGGGCTATG TCGTGGATCG TGCTATCGAT
AGTCATCGCA TTACGCCAGA CGATATTGTC ACCGTGGGGC GTGATGCGTG GGCGAAAGAT
AATCCGGTGT TTGTCGGTTC TTCACTGATG TTTTTGAAAG AGGGCGATCG CGTATCGGTA
CGTGATTTAA GCCGTGGTTT AATTGTGGAT TCCGGAAATG ACGCTTGTGT TGCACTGGCT
GACTATATTG CCGGTGGGCA ACGGCAGTTT GTTGAAATGA TGAACAACTA TGCCGAGAAG
CTGCATCTCA AGGATACGCA TTTTGAAACA GTGCATGGTC TGGATGCACC TGGCCAGCAT
AGCTCGGCTT ATGATTTAGC TGTGCTTTCT CGCGCTATCA TCCACGGCGA GCCCGAGTTT
TATCATATGT ACAGTGAGAA AAGCCTCACC TGGAACGGTA TCACCCAGCA AAACCGTAAC
GGGTTATTGT GGGATAAAAC CATGAATGTT GACGGCCTGA AAACGGGTCA TACTTCTGGT
GCCGGGTTTA ATCTCATTGC TTCGGCTGTA GATGGGCAGC GTCGTCTCAT TGCAGTGGTA
ATGGGGGCTG ACAGCGCAAA AGGTCGTGAG GAAGAGGCAA GAAAATTACT GCGTTGGGGT
CAACAAAACT TTACTACGGT GCAAATTTTG CACCGTGGGA AAAAGGTTGG TACGGAACGC
ATCTGGTATG GCGATAAAGA AAATATCGCC CTGGGAACGG AACAAGAGTT CTGGATGGTG
CTACCGAAAG CCGAAATTCC ACATATCAAA GCCAAATATA CCCTTGATGG TAAAGAACTC
ACCGCGCCAA TTAGCGCCCA TCAGCGGGTA GGGGAAATTG AACTTTACGA CCGTGATAAA
CAGGTGGCGC ACTGGCCGCT GGTTACCCTG GAATCTGTCG GGGAAGGCAG CATGTTTTCC
CGGCTGAGTG ATTATTTCCA CCATAAGGCC TGA
 
Protein sequence
MLLKRRLIIA ASLFVFNLSS GFAAENIPFS PQPPEIHAGS WVLMDYTTGQ ILTAGNEHQQ 
RNPASLTKLM TGYVVDRAID SHRITPDDIV TVGRDAWAKD NPVFVGSSLM FLKEGDRVSV
RDLSRGLIVD SGNDACVALA DYIAGGQRQF VEMMNNYAEK LHLKDTHFET VHGLDAPGQH
SSAYDLAVLS RAIIHGEPEF YHMYSEKSLT WNGITQQNRN GLLWDKTMNV DGLKTGHTSG
AGFNLIASAV DGQRRLIAVV MGADSAKGRE EEARKLLRWG QQNFTTVQIL HRGKKVGTER
IWYGDKENIA LGTEQEFWMV LPKAEIPHIK AKYTLDGKEL TAPISAHQRV GEIELYDRDK
QVAHWPLVTL ESVGEGSMFS RLSDYFHHKA