Gene EcSMS35_0652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0652 
SymboldacA 
ID6146184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp663355 
End bp664566 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content50% 
IMG OID641615542 
ProductD-alanyl-D-alanine carboxypeptidase fraction A 
Protein accessionYP_001742748 
Protein GI170683720 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value3.71199e-09 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACCA TTTTTTCCGC TCGTATCATG AAGCGCCTGG CGCTCACCAC GGCTCTTTGC 
ACAGCCTTTA TCTCTGCTGC ACATGCCGAT GACCTGAATA TCAAAACTAT GATCCCGGGT
GTACCGCAGA TCGATGCGGA GTCCTACATC CTGATTGACT ATAACTCCGG CAAAGTGCTC
GCCGAACAGA ACGCAGATGT CCGCCGCGAT CCTGCCAGCC TGACCAAAAT GATGACCAGT
TACGTTATCG GCCAGGCAAT GAAAGCCGGT AAATTTAAAG AAACTGATTT AGTCACTATC
GGCAACGACG CATGGGCCAC CGGTAACCCG GTGTTTAAAG GTTCTTCACT GATGTTCCTC
AAACCGGGCA TGCAGGTTCC GGTTTCTCAG CTAATCCGCG GTATTAACCT GCAATCGGGT
AACGATGCTT GTGTCGCCAT GGCCGATTTT GCCGCTGGTA GCCAGGACGC TTTTGTTGGC
TTGATGAACA GCTACGTTAA CGCACTGGGC CTGAAAAATA CCCACTTCCA GACGGTACAT
GGTCTGGATG CTGATGGTCA GTACAGCTCC GCGCGAGATA TGGCGCTGAT CGGCCAGGCG
TTGATCCGTG ACGTACCGAA TGAATACTCG ATCTATAAAG AAAAAGAATT TACGTTTAAC
GGTATTCGCC AGCTGAACCG TAACGGCCTG TTATGGGATA ACAGCCTGAA TGTCGACGGC
ATCAAAACCG GACACACTGA CAAAGCAGGT TACAACCTTG TTGCTTCTGC GACTGAAGGC
CAGATGCGCT TGATCTCTGC GGTGATGGGC GGACGTACTT TTAAAGGCCG TGAAGCCGAA
AGTAAAAAAC TGCTAACCTG GGGCTTCCGT TTCTTCGAAA CCGTTAACCC ACTGAAAGTA
GGTAAAGAGT TCGCCTCTGA ACCGGTTTGG TTTGGTGATT CTGATCGCGC TTCGTTAGGT
GTTGATAAAG ACGTGTATCT GACCATTCCG CGTGGCCGCA TGAAAGATCT GAAAGCCAGC
TATGTGCTGA ACAGCAGTGA ATTGCATGCG CCGCTGCAAA AGAATCAGGT CGTCGGTACT
ATCAACTTCC AGCTTGATGG CAAAACGATC GAACAACGCC CGCTGGTTGT GCTGCAAGAA
ATCCCGGAAG GCAACTTCTT CGGCAAAATC ATTGATTACA TTAAATTAAT GTTCCATCAC
TGGTTTGGTT AA
 
Protein sequence
MNTIFSARIM KRLALTTALC TAFISAAHAD DLNIKTMIPG VPQIDAESYI LIDYNSGKVL 
AEQNADVRRD PASLTKMMTS YVIGQAMKAG KFKETDLVTI GNDAWATGNP VFKGSSLMFL
KPGMQVPVSQ LIRGINLQSG NDACVAMADF AAGSQDAFVG LMNSYVNALG LKNTHFQTVH
GLDADGQYSS ARDMALIGQA LIRDVPNEYS IYKEKEFTFN GIRQLNRNGL LWDNSLNVDG
IKTGHTDKAG YNLVASATEG QMRLISAVMG GRTFKGREAE SKKLLTWGFR FFETVNPLKV
GKEFASEPVW FGDSDRASLG VDKDVYLTIP RGRMKDLKAS YVLNSSELHA PLQKNQVVGT
INFQLDGKTI EQRPLVVLQE IPEGNFFGKI IDYIKLMFHH WFG