Gene EcSMS35_0866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0866 
SymboldacC 
ID6144628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp873350 
End bp874561 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content51% 
IMG OID641615754 
ProductD-alanyl-D-alanine carboxypeptidase fraction C 
Protein accessionYP_001742946 
Protein GI170681795 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCATTTA TGACGCAATA CTCCTCTCTC CTTCGTGGTC TTGCAGCGGG TTCTGCATTT 
TTATTCCTTT TTGCCCCAAC GGCATTCGCG GCGGAACAAA CCGTTGAAGC GCCGAGCGTG
GATGCGCGTG CATGGATTTT AATGGATTAC GCCAGCGGTA AAGTGCTGGC AGAAGGCAAC
GCGGATGAGA AACTGGATCC CGCGAGCCTG ACTAAAATCA TGACCAGCTA TGTGGTTGGG
CAGGCGCTTA AGGCCGATAA GATTAAACTC ACCGATATGG TGACGGTCGG TAAAGATGCC
TGGGCGACGG GAAATCCGGC ACTGCGTGGT TCATCGGTAA TGTTCCTCAA ACCGGGCGAT
CAGGTTTCGG TGGCAGACTT GAACAAAGGT GTGATTATCC AGTCCGGTAA TGACGCCTGT
ATTGCGCTGG CCGATTACGT TGCCGGGAGC CAGGAGTCAT TTATTGGTTT GATGAATGGT
TATGCCAAAA AACTGGGTCT GACCAACACT ACCTTCCAGA CGGTGCACGG TCTGGATGCG
CCGGGGCAGT TCAGTACCGC GCGCGATATG GCATTGCTGG GTAAAGCATT AATCCACGAT
GTGCCGGAAG AGTACGCCAT TCATAAAGAG AAAGAGTTCA CCTTCAACAA AATTCGTCAG
CCTAACCGTA ACCGTCTGCT GTGGAGCAGC AATCTGAATG TTGATGGCAT GAAGACAGGA
ACCACTGCAG GCGCGGGATA TAATCTGGTT GCTTCGGCTA CCCAGGGTGA TATGCGTTTA
ATCTCCGTAG TGCTGGGGGC GAAAACTGAC CGTATCCGTT TTAATGAGTC TGAGAAATTA
TTGACCTGGG GTTTCCGCTT CTTTGAAACC GTGACGCCAA TTAAACCTGA TGCCACCTTT
GTGACTCAGC GCGTCTGGTT TGGTGATAAG AGCGAAGTGA ATCTCGGGGC AGGTGAAGCG
GGCTCAGTGA CCATACCGCG TGGGCAGCTG AAAAACCTGA AAGCGAGTTA TACGTTAACG
GAACCGCAGC TTACCGCACC GCTGAAAAAA GGTCAGGTTG TCGGGACCAT TGATTTCCAG
CTTAACGGTA AATCCATTGA GCAGCGTCCG CTGATCGTGA TGGAAAATGT GGAAGAGGGC
GGATTCTTTG GTCGGATGTG GGATTTCGTG ATGATGAAAT TCCATCAGTG GTTCGGCAGC
TGGTTCTCTT AA
 
Protein sequence
MAFMTQYSSL LRGLAAGSAF LFLFAPTAFA AEQTVEAPSV DARAWILMDY ASGKVLAEGN 
ADEKLDPASL TKIMTSYVVG QALKADKIKL TDMVTVGKDA WATGNPALRG SSVMFLKPGD
QVSVADLNKG VIIQSGNDAC IALADYVAGS QESFIGLMNG YAKKLGLTNT TFQTVHGLDA
PGQFSTARDM ALLGKALIHD VPEEYAIHKE KEFTFNKIRQ PNRNRLLWSS NLNVDGMKTG
TTAGAGYNLV ASATQGDMRL ISVVLGAKTD RIRFNESEKL LTWGFRFFET VTPIKPDATF
VTQRVWFGDK SEVNLGAGEA GSVTIPRGQL KNLKASYTLT EPQLTAPLKK GQVVGTIDFQ
LNGKSIEQRP LIVMENVEEG GFFGRMWDFV MMKFHQWFGS WFS