Gene EcSMS35_0410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0410 
Symbolddl 
ID6142905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp422216 
End bp423310 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content52% 
IMG OID641615306 
ProductD-alanyl-alanine synthetase A 
Protein accessionYP_001742513 
Protein GI170684315 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAC TGCGGGTAGG AATCGTTTTT GGTGGTAAAT CAGCGGAACA TGAAGTGTCT 
CTGCAATCGG CAAAAAACAT TGTCGATGCC ATTGATAAAA GTCGCTTCGA CGTTGTGCTG
CTGGGCATTG ATAAACAAGG GCAATGGCAC GTCAGCGATG CCAGCAATTA TCTGCTAAAT
GCAGACGATC CTGCCCATAT TGCGTTGCGC CCTTCGGCGA CCAGCCTTGC GCAGGTGCCT
GGTAAACATG AACATCAGCT TATCGACGCG CAAAACGGTC AGCCGTTGCC GACGGTGGAT
GTCATTTTCC CGATTGTCCA CGGTACGCTG GGCGAAGATG GTTCCTTGCA GGGAATGCTG
CGGGTCGCCA ATTTACCGTT TGTAGGTTCT GATGTTCTGG CTTCAGCGGC CTGTATGGAT
AAAGATGTCA CCAAACGTCT GCTGCGCGAT GCCGGGCTGA ACATTGCGCC ATTTATTACC
CTGACGCGCG CTAATCGTCA CAACATCAGT TTTGCCGAAG TGGAGTCTAA ACTGGGGTTA
CCGCTGTTTG TAAAACCGGC TAATCAGGGC TCTTCTGTTG GTGTCAGCAA AGTAACCAGT
GAAGAACAGT ACGCAATTGC CGTCGATCTG GCGTTCGAGT TCGACCATAA AGTGATCGTT
GAGCAAGGGA TCAAAGGTCG TGAGATCGAA TGCGCAGTTC TGGGCAACGA CAATCCGCAA
GCCAGCACCT GTGGCGAGAT CGTACTCACC AGCGATTTCT ATGCCTACGA CACCAAGTAC
ATTGACGAAG ATGGCGCGAA AGTGGTTGTT CCGGCAGCCA TTGCGCCAGA AATCAACGAT
AAGATCCGGG CGATTGCCGT TCAGGCCTAT CAAACGTTGG GATGCGCAGG CATGGCGCGT
GTAGACGTGT TTTTAACCCC AGAGAACGAA GTGGTGATCA ACGAGATCAA CACCCTGCCT
GGCTTCACCA ACATCAGTAT GTATCCGAAG TTGTGGCAAG CCAGCGGTCT GGGTTACACC
GACCTGATCA CGCGTTTGAT TGAGCTGGCG CTGGAGCGTC ACGCTGCAGA TAACGCACTG
AAAACCACAA TGTAA
 
Protein sequence
MEKLRVGIVF GGKSAEHEVS LQSAKNIVDA IDKSRFDVVL LGIDKQGQWH VSDASNYLLN 
ADDPAHIALR PSATSLAQVP GKHEHQLIDA QNGQPLPTVD VIFPIVHGTL GEDGSLQGML
RVANLPFVGS DVLASAACMD KDVTKRLLRD AGLNIAPFIT LTRANRHNIS FAEVESKLGL
PLFVKPANQG SSVGVSKVTS EEQYAIAVDL AFEFDHKVIV EQGIKGREIE CAVLGNDNPQ
ASTCGEIVLT SDFYAYDTKY IDEDGAKVVV PAAIAPEIND KIRAIAVQAY QTLGCAGMAR
VDVFLTPENE VVINEINTLP GFTNISMYPK LWQASGLGYT DLITRLIELA LERHAADNAL
KTTM