Gene EcSMS35_2712 details

Gene Information

Locus tag: EcSMS35_2712
Symbol:
ID: 6146433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2791022
End bp: 2792578
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 50%
IMG OID: 641617582
Product: putative transglycosylase
Protein accession: YP_001744747
Protein GI: 170681428
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4623] Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein
TIGRFAM ID:
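
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2791022
end_bp = 2792578

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1557 bp) and Protein Length (518 aa) fields.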


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAAAAT TAAAGATTAA TTATCTGTTC ATCGGCATTC TGGCACTGCT GCTCGCGGTC 
GCTCTCTGGC CATCCATTCC CTGGTTTGGT AAAGCCGACA ACCGTATCGC CGCCATTCAA
GCGCGGGGAG AGTTGCGTGT GAGCACCATT AATACTCCCC TGACGTATAA CGAAATCAAC
GGGAAACCTT TTGGCCTGGA TTACGAACTG GCGAAACAGT TTGCCGATTA CCTCGGCGTA
AAACTGAAAG TGACCGTGCG GCAGAATATC AGCCAGCTGT TTGACGACCT CGATAATGGT
AACGCCGACC TGCTGGCGGC AGGCCTGGTC TATAACAGTG AGCGTGTAAA AAATTATCAG
CCTGGCCCTA CCTATTATTC CGTGTCACAA CAACTGGTTT ATAAAGTGGG TCAGTATCGC
CCACGTACAC TGGGCAACCT GACGGCGGAG CAACTCACCG TTGCACCGGG TCATGTGGTG
GTTAACGATC TCCAGACCCT GAAAGAAACA AAATTCCCGG AATTAAGCTG GAAGGTTGAC
GACAAAAAAG GCTCTGCGGA ATTAATGGAA GATGTCATCG AAGGAAAACT CGATTACACC
ATTGCTGATT CTGTTGCCAT TAGCCTGTTT CAGCGCGTTC ATCCGGAACT CGCCGTAGCG
CTCGATATCA CCGATGAACA ACCGGTGACC TGGTTTAGCC CGTTAGATGG CGATAATACC
CTTTCCGCCG CTCTGCTCGA TTTCTTCAAT GAGATGAATG AAGACGGTAC GCTGGCACGC
ATTGAAGAGA AATACCTGGG GCATGGCGAT GATTTTGATT ACGTCGATAC GCGCACATTT
TTACGCGCCG TCGATGCGGT ACTGCCGCAG TTAAAGCCCC TGTTTGAGAA ATACGCCGAA
GAAATTGACT GGCGTTTGCT GGCCGCTATT GCTTATCAGG AATCGCACTG GGATGCTCAG
GCCACTTCAC CGACGGGTGT GCGCGGCATG ATGATGTTAA CCAAAAACAC CGCGCAAAGC
CTCGGCATTA CGGATCGTAC CGATGCCGAA CAGAGCATCA GCGGCGGCGT GCGTTATTTG
CAGGATATGA TGAGTAAAGT GCCGGAAAGT GTGCCGGAGA ACGAACGGAT TTGGTTTGCC
CTCGCCGCGT ACAATATGGG CTATGCGCAT ATGCTGGATG CCCGCGCTCT GACGGCAAAA
ACCAAAGGGA ATCCTGACAG TTGGGCTGAC GTAAAACAGC GTCTGCCTTT ACTTAGCCAG
AAACCCTATT ACAGCAAGCT GACTTACGGC TACGCTCGTG GGCATGAAGC CTACGCTTAT
GTCGAAAATA TTCGTAAGTA TCAGATTAGC CTGGTGGGTT ATCTGCAAGA GAAAGAGAAG
CAGGCTACAG AAGCGGCGAT GCAACTGGCG CAGGATTATC CGGCGGTATC GCCTACGGAG
TTGGGCAAAG AGAAATTTCC TTTTCTCTCG TTTCTTTCCC AGTCGTCATC AAACTATTTG
ACCCACTCTC CCTCTCTGCT GTTTTCCAGG AAAGGGAGTG AAGAGAAACA AAATTAA
 
Protein sequence
MKKLKINYLF IGILALLLAV ALWPSIPWFG KADNRIAAIQ ARGELRVSTI NTPLTYNEIN 
GKPFGLDYEL AKQFADYLGV KLKVTVRQNI SQLFDDLDNG NADLLAAGLV YNSERVKNYQ
PGPTYYSVSQ QLVYKVGQYR PRTLGNLTAE QLTVAPGHVV VNDLQTLKET KFPELSWKVD
DKKGSAELME DVIEGKLDYT IADSVAISLF QRVHPELAVA LDITDEQPVT WFSPLDGDNT
LSAALLDFFN EMNEDGTLAR IEEKYLGHGD DFDYVDTRTF LRAVDAVLPQ LKPLFEKYAE
EIDWRLLAAI AYQESHWDAQ ATSPTGVRGM MMLTKNTAQS LGITDRTDAE QSISGGVRYL
QDMMSKVPES VPENERIWFA LAAYNMGYAH MLDARALTAK TKGNPDSWAD VKQRLPLLSQ
KPYYSKLTYG YARGHEAYAY VENIRKYQIS LVGYLQEKEK QATEAAMQLA QDYPAVSPTE
LGKEKFPFLS FLSQSSSNYL THSPSLLFSR KGSEEKQN
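
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of the permitted initiation codons and is read as methionine at the first position. The following is a minimal sketch of that translation step, not the database's own pipeline: the function name `translate_cds` is hypothetical, the codon table is built from the compact 64-letter layout used for the standard code (whose amino acid assignments table 11 shares), and only the first 30 bases of the gene sequence are translated.

```python
from itertools import product

# Codon table in the compact layout with base order T, C, A, G.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence; hypothetical helper for illustration."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An alternative start codon still initiates with methionine.
            residues.append("M")
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
n_terminus = translate_cds("TTGAAAAAAT TAAAGATTAA TTATCTGTTC")
print(n_terminus)  # MKKLKINYLF
```

The result matches the first ten residues of the protein sequence shown above.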