Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2712 |
Symbol | |
ID | 6146433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2791022 |
End bp | 2792578 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617582 |
Product | putative transglycosylase |
Protein accession | YP_001744747 |
Protein GI | 170681428 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4623] Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAAAAT TAAAGATTAA TTATCTGTTC ATCGGCATTC TGGCACTGCT GCTCGCGGTC GCTCTCTGGC CATCCATTCC CTGGTTTGGT AAAGCCGACA ACCGTATCGC CGCCATTCAA GCGCGGGGAG AGTTGCGTGT GAGCACCATT AATACTCCCC TGACGTATAA CGAAATCAAC GGGAAACCTT TTGGCCTGGA TTACGAACTG GCGAAACAGT TTGCCGATTA CCTCGGCGTA AAACTGAAAG TGACCGTGCG GCAGAATATC AGCCAGCTGT TTGACGACCT CGATAATGGT AACGCCGACC TGCTGGCGGC AGGCCTGGTC TATAACAGTG AGCGTGTAAA AAATTATCAG CCTGGCCCTA CCTATTATTC CGTGTCACAA CAACTGGTTT ATAAAGTGGG TCAGTATCGC CCACGTACAC TGGGCAACCT GACGGCGGAG CAACTCACCG TTGCACCGGG TCATGTGGTG GTTAACGATC TCCAGACCCT GAAAGAAACA AAATTCCCGG AATTAAGCTG GAAGGTTGAC GACAAAAAAG GCTCTGCGGA ATTAATGGAA GATGTCATCG AAGGAAAACT CGATTACACC ATTGCTGATT CTGTTGCCAT TAGCCTGTTT CAGCGCGTTC ATCCGGAACT CGCCGTAGCG CTCGATATCA CCGATGAACA ACCGGTGACC TGGTTTAGCC CGTTAGATGG CGATAATACC CTTTCCGCCG CTCTGCTCGA TTTCTTCAAT GAGATGAATG AAGACGGTAC GCTGGCACGC ATTGAAGAGA AATACCTGGG GCATGGCGAT GATTTTGATT ACGTCGATAC GCGCACATTT TTACGCGCCG TCGATGCGGT ACTGCCGCAG TTAAAGCCCC TGTTTGAGAA ATACGCCGAA GAAATTGACT GGCGTTTGCT GGCCGCTATT GCTTATCAGG AATCGCACTG GGATGCTCAG GCCACTTCAC CGACGGGTGT GCGCGGCATG ATGATGTTAA CCAAAAACAC CGCGCAAAGC CTCGGCATTA CGGATCGTAC CGATGCCGAA CAGAGCATCA GCGGCGGCGT GCGTTATTTG CAGGATATGA TGAGTAAAGT GCCGGAAAGT GTGCCGGAGA ACGAACGGAT TTGGTTTGCC CTCGCCGCGT ACAATATGGG CTATGCGCAT ATGCTGGATG CCCGCGCTCT GACGGCAAAA ACCAAAGGGA ATCCTGACAG TTGGGCTGAC GTAAAACAGC GTCTGCCTTT ACTTAGCCAG AAACCCTATT ACAGCAAGCT GACTTACGGC TACGCTCGTG GGCATGAAGC CTACGCTTAT GTCGAAAATA TTCGTAAGTA TCAGATTAGC CTGGTGGGTT ATCTGCAAGA GAAAGAGAAG CAGGCTACAG AAGCGGCGAT GCAACTGGCG CAGGATTATC CGGCGGTATC GCCTACGGAG TTGGGCAAAG AGAAATTTCC TTTTCTCTCG TTTCTTTCCC AGTCGTCATC AAACTATTTG ACCCACTCTC CCTCTCTGCT GTTTTCCAGG AAAGGGAGTG AAGAGAAACA AAATTAA
|
Protein sequence | MKKLKINYLF IGILALLLAV ALWPSIPWFG KADNRIAAIQ ARGELRVSTI NTPLTYNEIN GKPFGLDYEL AKQFADYLGV KLKVTVRQNI SQLFDDLDNG NADLLAAGLV YNSERVKNYQ PGPTYYSVSQ QLVYKVGQYR PRTLGNLTAE QLTVAPGHVV VNDLQTLKET KFPELSWKVD DKKGSAELME DVIEGKLDYT IADSVAISLF QRVHPELAVA LDITDEQPVT WFSPLDGDNT LSAALLDFFN EMNEDGTLAR IEEKYLGHGD DFDYVDTRTF LRAVDAVLPQ LKPLFEKYAE EIDWRLLAAI AYQESHWDAQ ATSPTGVRGM MMLTKNTAQS LGITDRTDAE QSISGGVRYL QDMMSKVPES VPENERIWFA LAAYNMGYAH MLDARALTAK TKGNPDSWAD VKQRLPLLSQ KPYYSKLTYG YARGHEAYAY VENIRKYQIS LVGYLQEKEK QATEAAMQLA QDYPAVSPTE LGKEKFPFLS FLSQSSSNYL THSPSLLFSR KGSEEKQN
|
| |