Gene EcolC_1119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1119 
Symbol 
ID6067971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1215566 
End bp1217122 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content50% 
IMG OID641600535 
Productputative transglycosylase 
Protein accessionYP_001724113 
Protein GI170019159 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4623] Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAAAAT TAAAGATTAA TTATCTGTTC ATCGGCATTC TGGCACTGCT GCTCGCGGTC 
GCTCTCTGGC CATCCATTCC CTGGTTTGGT AAAGCCGACA ACCGTATCGC CGCCATTCAA
GCGCGGGGAG AGTTGCGTGT GAGCACCATT CATACTCCCC TGACTTATAA CGAAATCAAC
GGGAAACCTT TTGGCCTGGA TTACGAACTG GCGAAACAGT TTGCCGATTA CCTCGGCGTA
AAACTGAAAG TGACCGTGCG GCAGAATATC AGCCAGCTGT TTGACGACCT TGATAATGGT
AACGCCGACC TGCTGGCGGC AGGACTTGTC TATAACAGTG AGCGGGTAAA AAATTATCAG
CCTGGCCCTA CCTATTATTC CGTGTCACAA CAACTGGTTT ATAAAGTGGG TCAGTATCGC
CCACGTACGC TGGGCAACCT GACGGCGGAG CAACTCACCG TTGCACCGGG TCATGTGGTG
GTTAACGATC TCCAGACCCT GAAAGAAACA AAATTCCCGG AATTAAGCTG GAAGGTAGAC
GACAAAAAAG GCTCTGCGGA ATTAATGGAA GATGTCATCG AAGGAAAACT CGATTACACC
ATTGCTGATT CTGTCGCCAT CAGCCTGTTT CAGCGCGTTC ACCCGGAGCT CGCCGTAGCG
CTCGATATCA CCGATGAACA ACCGGTGACT TGGTTTAGCC CGTTAGATGG CGATAATACC
CTTTCCGCCG CCCTGCTCGA CTTCTTCAAC GAAATGAATG AAGACGGTAC GCTGGCACGC
ATTGAAGAGA AATACCTGGG GCATGGCGAT GATTTTGATT ACGTCGATAC GCGCACATTT
TTACGCGCCG TCGATGCGGT ACTGCCGCAG TTAAAGCCCC TGTTTGAGAA ATACGCCGAA
GAAATTGACT GGCGTTTGCT GGCCGCTATT GCTTATCAGG AATCGCACTG GGATGCACAG
GCCACTTCAC CGACGGGTGT GCGCGGCATG ATGATGTTAA CCAAAAATAC CGCGCAAAGC
CTCGGCATTA CGGATCGTAC CGATGCCGAA CAGAGCATCA GCGGTGGCGT GCGTTATTTG
CAGGATATGA TGAGTAAAGT GCCGGAAAGT GTGCCGGAGA ACGAGCGGAT CTGGTTTGCC
CTCGCTGCGT ACAATATGGG CTATGCGCAT ATGCTGGATG CCCGCGCCCT GACGGCAAAA
ACCAAAGGGA ATCCTGACAG TTGGGCTGAC GTAAAACAGC GTCTGCCTTT ACTTAGCCAG
AAACCCTATT ACAGCAAGCT GACTTACGGC TACGCTCGTG GACATGAAGC CTACGCTTAT
GTCGAAAATA TTCGTAAGTA TCAGATTAGC CTGGTGGGTT ATCTGCAAGA GAAAGAGAAG
CAGGCTACAG AAGCGGCGAT GCAACTGGCG CAGGATTATC CGGCGGTATC GCCTACGGAG
TTGGGCAAAG AGAAATTTCC TTTTCTCTCG TTTCTTTCCC AGTCGTCATC AAACTATTTG
ACCCATTCTC CCTCTCTGCT GTTTTCCAGG AAAGGGAGTG AAGAGAAACA AAATTAA
 
Protein sequence
MKKLKINYLF IGILALLLAV ALWPSIPWFG KADNRIAAIQ ARGELRVSTI HTPLTYNEIN 
GKPFGLDYEL AKQFADYLGV KLKVTVRQNI SQLFDDLDNG NADLLAAGLV YNSERVKNYQ
PGPTYYSVSQ QLVYKVGQYR PRTLGNLTAE QLTVAPGHVV VNDLQTLKET KFPELSWKVD
DKKGSAELME DVIEGKLDYT IADSVAISLF QRVHPELAVA LDITDEQPVT WFSPLDGDNT
LSAALLDFFN EMNEDGTLAR IEEKYLGHGD DFDYVDTRTF LRAVDAVLPQ LKPLFEKYAE
EIDWRLLAAI AYQESHWDAQ ATSPTGVRGM MMLTKNTAQS LGITDRTDAE QSISGGVRYL
QDMMSKVPES VPENERIWFA LAAYNMGYAH MLDARALTAK TKGNPDSWAD VKQRLPLLSQ
KPYYSKLTYG YARGHEAYAY VENIRKYQIS LVGYLQEKEK QATEAAMQLA QDYPAVSPTE
LGKEKFPFLS FLSQSSSNYL THSPSLLFSR KGSEEKQN