Gene B21_04233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_04233 
Symbolslt 
ID8115004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4546176 
End bp4548113 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content54% 
IMG OID644850371 
Producthypothetical protein 
Protein accessionYP_003001944 
Protein GI251787640 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAAAAG CCAAACAAGT TACCTGGCGG CTGTTGGCTG CCGGTGTCTG TCTGCTGACG 
GTCAGCAGCG TGGCGCGAGC CGACTCACTG GATGAGCAGC GTAGCCGTTA CGCGCAAATC
AAGCAGGCCT GGGATAATCG ACAAATGGAT GTGGTCGAAC AAATGATGCC TGGACTGAAA
GATTATCCAC TTTATCCCTA CCTGGAATAC CGTCAGATCA CCGATGACCT GATGAATCAA
CCGGCGGTGA CTGTCACTAA CTTTGTTCGC GCTAACCCCA CGCTTCCTCC CGCTCGCACG
CTGCAATCTC GTTTCGTCAA TGAACTGGCG CGGCGTGAAG ACTGGCGTGG CTTGTTAGCC
TTTAGCCCGG AAAAGCCCGG AACTACCGAA GCGCAATGTA ATTACTACTA TGCAAAATGG
AACACCGGGC AGAGTGAAGA AGCCTGGCAA GGGGCGAAAG AGCTGTGGCT AACCGGCAAG
AGCCAGCCTA ACGCCTGTGA CAAGTTATTT AGTGTCTGGC GTGCGTCAGG TAAACAAGAT
CCGCTGGCGT ATTTAGAGCG TATCCGTCTG GCGATGAAAG CGGGTAACAC TGGCCTGGTA
ACAGTGCTGG CAGGGCAGAT GCCTGCCGAT TACCAGACTA TCGCCTCGGC AATCATTTCA
CTGGCGAACA ACCCTAATAC GGTACTGACC TTCGCGCGTA CAACCGGAGC GACCGATTTT
ACCCGTCAAA TGGCGGCGGT GGCGTTTGCC AGTGTGGCGC GGCAGGATGC TGAAAATGCA
CGGCTGATGA TCCCATCGCT TGCCCAGGCG CAGCAGCTTA ATGAAGATCA GATTCAGGAG
CTGCGCGATA TCGTCGCCTG GCGTTTGATG GGCAACGATG TCACCGACGA GCAGGCGAAA
TGGCGCGATG ACGCCATTAT GCGCTCGCAA TCTACTTCGC TTATTGAACG CCGTGTACGA
ATGGCGCTTG GCACCGGCGA TCGTCGCGGC CTGAATACCT GGCTGGCGCG TTTGCCGATG
GAGGCGAAAG AGAAAGATGA ATGGCGTTAC TGGCAGGCGG ATTTACTGCT GGAACGCGGA
CGTGAAGCTG AAGCAAAAGA GATTTTGCAT CAACTCATGC AACAGCGTGG TTTCTACCCG
ATGGTTGCTG CACAACGCAT CGGCGAAGAG TATGAGCTGA AGATTGATAA AGCGCCGCAG
AATGTTGACA GCGCCCTGAC TCAGGGGTCG GAGATGGCGC GCGTGCGCGA GTTGATGTAC
TGGAATCTCG ATAACACCGC GCGTAGCGAG TGGGCCAATC TGGTGAAGAG CAAGTCAAAA
ACAGAGCAGG CTCAACTGGC GCGGTATGCT TTCAACAACC AATGGTGGGA TCTTAGCGTT
CAGGCAACGA TCGCCGGGAA GCTGTGGGAT CATCTGGAAG AGCGATTCCC GCTGGCTTAC
AACGATCTTT TCAAACGTTA CACCAGTGGG AAGGAGATCC CGCAAAGCTA TGCGATGGCG
ATTGCCCGTC AGGAGAGCGC CTGGAATCCA AAAGTGAAAT CACCGGTAGG GGCCAGCGGC
CTGATGCAGA TTATGCCTGG TACAGCGACC CATACGGTGA AGATGTTCTC TATTCCTGGT
TATAGCAGCC CTGGGCAATT GCTGGATCCG GAAACGAATA TCAACATTGG CACCAGTTAT
CTGCAATATG TTTATCAGCA GTTTGGCAAT AACCGTATTT TCTCCTCAGC AGCTTATAAC
GCCGGACCAG GGCGGGTGCG AACCTGGCTT GGCAACAGCG CCGGGCGTAT CGACGCAGTG
GCATTTGTCG AGAGTATTCC GTTCTCCGAG ACGCGTGGTT ATGTAAAGAA CGTGCTGGCT
TATGACGCTT ACTACCGCTA TTTCATGGGG GATAAACCGA CGTTGATGAG CGCCACGGAA
TGGGGACGTC GTTACTGA
 
Protein sequence
MEKAKQVTWR LLAAGVCLLT VSSVARADSL DEQRSRYAQI KQAWDNRQMD VVEQMMPGLK 
DYPLYPYLEY RQITDDLMNQ PAVTVTNFVR ANPTLPPART LQSRFVNELA RREDWRGLLA
FSPEKPGTTE AQCNYYYAKW NTGQSEEAWQ GAKELWLTGK SQPNACDKLF SVWRASGKQD
PLAYLERIRL AMKAGNTGLV TVLAGQMPAD YQTIASAIIS LANNPNTVLT FARTTGATDF
TRQMAAVAFA SVARQDAENA RLMIPSLAQA QQLNEDQIQE LRDIVAWRLM GNDVTDEQAK
WRDDAIMRSQ STSLIERRVR MALGTGDRRG LNTWLARLPM EAKEKDEWRY WQADLLLERG
REAEAKEILH QLMQQRGFYP MVAAQRIGEE YELKIDKAPQ NVDSALTQGS EMARVRELMY
WNLDNTARSE WANLVKSKSK TEQAQLARYA FNNQWWDLSV QATIAGKLWD HLEERFPLAY
NDLFKRYTSG KEIPQSYAMA IARQESAWNP KVKSPVGASG LMQIMPGTAT HTVKMFSIPG
YSSPGQLLDP ETNINIGTSY LQYVYQQFGN NRIFSSAAYN AGPGRVRTWL GNSAGRIDAV
AFVESIPFSE TRGYVKNVLA YDAYYRYFMG DKPTLMSATE WGRRY