Gene Mlg_1018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1018 
Symbol 
ID4270047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1158462 
End bp1159946 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content69% 
IMG OID638125769 
Productputative transglycosylase 
Protein accessionYP_741861 
Protein GI114320178 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4623] Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0598275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATCA TGGCGGTGCG CCTGGTGGCC GGGGCCATCA CGCTGGCACT AATGGCCTAT 
GCCTGGCTCG CCTGGGAGCG GGCCCGCGAC CCGGAGCCGA TCACGATTCT GGAACGCGTT
CTGGAACGCG GCGAGCTGCG CGTGATCACC CGTATCAGCG CCACCACCTA TTACCAGACG
GACAAGGGAC GGGCCGGCCT GGAGTTCGAG CTGGCCCAGG CGTTCGCCCA CCGCCTGGGT
GTGCAGCTCC GCATGCTGGT GGCCCCCGAC CTGGAGGCCA TCTTCGCGGC GCTGGACGAT
GGCGAGGCGG ACCTGGCCGC CGCCGGCCTC ACCTACACGG AATCCCGGGG CCAGCGCTAC
TGGTTCACAC CGCCCTACAA GGACATTACT CAGCAACTGG TCTACCGGGT GGGCACCCCC
CGACCGGATG ACCTCAGTGA GATCGGACCG GGCGAGTTGG CGGTGATCGC CAACAGCAGC
CACGCGGATC GGCTGCGGGA ACTGCGCAAC CGCAGCCACC CCGACCTGAC CTGGGCGGAG
GATGAACATG CAGACAGCGA GGCCATGCTC TACCGGGTCT GGAACGAGGA ACTGCGCTAC
ACCGTGGCCG ACTCCCATGA GCTGAGCATC AACCGGGCCT ACTACCCGGA GCTGCGCAAG
GCCTTCGAGA TCTCGGGCGT GGAGGGACTG GCGTGGGCCT TCCCGCGCAC CGAGGACCTC
AGCCTCTATA ACGAGGCCGC CCGGTACTTC ACCGATCTGC GCCTGGAGGG CACGCTCTCG
ACGCTGCTGG AGGAGCACTT CGGCCACCTG GGCCGTTTCG ATTACGTGGG GTTCCGGGCC
TTCAACCGCC ACGTGGCGGA TCGACTGCCG CGCTACCGGC ACTGGTTCGA GGAGGCGGCC
GAGGAGTATG GGGTGGACTG GCGGCTGCTG GCGGCCATCG GCTATCAGGA GTCCCACTGG
GATCCGCAGG CGGTCTCACC CACCGGGGTG CGGGGCATCA TGATGCTCAC CCTGGATACT
GCCTCCATGC TGGGGGTGGA CAATCGGCTC GACCCCAAGC AAAGCATCTT TGGCGGCGCC
CGCTATTTCT CCCGGCTGCT CGAGCGCCTG CCCGAGGACA TTGAAGAGCC GCACCGGGCC
TGGATGGCCC TGGCCGCCTA CAACGTGGGC TACGGCCATC TGCAGGACGC GCGCCGGCTC
GCCCGCCAGC GCGGCTACGA CCCGAACGAC TGGCGAGTCA TCCGCGACCA CCTGCCGCTG
CTCAGCCAGC GCCAATGGTA TGTGCAGACC CGGCACGGCT ATGCGCGCGG TTGGGAGCCG
GTGCACTACG TGCGCAACAT CCGGCTCTAT TACCAGCTAC TGCAACGCAT TACCGAGCCC
GGGCGGCGCC AGGTGCCCGC GGGGGAAGCG CTGGGCGAGC CCCCGCTGCC GACACCCCCC
GCCCCGCCCG GGGCGCCGTT GCCGGCGGAC CCGCCGGCCG ACTAA
 
Protein sequence
MRIMAVRLVA GAITLALMAY AWLAWERARD PEPITILERV LERGELRVIT RISATTYYQT 
DKGRAGLEFE LAQAFAHRLG VQLRMLVAPD LEAIFAALDD GEADLAAAGL TYTESRGQRY
WFTPPYKDIT QQLVYRVGTP RPDDLSEIGP GELAVIANSS HADRLRELRN RSHPDLTWAE
DEHADSEAML YRVWNEELRY TVADSHELSI NRAYYPELRK AFEISGVEGL AWAFPRTEDL
SLYNEAARYF TDLRLEGTLS TLLEEHFGHL GRFDYVGFRA FNRHVADRLP RYRHWFEEAA
EEYGVDWRLL AAIGYQESHW DPQAVSPTGV RGIMMLTLDT ASMLGVDNRL DPKQSIFGGA
RYFSRLLERL PEDIEEPHRA WMALAAYNVG YGHLQDARRL ARQRGYDPND WRVIRDHLPL
LSQRQWYVQT RHGYARGWEP VHYVRNIRLY YQLLQRITEP GRRQVPAGEA LGEPPLPTPP
APPGAPLPAD PPAD