Gene Mlg_1995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1995 
Symbol 
ID4270469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2263965 
End bp2265584 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content69% 
IMG OID638126751 
Productlytic transglycosylase, catalytic 
Protein accessionYP_742827 
Protein GI114321144 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1388] FOG: LysM repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0964783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.42715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTCACA CACGTATGAT CCCGCTATGC CTGGCGCTGG CCCTGGGCGC CAGCGGCTGT 
GCCAGCCTGG ACCCGCGCGG CGCCGACGGC GAATCCAAAC GGGGCCATCA CGCCCGTGTC
ATCCTCCCCG GCGAGCCCCT CGCCAATGTC CCCGTGCCGG AAGAAGAGCG CCGCCCCGCG
GAGAGCGCAG CCCCGGCGGC GGCGGAGGAT ATTTGGGCCC GGCTTCGTGA CGGCTTTCAA
CTGCCCGCGG TCACCCATCA GCGCATCGAC CAGGAACGCG CCCGGTTCAC CGGTCGTCAG
AACTACTTCG ACGCGGTGGG TCAGCGAGCC CGACCCTACC TCTACCACAT CCTCGGCGAG
CTGGAGGCGC GCGACCTGCC CACCGAATTG GTACTGGTCG CGATGGTGGA GAGCGCTTTC
CAGCCCTTCG CCTATTCCCA CGGTCGCGCG GCCGGCCTCT GGCAGTTCAT ACCCGCCACC
GGCAAGCACT TCGGCCTGGA GCAGAATTGG TGGTACGACG GCCGGCGCGA TGTCATTGCC
AGCACCGAGG CCGCCCTGAC CTACCTGGAC TACCTGCACG GGTTCTTTGA CGGCGACTGG
CTGCTGGCAC TGGCCGCTTA CAACGCCGGG GAGGGCCGGG TACAGCGGGC GGTCAGGGCC
AACCAGCGGG CCGGGAAGGC CACGGACTTC TGGAGCCTGA ACCTGCCGGC CGAGACCCGC
GCCTACGTCC CCCGGGTGCT GGCACTGCGC GACATCCTTG CCGAACCGGA CCGGTTCGGC
ATCCGGCTGC CGCCCATCGA CAACGAACGA CAACTGGCGG TGGTCGAACT GGAGCACCAG
TTGGACTTGG CCCTGGCGGC GGAGATGGCC GGCGTCGAGC TGGACAAGAT CTATCGCTAT
AACCCCGGTT TCAACCGCTG GGCGACGTTG CCGGAGGGCC GCCACCGCCT GGCCATCCCC
AAGGCGAGCA AAGAGCGCTT CACCGCTGCG CTGGCCGATC TCCACCCCTC CGAGATGGTC
CGCTGGCAAC GCCACGAGGT CCGGCGCGGT GAAACACTCA GCGGCATCGC CAGCCGTTAC
AACACCTCGG TCTCGGTTTT GCGCGACACC AATGACCTGA GCGGTGACCG AATCCGGGTC
GGACAGGCGC TGCTGGTCCC CACCGCGAGC CAGGGCAACG AGGCCTACAC CCTCAGCGCG
GACAACCGGC GCCGCGCCAA CCAGAACCGG CAGCGGGATG GGAGGCACAA GCTGGAGCAC
ACGGTGCGCC CCGGCGATAC CTTCTGGGAG CTCGCCCGGC GCCACGGCGT CAGCGTCCGG
GAACTGGCGG GGTGGAACGA TATGGCGCCC GGCGACCCCC TGCGGCCGGG CAACACGCTG
GTCATCTGGA GTGGCGACGG CGCTGCCGCC AACGCCCGCA ACAGCGGGCC GGGCGAGCGC
CTGCAGCGGG TCACCTATAC GGTGCGCAGC GGCGACTCGG TCTACACCAT CGCTCGGCGC
TTCAACGTCT CCATGCAGGA CGTGAAGCGC TGGAATAACC TCCGCTCGGG CCAGTACCTG
CAACCCGGCC AGACCCTGAC CCTGAACGTC GACGTCACCA ACCAGTCGGC CGGACTCTGA
 
Protein sequence
MVHTRMIPLC LALALGASGC ASLDPRGADG ESKRGHHARV ILPGEPLANV PVPEEERRPA 
ESAAPAAAED IWARLRDGFQ LPAVTHQRID QERARFTGRQ NYFDAVGQRA RPYLYHILGE
LEARDLPTEL VLVAMVESAF QPFAYSHGRA AGLWQFIPAT GKHFGLEQNW WYDGRRDVIA
STEAALTYLD YLHGFFDGDW LLALAAYNAG EGRVQRAVRA NQRAGKATDF WSLNLPAETR
AYVPRVLALR DILAEPDRFG IRLPPIDNER QLAVVELEHQ LDLALAAEMA GVELDKIYRY
NPGFNRWATL PEGRHRLAIP KASKERFTAA LADLHPSEMV RWQRHEVRRG ETLSGIASRY
NTSVSVLRDT NDLSGDRIRV GQALLVPTAS QGNEAYTLSA DNRRRANQNR QRDGRHKLEH
TVRPGDTFWE LARRHGVSVR ELAGWNDMAP GDPLRPGNTL VIWSGDGAAA NARNSGPGER
LQRVTYTVRS GDSVYTIARR FNVSMQDVKR WNNLRSGQYL QPGQTLTLNV DVTNQSAGL