Gene Mlg_1046 details

Gene Information

Locus tag: Mlg_1046
Symbol:
ID: 4270519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1197413
End bp: 1198765
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 68%
IMG OID: 638125798
Product: lytic transglycosylase, catalytic
Protein accession: YP_741889
Protein GI: 114320206
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4623] Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein
TIGRFAM ID:
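
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mlg_1046.
length_bp = gene_length(1197413, 1198765)
print(length_bp)                  # 1353
print(protein_length(length_bp))  # 450
```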


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.872
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0396105
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCACGT GGATTGCCAT CCTGGCGGTG GTGCTAGTCC TGTTGCTCAA CGCCTGTACC 
GACGGGCCGG AGGACGGCCC CCGGCTGGAG CACCCCGACG AGACGGGGGT GCTGCGGGTG
GTGACGCGCA ATAGCGCCAC CACCTATTAC CTGGACCGTC ACGAGCGGGC GGTAGGGCCG
GAGGTGGCGC TGGTGGAGGC GTTCGCGGAC CACCGGGGCT GGACGGTGGA CTGGACCGTG
GCGGCCACCA CGGCGGAAGT GCTGGATTAC CTGGAGGCCG GCGCCGCCCA CCTGGCCGCT
GCCGGGCTGA CGCACTTGGA TTCGCGCAAC CAACGCTTCG AGCGCGGCCC GGCCCACACC
GAGATTACCC AGCAGGTGGT CTGTCACCGG GACCGGACCG ACAAACCCCG CAGCCCGGAA
GACCTGGAGG AGGTGGTGTT AAAGGTGACG GCCGCCTCCA GTTACGTGGA ACGGCTGGAG
GTCCTGGCGG AGCGCTATGA CGCATTGACC TTCCAGGAGG ATCAGCGCGG TTCGGAACAA
CTGCTGATGG CGGTGGAGGA GGGGCGGTTG GCGTGCACCG TTGCCGACTC CAATATCGTG
CGCCTCAACC GCCGCTACCT GCCCCACCTG GACGTCACGA TGGATCTGAC CGAGGGCCAG
AACCTGGGCT GGTACCTGGC CGAGGGCCAG GAACGCCTGG CGCAGCAGGC CTTTGAGTGG
ATGAACAGCC GCGCCGGGGA CGAGGTCATC GCGGCGATGG AGAACCGCTA CTATACCTAT
GTCGGCGAGT TCGACTTCGT CGATCTGCGG GCGCTGAAAC GGCGGATGGA AAGCCGCCTG
CCGCGCTATC AGCGCCATTT TGAGCAGGCC GAGGCGGAGA CCGATATGCC GGCGGACCTG
CTCGCCGCGC TGGCCTACCA GGAATCGCAC TGGGATCCAC AGGCGCGCAG TCCCACCGGG
GTGCGTGGCA TGATGATGCT GACCGGCCGC ACGGCCGAGT CCCTGGGTGT GAATGACCGC
CTGGACCCGG AGCAGAGCAT CATGGGCGGG GCCCGCTACC TCGCCGACCG GCACGAGCGG
TTGCCCGAGC ACATCCCTGA GCCGGACCGT ACCTTCCTGG CCCTGGCCAG CTACAACGTC
GGTCGCGGGC ATCTTTTGGA CGCCCGCCAG CTGGCCCGCG ACCTGGGCCG GGACCCGGAC
GACTGGCAGG AGATGCGCGA GGTACTGCCC CTGCTCTCCG ACGAGCGGTA TTACCCGAAC
CTGCGCTACG GCTACGCCCG CGGCTATGAG CCGGTGCACT TCGTCGCCCG TATCCGCAAC
TACCGGGATG TGATCCGGCA GGCGTTCGAG TGA
 
Protein sequence
MRTWIAILAV VLVLLLNACT DGPEDGPRLE HPDETGVLRV VTRNSATTYY LDRHERAVGP 
EVALVEAFAD HRGWTVDWTV AATTAEVLDY LEAGAAHLAA AGLTHLDSRN QRFERGPAHT
EITQQVVCHR DRTDKPRSPE DLEEVVLKVT AASSYVERLE VLAERYDALT FQEDQRGSEQ
LLMAVEEGRL ACTVADSNIV RLNRRYLPHL DVTMDLTEGQ NLGWYLAEGQ ERLAQQAFEW
MNSRAGDEVI AAMENRYYTY VGEFDFVDLR ALKRRMESRL PRYQRHFEQA EAETDMPADL
LAALAYQESH WDPQARSPTG VRGMMMLTGR TAESLGVNDR LDPEQSIMGG ARYLADRHER
LPEHIPEPDR TFLALASYNV GRGHLLDARQ LARDLGRDPD DWQEMREVLP LLSDERYYPN
LRYGYARGYE PVHFVARIRN YRDVIRQAFE
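
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of removing the whitespace, computing a GC fraction, and translating a coding sequence. It assumes the amino-acid assignments of translation table 11, which match the standard code, and it does not handle alternative start codons. All function names are illustrative and are not part of the source database.

```python
# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(block: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean_sequence(block)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGCGCACGT GGATTGCCAT CCTGGCGGTG GTGCTAGTCC TGTTGCTCAA CGCCTGTACC"
print(translate(first_line))  # MRTWIAILAVVLVLLLNACT
```

Passing the whole gene sequence block to `gc_fraction` and `translate` gives values that can be compared with the GC content field and the protein sequence listed in the record.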