Gene Mlg_0437 details

Gene Information

Locus tag: Mlg_0437
Symbol:
ID: 4268290
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 484951
End bp: 486492
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 68%
IMG OID: 638125167
Product: peptidase M23B
Protein accession: YP_741281
Protein GI: 114319598
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
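
The coordinate and length fields above are consistent with one another, and the short sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 484951
END_BP = 486492


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1542 513
```

Both values agree with the Gene Length and Protein Length fields of the record.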


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000637591
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.0934933
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCAACA CCCGCATTTA CGATCTCGAT TTCAAGAAAC GCCCGCGTTC CCGCTTCCCC 
GCGGGCATCC GTCTACCCTT CAAGCTCCGC TGGCTGGCTA TCAGTGCCAT CGGTGTGGCA
GTGGGCGCGC TCGCCCTGAC CGGCCTCGGC CCCGAAGGCG ATGGTGCGGA GCCCTCGCTT
ACCGCGGAAC TGCCGCAGTA CCGGGCAGCG CCGTCGGTGG ACGTGCTTCA ACTGGCGGGC
ATCGAATGGC CGCAGGGCGG TTTCCTGCAC GCCAGCTACG AACCGGGTCG TGACGAGGGC
GGTGACGGCG ATCTGGGCTC ACGGGGCACC GGGTGGCTGG ACGAACAGGA CTGGCTGGAT
AACGGCACCC TGGCGGAGCC GGGGCGCGAC GACGGCCCGA CGGACCCGCT TGCCCACCTG
GATGGGCTGG ATTGGGAATC CCTCAAGGTC CGCAGTGGTG ACAGCCTGGC GCGGCTGTTC
AACTGGGCGG GCTTTTCGGC CCGCGAGGTG CACGACCTGA TGCAGGCCGG AGAGGAAGCC
GAGCGGCTGA CGCGCGTGCA TCCGGGCGAC ATCATCGAAG TGGTCCGGGA TGGCGACGAC
CGACTGGCGC ATCTGCGCTA CGAGTTCAGC CGCGGGCAGA CACTCTATAT CGAGCGCACC
GAGGAGGGGT TCCAGGCCCA GACCTTCCAG GAGGCCGAGG AGCGGCAGGT GGCTCGCGCC
TCGGTCACGG TGGACTCCTC GCTCTATATT GCCGGCCGGC GTGCCGGGCT CAGCAACCGG
CTGATCATGC AGCTTGCTTC CGTGTTCGGC CAGCAGCTCG ACCTGGGGCG CGATCTGCGT
GCCGGCGACG AGTTCCACCT GGTCTACGAG GAGATCTACC AGAACGGGGA GAAGGTGCGT
GACGGCCACA TTCTCGCCGC GGAGCTGGTG CATCGGGGTG AGCGGTTGCA GGCGGTGCGC
TATGCCCCGC CCGGCGCCGA CCCCGACTAC TACACCCCGG AGGGGGAGAG CCTGCGCCGG
GCCTTCAACC GCCACCCCAT CGACTACGAC CGGATCACCT CGCACTTCGA TCTCAACCGC
AAGCACCCGG TGCTGGGGGT ACGCCGCCCC CACTACGGCA CGGACTACGC CGCCCCGGTG
GGCACACCCA TCCGGTCCAC GGGATCCGGG CGAGTGGTGC ATCGCGGCTG GAAGGGGGGC
TACGGCCGCA CCGTGATCAT CCAGCACGGC AGCGAATACA CCACCCTGTA CGCCCACATG
TCCGGTTACG CCAGCGGCCT GTCCCAGGGC GATCGGGTCC GGCGGGGTCA GGTGGTGGGT
TACCTGGGTG GCTCTGGCAT GGTCACCGGG CCGCACCTGC ACTTCGAGTT CCACGTCAAC
GGGAACCCGC GGGATCCGCT CAAAGTTGCC CTGCCCAAGG CCGACCCGAT CCCGCAGGAG
CACATGGCGG ACTTCCGGGC CACCACTCAT CCGATGCTCG CCCAGCTGGA ACGCGATCAG
CGCGAGTCGG CGACTCAGGT GGCGCAGCAG AGCGGAGAGT GA
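
The gene sequence is printed in blocks of ten bases separated by spaces. A reader who wants to check the reported GC content against it could use something like the minimal sketch below. The helpers `clean_sequence` and `gc_fraction` are hypothetical names, and only the first line of the sequence is pasted in to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste the full listing
# to compare against the GC content field of the record.
first_line = "GTGTCCAACA CCCGCATTTA CGATCTCGAT TTCAAGAAAC GCCCGCGTTC CCGCTTCCCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```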
 
Protein sequence
MSNTRIYDLD FKKRPRSRFP AGIRLPFKLR WLAISAIGVA VGALALTGLG PEGDGAEPSL 
TAELPQYRAA PSVDVLQLAG IEWPQGGFLH ASYEPGRDEG GDGDLGSRGT GWLDEQDWLD
NGTLAEPGRD DGPTDPLAHL DGLDWESLKV RSGDSLARLF NWAGFSAREV HDLMQAGEEA
ERLTRVHPGD IIEVVRDGDD RLAHLRYEFS RGQTLYIERT EEGFQAQTFQ EAEERQVARA
SVTVDSSLYI AGRRAGLSNR LIMQLASVFG QQLDLGRDLR AGDEFHLVYE EIYQNGEKVR
DGHILAAELV HRGERLQAVR YAPPGADPDY YTPEGESLRR AFNRHPIDYD RITSHFDLNR
KHPVLGVRRP HYGTDYAAPV GTPIRSTGSG RVVHRGWKGG YGRTVIIQHG SEYTTLYAHM
SGYASGLSQG DRVRRGQVVG YLGGSGMVTG PHLHFEFHVN GNPRDPLKVA LPKADPIPQE
HMADFRATTH PMLAQLERDQ RESATQVAQQ SGE
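
The protein sequence follows from the gene sequence under translation table 11, and the sketch below illustrates this on the first 30 bases. It assumes the usual description of table 11: the same amino acid assignments as the standard code, with alternative initiation codons such as the GTG that opens this gene read as methionine. `translate_cds` is a hypothetical helper, not a tool provided by the source database.

```python
from itertools import product

# Standard codon ordering: first base, then second, then third, each over TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for table 11; all are read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is decoded as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate_cds("GTGTCCAACA CCCGCATTTA CGATCTCGAT"))  # MSNTRIYDLD
```

The output matches the first ten residues of the protein sequence listed above.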