Gene Mlg_0412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0412 
Symbol 
ID4269451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp460775 
End bp462226 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content71% 
IMG OID638125142 
Productmicrocin-processing peptidase 2 
Protein accessionYP_741256 
Protein GI114319573 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0307046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0847552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAG TAGTCGACCG ACTGGACATC GCCCGGCAAC GGATACTGGC GCCGGCCGGG 
TTGGAGGAGC AGCACCTGGA GCAGGCCTTT GCCCGGCTGA TGGGCCCGGG CGTCGATGCC
GCCGACATCT ATTTCCAGAG CGCCCGCAGC GAGGGCTGGG TGATGGAGGA CGGCCGGGTC
CGTGAAGGCA CCCGCGGCAT CGACCAGGGC GTGGGCGTGC GGGCGATCAG TGGCGAGCAG
AGCGGGTTCG CCTACTCGGA CGAGATCGTC CTGCCGGCGC TGATGGAGGC CTCGGGCAGC
GCCCGTGCCA TCGCCCGCAG CGGCCAGAGC GGCCGGCTGC AGGCCTGGCA CCGTGGCGAG
GGCCACGCCC TCTATCCCAC CGACAACCCC CTGGACGGGC TGAGCGCCGA CGACAAGGTG
GCCCTGCTCA AGGCCGTGGA CGCCGAGGCC CGGGCCCAGG ACCCGCGGGT CGAGCAGGTC
ATCGCCACCC TCGGTGGCGT CCACGAGACC ATGCTCGTCG CCTGCGCCGA CGGCACCCTG
GCCGCCGACG TTCGCCCGCT GGTGCGTTTC AACGTCAGCG TCCTGGTCCG CGAGGGCGAC
CGGCGCGAGA ACGGCATGTG CGGGGGCGGC GGCCGGGTGA GCTACAGCTT CTTCCTCGAC
CAGGACCGCG CCCTGGGCTA TGCCCGCGAG GCCGTCCGCC AGGCCCTGGT CAACCTGGAG
GCCGAGGAGG CCCCGGCCGG CTCCATGCCT GTCGTGCTCG GCCCCGGCTG GCCCGGCGTG
CTGCTCCACG AGGCCGTGGG CCACGGCCTG GAGGGCGACT TCAACCGCAA GGGCACCTCC
GCCTTCGCCG GACGCATGGG CGAACGTGTC GCCTCACCGC TGTGCACCGT GGTCGACGAC
GGCACCCTGG CCAACCGCCG CGGTTCGCTC AACGTCGACG ACGAGGGCAC CCCCACCCGC
TGCACTACCC TGATCGAAAA GGGCGTACTC AAGGGCTTCA TGCAGGACAA GCTCAACGCC
CGCCTGATGG GCACAGCCTC CACCGGCAAC TGCCGGCGAG AATCCTTCGC CCACCTGCCC
ATGCCGCGGA TGACCAACAC CTACATGCTC CCCGGCCCCC ACGACCCGGA GGAGATTATC
CGCTCGGTGG ACCACGGCCT CTACGCCGTC AACTTCGGCG GCGGCCAGGT GGACATCACC
TCCGGCAAGT TCGTCTTCTC CGCTAGCGAG GCCTACCTCA TCGAGAAGGG CCGGATCACC
ACCCCGGTCA AGGGCGCCAC CCTAATCGGC AACGGCCCCG ACGTCCTCAC CCGCGTCAGC
ATGGTCGGCA ACGACCTGAA ACTCGACGGC GGCATCGGCG TCTGCGGCAA GGAGGGCCAG
AGCGTCCCGG TCGGCGTGGG CCAGCCGACC CTCAAGGTCG ACGCCCTCAC CGTGGGCGGC
ACCCGCGGCT GA
 
Protein sequence
MSAVVDRLDI ARQRILAPAG LEEQHLEQAF ARLMGPGVDA ADIYFQSARS EGWVMEDGRV 
REGTRGIDQG VGVRAISGEQ SGFAYSDEIV LPALMEASGS ARAIARSGQS GRLQAWHRGE
GHALYPTDNP LDGLSADDKV ALLKAVDAEA RAQDPRVEQV IATLGGVHET MLVACADGTL
AADVRPLVRF NVSVLVREGD RRENGMCGGG GRVSYSFFLD QDRALGYARE AVRQALVNLE
AEEAPAGSMP VVLGPGWPGV LLHEAVGHGL EGDFNRKGTS AFAGRMGERV ASPLCTVVDD
GTLANRRGSL NVDDEGTPTR CTTLIEKGVL KGFMQDKLNA RLMGTASTGN CRRESFAHLP
MPRMTNTYML PGPHDPEEII RSVDHGLYAV NFGGGQVDIT SGKFVFSASE AYLIEKGRIT
TPVKGATLIG NGPDVLTRVS MVGNDLKLDG GIGVCGKEGQ SVPVGVGQPT LKVDALTVGG
TRG