Gene Mlg_0413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0413 
Symbol 
ID4269452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp462504 
End bp463862 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content71% 
IMG OID638125143 
Productmicrocin-processing peptidase 1 
Protein accessionYP_741257 
Protein GI114319574 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0570132 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.116989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATA CCGTCACGCA CAGCAGTCGT ACCTCAGGCC TCCCCGCCAG CGCCGACATG 
GAGGCCCTCA TCCAACAGGC CCTGGACACC GCCCGCACCC TGGGCGCCAC CGGCGCCGAG
GCCGGCCTCG CCTTCGATCT CGGCCTCTCC GTCAACGTCC GCAAGGGCGA AGTCGACACC
TTGGAACACC ACCGGGACCG CGGCCTCAGC GTCACCGTCT ACTTCGGCCA GCGCAAGGGC
AGCGCCAACA CCGCCGACTT CCGCCCCGAA TCCATCCGCG AGACCGTCCA GGCCGCCTGC
GACATCGCCC GCTACACCTC CGAGGACCCC GCCCACGGCC TCGCCGACCC CGAACTCATG
CCCCGCCAGG TCCCCGAGCT GGACCTGGAA CACCCCTGGG CCCTGAACCC GGAAGAGGCC
ATCGACCTCG CCCGCCGCTG CGAAGCCGCC GGGCTGGCGG AAAAAGGCAT CACCAACTCC
GAGGGCGCGG GCGTGGCCAC CCACCACACC CTCCGGGTCT ACGGCAACAG CCACGGCTTC
CTCGGCCACT ACGCCGGCAC CCGCCACAGC ATGAACTGCG TCATGGTCGC CGGCGAGGGC
GACCACATGC AGCGGGACTA CTGGTACACC GTCGACCGCG TCCCCGAGGC CCTGGAACGG
GCCGAGGACG TCGGTCGCGA GGCGGCCCGG CGCACCCTGG CGCGAATGGG CGCCCGCCAA
CTGGGCACCC GGCGGGTGCC GGTCCTGTTC GCCCCGCCCA TGGCCCGGGG ACTCATCGGC
CACTTTATCG GCGCCATTCG CGGCGGCGCC CTCTACCGCA AGGCCTCCTT CCTGCTCGAC
CAGTTGGGCC AGCCGGTCTT CCCGGACTTC GTGCAGATGC GGGAAGAGCC CCACCGCCCG
CGTGGCCTGG GCAGCGTGCC CTTCGACCAT GAGGGCGTGG CCACCCGCGA GCGGACATTG
GTGCGCGACG GCGTGCTGCA GGGCTACGTG CTGGACAGCT ACTCCGCCCG CCGCCTGGGC
ATGCAGACCA CCGGCAACGC CGGCGGCGTG CACAACCTGG TGGTGGAACC AGGCCCCGAC
GACCAGGCCG CCCTGCTCAA GCGCATGGGG ACCGGGCTAC TGGTCACGGA GATGATGGGG
CAGGGGGTTA ACCCGGTCAC CGGCGACTAC TCGCGGGGGG CTACCGGCTT CTGGGTCGAG
GATGGCGAGA TCGCCCACCC GGTGCAGGAG ATCACCGTGG CCGGCAATCT GCGGGAGATG
TACGCCGGAC TCACCGCGGT GGGTTGCGAC GTGGACCGGC GCGGCAACAT CCACACCGGC
TCGCTGCTGG TGGATGCGAT GACCGTCGCC GGCGAATGA
 
Protein sequence
MTNTVTHSSR TSGLPASADM EALIQQALDT ARTLGATGAE AGLAFDLGLS VNVRKGEVDT 
LEHHRDRGLS VTVYFGQRKG SANTADFRPE SIRETVQAAC DIARYTSEDP AHGLADPELM
PRQVPELDLE HPWALNPEEA IDLARRCEAA GLAEKGITNS EGAGVATHHT LRVYGNSHGF
LGHYAGTRHS MNCVMVAGEG DHMQRDYWYT VDRVPEALER AEDVGREAAR RTLARMGARQ
LGTRRVPVLF APPMARGLIG HFIGAIRGGA LYRKASFLLD QLGQPVFPDF VQMREEPHRP
RGLGSVPFDH EGVATRERTL VRDGVLQGYV LDSYSARRLG MQTTGNAGGV HNLVVEPGPD
DQAALLKRMG TGLLVTEMMG QGVNPVTGDY SRGATGFWVE DGEIAHPVQE ITVAGNLREM
YAGLTAVGCD VDRRGNIHTG SLLVDAMTVA GE