Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0412 |
Symbol | |
ID | 4269451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 460775 |
End bp | 462226 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638125142 |
Product | microcin-processing peptidase 2 |
Protein accession | YP_741256 |
Protein GI | 114319573 |
COG category | [R] General function prediction only |
COG ID | [COG0312] Predicted Zn-dependent proteases and their inactivated homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0307046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0847552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAG TAGTCGACCG ACTGGACATC GCCCGGCAAC GGATACTGGC GCCGGCCGGG TTGGAGGAGC AGCACCTGGA GCAGGCCTTT GCCCGGCTGA TGGGCCCGGG CGTCGATGCC GCCGACATCT ATTTCCAGAG CGCCCGCAGC GAGGGCTGGG TGATGGAGGA CGGCCGGGTC CGTGAAGGCA CCCGCGGCAT CGACCAGGGC GTGGGCGTGC GGGCGATCAG TGGCGAGCAG AGCGGGTTCG CCTACTCGGA CGAGATCGTC CTGCCGGCGC TGATGGAGGC CTCGGGCAGC GCCCGTGCCA TCGCCCGCAG CGGCCAGAGC GGCCGGCTGC AGGCCTGGCA CCGTGGCGAG GGCCACGCCC TCTATCCCAC CGACAACCCC CTGGACGGGC TGAGCGCCGA CGACAAGGTG GCCCTGCTCA AGGCCGTGGA CGCCGAGGCC CGGGCCCAGG ACCCGCGGGT CGAGCAGGTC ATCGCCACCC TCGGTGGCGT CCACGAGACC ATGCTCGTCG CCTGCGCCGA CGGCACCCTG GCCGCCGACG TTCGCCCGCT GGTGCGTTTC AACGTCAGCG TCCTGGTCCG CGAGGGCGAC CGGCGCGAGA ACGGCATGTG CGGGGGCGGC GGCCGGGTGA GCTACAGCTT CTTCCTCGAC CAGGACCGCG CCCTGGGCTA TGCCCGCGAG GCCGTCCGCC AGGCCCTGGT CAACCTGGAG GCCGAGGAGG CCCCGGCCGG CTCCATGCCT GTCGTGCTCG GCCCCGGCTG GCCCGGCGTG CTGCTCCACG AGGCCGTGGG CCACGGCCTG GAGGGCGACT TCAACCGCAA GGGCACCTCC GCCTTCGCCG GACGCATGGG CGAACGTGTC GCCTCACCGC TGTGCACCGT GGTCGACGAC GGCACCCTGG CCAACCGCCG CGGTTCGCTC AACGTCGACG ACGAGGGCAC CCCCACCCGC TGCACTACCC TGATCGAAAA GGGCGTACTC AAGGGCTTCA TGCAGGACAA GCTCAACGCC CGCCTGATGG GCACAGCCTC CACCGGCAAC TGCCGGCGAG AATCCTTCGC CCACCTGCCC ATGCCGCGGA TGACCAACAC CTACATGCTC CCCGGCCCCC ACGACCCGGA GGAGATTATC CGCTCGGTGG ACCACGGCCT CTACGCCGTC AACTTCGGCG GCGGCCAGGT GGACATCACC TCCGGCAAGT TCGTCTTCTC CGCTAGCGAG GCCTACCTCA TCGAGAAGGG CCGGATCACC ACCCCGGTCA AGGGCGCCAC CCTAATCGGC AACGGCCCCG ACGTCCTCAC CCGCGTCAGC ATGGTCGGCA ACGACCTGAA ACTCGACGGC GGCATCGGCG TCTGCGGCAA GGAGGGCCAG AGCGTCCCGG TCGGCGTGGG CCAGCCGACC CTCAAGGTCG ACGCCCTCAC CGTGGGCGGC ACCCGCGGCT GA
|
Protein sequence | MSAVVDRLDI ARQRILAPAG LEEQHLEQAF ARLMGPGVDA ADIYFQSARS EGWVMEDGRV REGTRGIDQG VGVRAISGEQ SGFAYSDEIV LPALMEASGS ARAIARSGQS GRLQAWHRGE GHALYPTDNP LDGLSADDKV ALLKAVDAEA RAQDPRVEQV IATLGGVHET MLVACADGTL AADVRPLVRF NVSVLVREGD RRENGMCGGG GRVSYSFFLD QDRALGYARE AVRQALVNLE AEEAPAGSMP VVLGPGWPGV LLHEAVGHGL EGDFNRKGTS AFAGRMGERV ASPLCTVVDD GTLANRRGSL NVDDEGTPTR CTTLIEKGVL KGFMQDKLNA RLMGTASTGN CRRESFAHLP MPRMTNTYML PGPHDPEEII RSVDHGLYAV NFGGGQVDIT SGKFVFSASE AYLIEKGRIT TPVKGATLIG NGPDVLTRVS MVGNDLKLDG GIGVCGKEGQ SVPVGVGQPT LKVDALTVGG TRG
|
| |