Gene Information |
Locus tag | Mlg_2681 |
Symbol | |
ID | 4269556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 3035976 |
End bp | 3037223 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638127440 |
Product | peptidase M42 family protein |
Protein accession | YP_743511 |
Protein GI | 114321828 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
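The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(3035976, 3037223)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Both results agree with the Gene Length (1248 bp) and Protein Length (415 aa) fields listed above.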
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000000000914613 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCAACC ACGCCATTCC GCCACCCTGG GTCCGGCCCA TGCCCGAGGC GCAGTTTCAG CTCATGCGCC GGATCCTTGC AGCGCCCAGC CCCGTGGGAC TGGAGGCGGC CATGACGGAG GGCGTGCTCT GCCCCCATTT CCGCTCCTTT GCCCCAGAGA GCTGGCAACT GCAGCGGTTC CAGGGACACG CCGGTATCGT GCTGGACACC CATCCCGGTG ACGACGAGCG CTTCCGGCTC ATGATTGTCG GTCACGCCGA CAAGATCCGC CTGCAGGTCC GGAGTATCGG CGACGACGGC AAGGTCTGGA TCAACAGCGA CGGCTTTCTG CCCGCCACCC TGATCGGCCA CGAGGTGCGC CTGTTCACCG AGAACCCCGG CCGTCCGGGC CACTACCGGG TGATCGACGG GGGCACGGTG GAGGCCCTCG GCGCCATCCA CTTCGCCGAG CCGGAGCTGC GCAGTGGCGA AAAGGGGGTG AAGAAAGAAC AGCTCTACCT GGAGCTGCAG GTGCACGGCG ATGACCGCAA GGCCCAGGTG GAGCACCTGG GGGTCCGGCC CGGCGATACC CTGCTGCTCA ACCGGCCCAT CCGGCGCGGC TTCAGCCCGA ATACCTTTTA CGGCGCCTAC CTGGACAACG GCCTGGGCAG CTTCGTCACG GCCGAGGCCG CCCGGCTGCT GGTGGAGGCC GGCGCCCCGG CGAGCATCCG TGTGTTGTTC GCCATTGCCG GGTATGAGGA GATCGGCTGC TTCGGCAGCC GGGTGCTGGC GGCCCACTAC CGGCCGGATG CCCTGATCGC GGTGGACGTG GAGCATGATT ATCGGGCCGC CCCGCAGGTG AGTGACCGGC GGCTGCCGCC CTTGGAGATG GGCAAGGGCT TTAGCCTGTC GGTGGGCTCC ATCGTCAGTG AGCAGTTGAA CCAAGTGATC GAGGAGGGGG CTCGGAGCCG TGGGATCCCC AGCCAGCGTG ACGTGGTGGG TCCGGATACC GGCACGGACG GCATGGCCGG GGTGCTGGCC AATGTTGACT GCGCCGCTGC CTCAGTGGGC ATCCCCATCC GCAACATGCA CACCATCTCC GAGACCGGCC ACACCTCGGA TGTGCTCGCG GCCCTGCACG GTGTGGTGGA GGCGGCCCTG GCGCTGGACG CCGCCGGCAC GGACCCGGAG GCGCTGCGTC GGCGCTTTCG CGAACACCAT CCGCGGCTGG ACCAGGCCGC CCCCCTGCGC CACCCGGGCC CCGCCTGA
|
Protein sequence | MPNHAIPPPW VRPMPEAQFQ LMRRILAAPS PVGLEAAMTE GVLCPHFRSF APESWQLQRF QGHAGIVLDT HPGDDERFRL MIVGHADKIR LQVRSIGDDG KVWINSDGFL PATLIGHEVR LFTENPGRPG HYRVIDGGTV EALGAIHFAE PELRSGEKGV KKEQLYLELQ VHGDDRKAQV EHLGVRPGDT LLLNRPIRRG FSPNTFYGAY LDNGLGSFVT AEAARLLVEA GAPASIRVLF AIAGYEEIGC FGSRVLAAHY RPDALIAVDV EHDYRAAPQV SDRRLPPLEM GKGFSLSVGS IVSEQLNQVI EEGARSRGIP SQRDVVGPDT GTDGMAGVLA NVDCAAASVG IPIRNMHTIS ETGHTSDVLA ALHGVVEAAL ALDAAGTDPE ALRRRFREHH PRLDQAAPLR HPGPA
|
| |
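The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and the short input string is a made-up example, not part of this record.

```python
def join_blocks(grouped: str) -> str:
    """Remove all whitespace from a sequence printed in blocks and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Illustrative input only; paste the Gene sequence field here to check the record.
example = join_blocks("ATGC GGCC")
print(example)               # ATGCGGCC
print(gc_fraction(example))  # 0.75
```

Applied to the full Gene sequence field, the joined string should be 1248 characters long if the record is internally consistent, and the GC fraction can be compared against the 69% reported in the Gene Information fields.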