Gene Information |
Locus tag | Mlg_1913 |
Symbol | |
ID | 4270114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2178996 |
End bp | 2180264 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638126669 |
Product | peptidase M42 family protein |
Protein accession | YP_742747 |
Protein GI | 114321064 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
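The Gene Length and Protein Length fields above can be checked against the coordinates. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions (which agrees with the reported 1269 bp) and that the final codon is a stop codon not counted in the protein. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record for Mlg_1913.
length = gene_length_bp(2178996, 2180264)
print(length)                     # 1269
print(protein_length_aa(length))  # 422
```

Both results match the Gene Length (1269 bp) and Protein Length (422 aa) fields listed in the record.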
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATG CCCCCTGGAC GAACCCCATG CCCGAGCCGC AATTCGAGCT GATGCGGCGC ATCCTGGCCG CCCCCAGCCC GGTGGGGCTG GAGGGCGCCA TGACCTACGG CGTGCTCAAG CCCCACTTCG AGGGGTTCGC GCCCGCGGAC TGGCACCTGC ACCAGTTCAA GGGCCACGCC GGCGTGGTGC TGGATACCCA CCCGGGCCGT GATGACCTGT TCAAGGTCAT GGTGATCGGC CACGCGGACA AGATCCGCAT GCAGGTCCGC TCCATCGGCG ACGACGGCAA GATCTGGATC AACACCGACG CCTTCCTGCC CAACGTACTG GTCGGCCACG AGGTCACGCT CTTCAGCGAG GACCCCGAGG CCCCGGGCCA ATACCGGCGC ATCGAGGGCG GCACCGTGGA GGCGCTGGGC GCCATCCACT TCTCCGACCC GAAGCAGCGC ACCGGCGAGC AGGGCATCAA GAAAGAGCAG ATCTACCTGG ACCTGCAGAT CCACGGCGAA AACAAAAAGC AGCAGGTGGA GAACCTGGGC GTGCGCCCCG GGGATTCGAT CCTGTTCAAC CGCCCCATCC GCCACGGTTT CAGCCCCGAC ACCTTCTATG GCGCCTACCT GGACAACGGC CTGGGCTGCT TCGTCACCGC CGAGGTGGCC CGGCTGATCG CCGAGGCCGG CGGCACGGAA AAGGTCAGGG TGTTGTTCGC CATCGCCAGC TACGAGGAGA TCGGCCGCTT CGGCAGCCGG GTACTGGCCG GGGAGCTCAA GCCCGATGCC ATCATCGCCG TGGACGTGAA CCACGACTAC GTGGCCGCCC CCGGTATCGG CGACCGGCGC ATGCAGCCGC TGGAGATGGG TAAGGGCTTC ACCCTGTCGG TGGGTGCCGT GGCCAGCGAG CAGCTCAACC GGATCATCGA AAGCACCGCC AAGGCGCAAC AGATCCCCAT GCAGCGCGAC GTTGTGGGGA ACGACACCGG TACCGACGGC ATGGCCGGCG TGCTCGCCTC CGTGGACTGC GTGGCCACCT CCATCGGCTT CCCGATCCGG AACATGCACA CCATCTCCGA GACCGGCAAC ACCCGCGATG TGCTGGCGGC CATCCACGCC ATCACCCGCA GCCTGCAGGC GCTGGACGCC CTGGCGGATC CGCATCGGGA GTTCCTGGAC AACCACCCAC GCCTGGACCA GGCCAATTCA CTGGGCCATC AGGGCGGAGA GAAGCCGGAT GACGGCGAGC CGTCCACAAC GCCGGAGAAA ACCACCTGA
|
Protein sequence | MSNAPWTNPM PEPQFELMRR ILAAPSPVGL EGAMTYGVLK PHFEGFAPAD WHLHQFKGHA GVVLDTHPGR DDLFKVMVIG HADKIRMQVR SIGDDGKIWI NTDAFLPNVL VGHEVTLFSE DPEAPGQYRR IEGGTVEALG AIHFSDPKQR TGEQGIKKEQ IYLDLQIHGE NKKQQVENLG VRPGDSILFN RPIRHGFSPD TFYGAYLDNG LGCFVTAEVA RLIAEAGGTE KVRVLFAIAS YEEIGRFGSR VLAGELKPDA IIAVDVNHDY VAAPGIGDRR MQPLEMGKGF TLSVGAVASE QLNRIIESTA KAQQIPMQRD VVGNDTGTDG MAGVLASVDC VATSIGFPIR NMHTISETGN TRDVLAAIHA ITRSLQALDA LADPHREFLD NHPRLDQANS LGHQGGEKPD DGEPSTTPEK TT
|
| |
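The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the sequence is pasted as shown in the record (blocks of ten bases separated by spaces) and uses the standard codon assignments, which translation table 11 shares for elongation. The special handling of alternative start codons in table 11 is not modeled here, which does not matter for this gene because it starts with ATG. The names `translate` and `gc_fraction` are hypothetical helpers, not functions from an existing library.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of ten) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGAGCAATG CCCCCTGGAC GAACCCCATG"))  # MSNAPWTNPM
```

The printed residues match the first ten residues of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence would give a value to compare with the reported GC content of 67%.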