Gene Information |
Locus tag | Mlg_2533 |
Symbol | |
ID | 4270172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2875802 |
End bp | 2877841 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638127292 |
Product | oligopeptidase A |
Protein accession | YP_743363 |
Protein GI | 114321680 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
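The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both are assumptions, though they are consistent with the values in this record. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mlg_2533.
length_bp = gene_length(2875802, 2877841)
print(length_bp)                  # 2040, matches "Gene Length"
print(protein_length(length_bp))  # 679, matches "Protein Length"
```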
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.437216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.440356 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA ATCCGCTGCT GCACGACGAG CCGCTGCCGC CCTTCCCCGA GATCCAGCCC GAGCACGTGG AGCCGGCCAT CGACGAGTTG CTGGCCCACT GCCGGCAGAC CCTGCGTGAG GTGCTGGAGC GGGGCGACTG GACCTGGGAC GGGCTGGTGG CGCCGCTGGA GGCCGCCGAC GAGCGCCTGA GCCGGGCTTG GTCGCCGGTC TCGCATATGA ACGCGGTGGT TAACAGCGAG GCGCTGCGCG CCGCCTACAA TGCCTGTCTG CCCAAGCTCA GCGCTTACGC CACCGAAGTG GGGCAGAACG CCGAACTGTG CGCCGCCTTC CACGCCCTGC GCGACAGTGA GGAGTACCAG GCGCTGGATA GTGCCCAGCA GCGCACCATC GACAATGCCC TGCGCGACTT CCGCCTCTCA GGCGTGGACC TGCCGGCGGA CCAGAAGCAA CGCTACGGAG AGATCGCCCA ACGCCTGTCC GAGCTCTCCG CCAAGTTCGG CGAGAACGTA CTCGACGCCA CCAACGCCTG GCACAAGGAC CTGTCGGACG CGGAGGTCCT GTCCGGCCTG CCCGACTCCT CCCTGGCGCT GGCCCGGCAG ACCGCTGAGC GCGCCGGAGT CGAGGGCTAC CGGATCAACC TGGAGTTCCC CAGCTTCTTC GCGGTCATCA CCTACGCCGA CGACCGCGCG CTGCGCCGCG AGGTTTACGA GGCCTGGAGC ACCCGGGCCT CCGAGCGGGG GCCCCACGGC GGCCAATGGG ACAACCTGCC GCTGATGGAG GAGATCCTGG CCCTGCGCCA CGAGAAGGCG CGGCTGCTGG GGTACGACAA CTTCGCCGAA CTCTCCCTGG CCAAGAAGAT GGCCGGCTCC ACCGATGAGG TCCTGGGCTT CCTGAATGAC CTGGCTGAGC GCGCCCGGCC CCGCGCCGAG GATGAGCTGG CCGAGCTGCG CCGCTTTGCC GGCGAGGAGC TGGGTCTGAC CGACCTCCAG GCCTGGGATA TCCCCTATGC CTCGGAGAAG CTGCGCCAGG CCCGCTTCCA ACTCTCGGAC GAGGACCTTC GTCCCTATTT CCCGGCCGAG CGGGTGATGG CCGGGCTCTT TGAGGTGGTG CAGCGGCTCT ACGGCCTGCA TATTGAGGAG CGCCAGGGCG TGCCCGTCTG GCACGAGGAC GTCCGCTACT ACGAGATCCG CGACCGGGAC GGCGATCTGC GCGGGGCCTT CTACACCGAT CTCTATGCCC GCCCCCACAA GCGCGGCGGC GCTTGGATGG ACGAGTGCCG GGCGCGGATG CGCCAGGGGG AGCGGGTGCA GGTGCCGGTG GCTTATCTCA CCTGTAACTT CACGCCGGCG GTAGGCGACC AGCCGGCACT GCTCACCCAC GGCGAGGTGA CCACGCTGTT CCACGAGTTT GGCCACGGGC TGCACCACAT GCTGACCCGG GTGGAGGCGC CGGCGGTGGC CGGGATCCGC GGGGTTGCCT GGGATGCGGT GGAGTTGCCC AGTCAGTTCA TGGAGAACTG GTGCTGGGAG CGCGAGGCGC TGGATCTGTT CGCGGCGCAC TATCAGACCG GGGCCCGGAT CCCGGAGGAT CTCTTCCGGC GTATGCGGGC GGCGCGCAAT TTCCAGTCGG CCATGCAGAT GGTGCGCCAG CTCGAGTTCT CGCTGTTCGA TTTCCGCCTG CACGCGGGGT ATGACCCGGA GCGGGGCGCG CGCATCTATC CGCTGTTGGA GGAGGTGCGC GAGCAGGTGG CGGTGGTCCG GCCGCCGGAG TGGAACCGCT TTGCCAACAG CTTCGGGCAT 
ATCTTTGCCG GCGGTTATGC GGCCGGCTAT TATAGCTACA AGTGGGCGGA GGTGCTGTCG GCGGATGCCT ATTCGCGGTT TGAGGAGGAG GGCATCTTCA ACCAACAGGC CGGGCATGAG TTCATGACCC ACATCCTGGA GAAGGGCGGC TCCGAGGACC CCATGGTGCT GTTCCGCAAC TTCCGCGGGC GGGCGCCGCG GATCGACGCC CTGTTGCGGC ATTCCGGGCT GGCGGCGTGA
|
Protein sequence | MSDNPLLHDE PLPPFPEIQP EHVEPAIDEL LAHCRQTLRE VLERGDWTWD GLVAPLEAAD ERLSRAWSPV SHMNAVVNSE ALRAAYNACL PKLSAYATEV GQNAELCAAF HALRDSEEYQ ALDSAQQRTI DNALRDFRLS GVDLPADQKQ RYGEIAQRLS ELSAKFGENV LDATNAWHKD LSDAEVLSGL PDSSLALARQ TAERAGVEGY RINLEFPSFF AVITYADDRA LRREVYEAWS TRASERGPHG GQWDNLPLME EILALRHEKA RLLGYDNFAE LSLAKKMAGS TDEVLGFLND LAERARPRAE DELAELRRFA GEELGLTDLQ AWDIPYASEK LRQARFQLSD EDLRPYFPAE RVMAGLFEVV QRLYGLHIEE RQGVPVWHED VRYYEIRDRD GDLRGAFYTD LYARPHKRGG AWMDECRARM RQGERVQVPV AYLTCNFTPA VGDQPALLTH GEVTTLFHEF GHGLHHMLTR VEAPAVAGIR GVAWDAVELP SQFMENWCWE REALDLFAAH YQTGARIPED LFRRMRAARN FQSAMQMVRQ LEFSLFDFRL HAGYDPERGA RIYPLLEEVR EQVAVVRPPE WNRFANSFGH IFAGGYAAGY YSYKWAEVLS ADAYSRFEEE GIFNQQAGHE FMTHILEKGG SEDPMVLFRN FRGRAPRIDA LLRHSGLAA
|
| |
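The sequence fields are printed in space-separated blocks of 10 characters, so they need light cleanup before use. The following is a minimal sketch, not the database's own tooling: it strips the block formatting, computes GC content, and translates the start of the gene sequence with the codon assignments of NCBI translation table 11 (whose amino acid assignments match the standard code). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first 30 bases of the gene sequence are used here, and the output is compared against the first 10 residues of the protein sequence.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11,
# with codons enumerated in T, C, A, G order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = clean("ATGAGCGACA ATCCGCTGCT GCACGACGAG")
print(translate(prefix))    # MSDNPLLHDE
print(gc_fraction(prefix))  # GC fraction of these 30 bases only
```

The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the record reports the minus strand. Under that reading, no reverse complement is needed before translating.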