Gene Information |
Locus tag | Mlg_0558 |
Symbol | |
ID | 4270313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 605659 |
End bp | 607152 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125299 |
Product | PepA aminopeptidase |
Protein accession | YP_741402 |
Protein GI | 114319719 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
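The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the helper names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(605659, 607152)
print(length)                  # 1494
print(protein_length(length))  # 497
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.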
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.548083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0000733214 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACCACA CCGTCAAGAG CAAGACCGCC GATACCGTCA GCAGCCCCTG CGCCGTGGTG GGCGTGTTCG AGCGCCGCCG CCTGTCCCCT GCCGCCAAGG CCGTGGACGA GGCCAGCGGC GGCGCCATAA CCGCGGCCCT GAAGCGGGGC GATATCGAGG CCAAGCCCGG TCAGACCCGC CTGCTCACCG ATCTGGACAA CGTCAAGGCC GCCCGGGTTC TGCTGGTGGG CCTGGGCGTG GAGCGCGACC TGGACGAGCG CACCTACCGC AAGGCCGTCA CCGCCGCCGC CCAGGCGGCG CAGGACTGCG GCGCCGGTGA GGCGACCTTC TACCTGCCGG AGGTCGAGGT GAAGACCCGC GACCTGGCCT GGCGGGTCCA GCAGCTCGCC ATCGGCGTGA CCGGCGCCCT CTACCGCTTC GACGACATGA AGAGCGAGGC GGAAACGCCC AGGAAGCCGC TGAAGAAGCT CGCCCTGGGG GTGGCGGACA AGGCCGAGGC CAAAGTCGCC GATGAGGCCC TGGCGCAGGG GATGGCGGTC GGCCGGGGCA TGAGCCTGGC CCGCGACCTC GGCAACCTGC CCGGTAACGT CTGCACCCCC AGCTACCTGG CCGACCAGGC CAAGGCGCTG GGCAAGCGGT TCGACAAACT CAAGGTGCAG AGCCTGGACC GCAAGGACAT GAAGAAGCTG GGCATGGGCG CGCTGCTGGC GGTGGCCCAG GGCAGCCAGG AGGAGCCGCG CCTGATCGCC ATGGAGTGGA ACGGCGGCAA GAAGGACGAG CAGCCCTACG TGCTGGTGGG CAAGGGGATC ACCTTCGACA CCGGCGGCAT CTCCCTGAAA CCCGGGGCGG CCATGGACGA GATGAAGTTC GACATGTGCG GCGCCGCCAG CGTGTTCGGC ACCCTGCAGG CGGTGGCCGA GATGAACCTG CCCATCAACG TGGTGGGCGT GGTGCCGGCC AGCGACAACA TGCCCGACGG CAAGGCCACC CGCCCGGGGG ACATTATCCA GACCCTGTCC GGGCAGACAG TGGAGGTGCT CAACACCGAC GCCGAGGGCC GGCTGGTGCT GTGCGACGCC CTTACCTGGT CCGAGCGCTT CAAGCCCAAG GAGATCGTCG ACATCGCCAC CCTCACCGGG GCCTGCATCA TCGCCCTGGG TCACCACCGG AGTGCGGTGC TGGGCAACCA TCCGCCCCTG GTCAAGGCCC TGCAGGATGC CGGTGAGACC AGCGGCGACA AATGCTGGGA ACTGCCCCTG GACCCGGAGT ACGACGAGCA GCTCAAGAGC AACTTCGCCG ACATGGCCAA CATCGGTGGG CGACCGGCCG GCACCATCAC CGCCGCCAGC TTCCTGGCCC GCTTCACCAA GCGCTACCAG TGGGCCCACC TGGATATCGC CGGCACCGCC TGGCTCACCG GTGAGCAGAA AGGCGCCACC GGCCGGCCGG TGCCGCTGCT CACCCGGTAC CTCATGGACC GGGCCGGGGC CTGA
|
Protein sequence | MDHTVKSKTA DTVSSPCAVV GVFERRRLSP AAKAVDEASG GAITAALKRG DIEAKPGQTR LLTDLDNVKA ARVLLVGLGV ERDLDERTYR KAVTAAAQAA QDCGAGEATF YLPEVEVKTR DLAWRVQQLA IGVTGALYRF DDMKSEAETP RKPLKKLALG VADKAEAKVA DEALAQGMAV GRGMSLARDL GNLPGNVCTP SYLADQAKAL GKRFDKLKVQ SLDRKDMKKL GMGALLAVAQ GSQEEPRLIA MEWNGGKKDE QPYVLVGKGI TFDTGGISLK PGAAMDEMKF DMCGAASVFG TLQAVAEMNL PINVVGVVPA SDNMPDGKAT RPGDIIQTLS GQTVEVLNTD AEGRLVLCDA LTWSERFKPK EIVDIATLTG ACIIALGHHR SAVLGNHPPL VKALQDAGET SGDKCWELPL DPEYDEQLKS NFADMANIGG RPAGTITAAS FLARFTKRYQ WAHLDIAGTA WLTGEQKGAT GRPVPLLTRY LMDRAGA
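The sequences above are displayed in blocks of ten characters, so the spaces have to be removed before any computation. The sketch below shows one way to clean a block-formatted sequence, compute its GC fraction, and translate it. It assumes the codon-to-residue assignments of translation table 11 are the same as in the standard code (the two tables differ in their permitted start codons), and the function names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
RESIDUES = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): r for c, r in zip(product(BASES, repeat=3), RESIDUES)}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First two blocks of the gene sequence shown in the record.
fragment = clean_sequence("ATGGACCACA CCGTCAAGAG")
print(translate(fragment))  # MDHTVK
```

The translated fragment matches the first six residues of the protein sequence in the record. Applying `gc_content` to the full cleaned gene sequence gives a value to compare with the GC content field.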