Gene Information |
Locus tag | Mlg_1989 |
Symbol | |
ID | 4270463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2258791 |
End bp | 2260212 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638126745 |
Product | nodulation efficiency protein NfeD |
Protein accession | YP_742821 |
Protein GI | 114321138 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1030] Membrane-bound serine protease (ClpP class) |
TIGRFAM ID | |
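The coordinate, gene length, and protein length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS length that includes the stop codon; both are assumptions about this record's conventions, although they agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed for Mlg_1989.
length = gene_length_bp(2258791, 2260212)
print(length, protein_length_aa(length))
```

For Mlg_1989 this gives 1422 bp and 473 aa, matching the Gene Length and Protein Length rows.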
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00554582 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.0711803 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGAAGA ACGTCCTCGC AGTCCTGTTG CTTTATCTGG CCACCGCACT GCCCCTGGGC GCGGACGCGG ACCGGCTGGC CCTGGTGCTG GATGTGGAGG GCCCCATCGG CCCGGCCACC AGCCATTTCA TTGCCGGCGG TCTGAAAACC GCTGAGGAGC GCGACGCCGT GGTGGTCATC CTGCGTATGG ACACGCCGGG CGGCCTGGAC AGCGCCATGC GCGACATCAT CAAGGACATC CTCGCCTCTC CGGTTCCGGT GGCCACCTGG GTCGGGCCGG ACGGGGCCCG GGCCGCTAGT GCCGGCACCT ACATCCTCTA CGCCTCGCAC GTGGCGGCGA TGGCGCCGGC CACCAATTTG GGTGCGGCCA CGCCGGTGCA GATTGGGGGC GGGGGCGGCT TCCCGCCGGG CGGCCAGCCG GAACAGGAGG AGGCCGCGGA CGACAACGAC GAGGCGGCCA ACGGCAATGA CGAGGCACCT GAAGAGCGGC CCCGGCGCGA CCCCGGTTCG GCCACCGAGC GCAAGGTGTT GGAAGACGCG GTCTCCTACA TTCGCGGGCT GGCGGAATTG CGCGGCCGCA ACGCCGACTG GGCGGAGAAG GCCGTGCGCG AGGCCGCCAG CGCCAGCTCC AGCGAGGCGT TGGAGCTGGG GGTGATCGAG CACCGGGTCT CCAATCTGGA GGCGTTGCTG GAGGCCATGG ACGGGCAGAC GGTGCAGACC AGCGTTGGCG AGGTGACCCT GGATACCGCC GGCGCCACGC TGGAGGTGAT CGAGCCGGAC TGGCGCACCC AGTTGCTCTC TGTGCTCACC AACCCCAATG TGGCCTACAT GCTGATGCTG ATCGGCATCT ACGGCATCAT CTTCGAGCTG ATGAACCCGG GCAGCCTGGT CCCCGGTGTG CTGGGTGCCA TCTGCCTGCT GTTGGCCCTG TTTGCTTTCC AGGCACTGCC CATCAGCTAT GCCGGTATGG CGCTCATCCT GCTGGGGTTG GGCTTCATGG TGGCGGAGGC CTTCGCGCCC AGTTTCGGTA TCCTCGGTAT CGGCGGGGTG ATCGCCTTCA TATTGGGCTC GATCATGCTC TTCGATACCG ACGTGGAGGG CTTTCAGGTC TCCCTCGGGT TGATCGTCGG TTTCGGCATC GCCAGCCTGG TGATCGTGCT GGGGATCGCG ACCATGGCCC TGCGGGCCTG GCGTCGGCCA CGCAAGGGCG GCCGTGACAG CATTGTCGGC GCGCGCTGTG AGGCGGTGTC GGATTTCGAG CACAAGGGTA AGGTGCGCAT CCAGGGGGAG CTCTGGAATG CGTTCACCGA CCAGCCGGTC AAGGCCGGAC AGGCCCTGGT GGTGGTGGAT ATGGAGGGGC TGAACGTCAA GGTCGCTCCG GCGGAGACGG CCAGCCACCA CGAGGGGTTG CAACATACCT GA
|
Protein sequence | MLKNVLAVLL LYLATALPLG ADADRLALVL DVEGPIGPAT SHFIAGGLKT AEERDAVVVI LRMDTPGGLD SAMRDIIKDI LASPVPVATW VGPDGARAAS AGTYILYASH VAAMAPATNL GAATPVQIGG GGGFPPGGQP EQEEAADDND EAANGNDEAP EERPRRDPGS ATERKVLEDA VSYIRGLAEL RGRNADWAEK AVREAASASS SEALELGVIE HRVSNLEALL EAMDGQTVQT SVGEVTLDTA GATLEVIEPD WRTQLLSVLT NPNVAYMLML IGIYGIIFEL MNPGSLVPGV LGAICLLLAL FAFQALPISY AGMALILLGL GFMVAEAFAP SFGILGIGGV IAFILGSIML FDTDVEGFQV SLGLIVGFGI ASLVIVLGIA TMALRAWRRP RKGGRDSIVG ARCEAVSDFE HKGKVRIQGE LWNAFTDQPV KAGQALVVVD MEGLNVKVAP AETASHHEGL QHT
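The sequences above are displayed in blocks of 10 characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of that cleanup, together with a GC fraction calculation and a codon split. The helper names are hypothetical, and the input is only the first two blocks of the gene sequence, so the GC value it prints is for that excerpt and not the 67% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove whitespace from a sequence displayed in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into consecutive 3-base codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two blocks of the gene sequence shown above (an excerpt, not the full gene).
excerpt = clean_sequence("GTGTTGAAGA ACGTCCTCGC")
print(len(excerpt), gc_fraction(excerpt), codons(excerpt)[:4])
```

The first four codons are GTG, TTG, AAG, and AAC. GTG is an alternative start codon under translation table 11 and is read as methionine at initiation, which is consistent with the protein sequence beginning MLKN.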