Gene Mlg_1989 details


Gene Information

Locus tag: Mlg_1989
Symbol:
ID: 4270463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2258791
End bp: 2260212
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 67%
IMG OID: 638126745
Product: nodulation efficiency protein NfeD
Protein accession: YP_742821
Protein GI: 114321138
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1030] Membrane-bound serine protease (ClpP class)
TIGRFAM ID:
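
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2258791
end_bp = 2260212

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1422 473
```

Under these assumptions the result agrees with the listed gene length (1422 bp) and protein length (473 aa).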


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00554582
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.0711803
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGAAGA ACGTCCTCGC AGTCCTGTTG CTTTATCTGG CCACCGCACT GCCCCTGGGC 
GCGGACGCGG ACCGGCTGGC CCTGGTGCTG GATGTGGAGG GCCCCATCGG CCCGGCCACC
AGCCATTTCA TTGCCGGCGG TCTGAAAACC GCTGAGGAGC GCGACGCCGT GGTGGTCATC
CTGCGTATGG ACACGCCGGG CGGCCTGGAC AGCGCCATGC GCGACATCAT CAAGGACATC
CTCGCCTCTC CGGTTCCGGT GGCCACCTGG GTCGGGCCGG ACGGGGCCCG GGCCGCTAGT
GCCGGCACCT ACATCCTCTA CGCCTCGCAC GTGGCGGCGA TGGCGCCGGC CACCAATTTG
GGTGCGGCCA CGCCGGTGCA GATTGGGGGC GGGGGCGGCT TCCCGCCGGG CGGCCAGCCG
GAACAGGAGG AGGCCGCGGA CGACAACGAC GAGGCGGCCA ACGGCAATGA CGAGGCACCT
GAAGAGCGGC CCCGGCGCGA CCCCGGTTCG GCCACCGAGC GCAAGGTGTT GGAAGACGCG
GTCTCCTACA TTCGCGGGCT GGCGGAATTG CGCGGCCGCA ACGCCGACTG GGCGGAGAAG
GCCGTGCGCG AGGCCGCCAG CGCCAGCTCC AGCGAGGCGT TGGAGCTGGG GGTGATCGAG
CACCGGGTCT CCAATCTGGA GGCGTTGCTG GAGGCCATGG ACGGGCAGAC GGTGCAGACC
AGCGTTGGCG AGGTGACCCT GGATACCGCC GGCGCCACGC TGGAGGTGAT CGAGCCGGAC
TGGCGCACCC AGTTGCTCTC TGTGCTCACC AACCCCAATG TGGCCTACAT GCTGATGCTG
ATCGGCATCT ACGGCATCAT CTTCGAGCTG ATGAACCCGG GCAGCCTGGT CCCCGGTGTG
CTGGGTGCCA TCTGCCTGCT GTTGGCCCTG TTTGCTTTCC AGGCACTGCC CATCAGCTAT
GCCGGTATGG CGCTCATCCT GCTGGGGTTG GGCTTCATGG TGGCGGAGGC CTTCGCGCCC
AGTTTCGGTA TCCTCGGTAT CGGCGGGGTG ATCGCCTTCA TATTGGGCTC GATCATGCTC
TTCGATACCG ACGTGGAGGG CTTTCAGGTC TCCCTCGGGT TGATCGTCGG TTTCGGCATC
GCCAGCCTGG TGATCGTGCT GGGGATCGCG ACCATGGCCC TGCGGGCCTG GCGTCGGCCA
CGCAAGGGCG GCCGTGACAG CATTGTCGGC GCGCGCTGTG AGGCGGTGTC GGATTTCGAG
CACAAGGGTA AGGTGCGCAT CCAGGGGGAG CTCTGGAATG CGTTCACCGA CCAGCCGGTC
AAGGCCGGAC AGGCCCTGGT GGTGGTGGAT ATGGAGGGGC TGAACGTCAA GGTCGCTCCG
GCGGAGACGG CCAGCCACCA CGAGGGGTTG CAACATACCT GA
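
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before any analysis such as a GC content calculation. A minimal sketch follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling, and only the first line of the listing is used as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "GTGTTGAAGA ACGTCCTCGC AGTCCTGTTG CTTTATCTGG CCACCGCACT GCCCCTGGGC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full listing through the same two functions gives a value that can be compared with the GC content field of the record.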
 
Protein sequence
MLKNVLAVLL LYLATALPLG ADADRLALVL DVEGPIGPAT SHFIAGGLKT AEERDAVVVI 
LRMDTPGGLD SAMRDIIKDI LASPVPVATW VGPDGARAAS AGTYILYASH VAAMAPATNL
GAATPVQIGG GGGFPPGGQP EQEEAADDND EAANGNDEAP EERPRRDPGS ATERKVLEDA
VSYIRGLAEL RGRNADWAEK AVREAASASS SEALELGVIE HRVSNLEALL EAMDGQTVQT
SVGEVTLDTA GATLEVIEPD WRTQLLSVLT NPNVAYMLML IGIYGIIFEL MNPGSLVPGV
LGAICLLLAL FAFQALPISY AGMALILLGL GFMVAEAFAP SFGILGIGGV IAFILGSIML
FDTDVEGFQV SLGLIVGFGI ASLVIVLGIA TMALRAWRRP RKGGRDSIVG ARCEAVSDFE
HKGKVRIQGE LWNAFTDQPV KAGQALVVVD MEGLNVKVAP AETASHHEGL QHT
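
The gene begins with GTG, yet the protein begins with M. This is consistent with translation table 11, in which GTG is one of several alternative start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It uses the amino acid assignments of NCBI table 11 (identical to the standard code) in the usual TCAG ordering; the function name `translate_cds` is hypothetical, and the code is an illustration rather than the pipeline that produced this record.

```python
BASES = "TCAG"
# Amino acid assignments for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiating codon is read as methionine
        else:
            amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGTTGAAGA ACGTCCTCGC AGTCCTGTTG"))  # prints: MLKNVLAVLL
```

The output matches the first ten residues of the protein sequence listed above.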