Gene Mlg_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2166 
Symbol 
ID4270945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2461026 
End bp2462309 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content68% 
IMG OID638126922 
ProductHlyD family type I secretion membrane fusion protein 
Protein accessionYP_742998 
Protein GI114321315 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR01843] type I secretion membrane fusion protein, HlyD family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0862799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGAGG GCGCGGTAAC CGGGAAACGG CCGCTCTGGC GCTGGGCCCT GGCCCGGCTC 
TGGCATGGTC CGGATCGCGC AGCCGGCGCT TCGGCACCGG CCTGGTCGGG TGGCATCGGC
GGCTTGGAGC CGGCGGAGGA CGCAAGGGTG GCCCGGCGGA TCGGGCGTTT TATCGGTCTG
GTGGGCCTGT TTGTGGGTGC CTTTGTCGTC TGGGCCTATT GGGCGGAACT GGCCGAGGTC
TCCAGTGGCC AGGCGACGGT GGTACCCAGC CGCGGCACCC AGGTGATCCA GTCCCTGGAG
GGCGGGATCC TCCAGGAGCT GCTGGTGGCC GAGGGGGAGA TGGTGGAGCC GGGGCAACCG
CTGGCGCGCC TGGACCCGAC CCGCACCCAG GCCGACATGG ACGAGGTGAT CGCCCGCTAC
CAGGGGGCGC TGGCGCGAAA GGCGCGGCTG GAGGCGGAAC TGGCCGGTGA GGGCGAGATC
CGGTTTCCCG AAGAGCTGGA CCTCGCCTCG GAGGTGGTGG CCGCCGAGCG AACCCTGTTC
GAGGCGCGCC GTGCGCATCT GGAGCGCACT GAGCAGGACA TCCGGACATC GCTGCAGCTG
GTCGGTAACG AACGCTCGAT CACCGAGGAG CTGGTCCGTG CGGGTGCGGC GAGCGAGGTG
GAGTTGCTGC AGCTGCGCCG CTCCGAGGCG GATTTGCGCC GGGAGTTGAA TCAATTGCGT
AACGAGTTCC GGGTGCGCGC GCGCCAGGAC CTGGCAGAGA CCCGCACCGA GGTGGAGGCG
TTGCGCTCCA GCCTGCGCGG TCACGAGGAC ACCCGTCAGC GCCAGACCCT GCGCTCGCCG
GTGCGCGGCC GGGTGCAGAA CCTGGCGGTC ACCACCATCG GCGGCGTGCT GGCTCCCAAC
GGCGAGTTGA TGGAGATCGT GCCCCAGGAC GGGGAGTTGC GGATCGAGGC CCGCATCTCG
CCCCGGGACA TCGCCTACAT CCACCCCGGT CAGCGCGCCC AGGTGAAGAT CACGGCCTAC
GATTACGCTA TCTACGGCGG CTTGGAGGGC GAGGTGGTGA ACATCTCACC CGATACCGAG
CGCGACGAGA TCAACCCCGA GGAGGTCTAT TACAAGGTCT TCATTCACAC CGACAGCGAT
GAGCTGGTGG TGGAGAACGG TCAGCGTTTC CCCATCTCCC CCGGCATGGT GGCGGAGGTG
GATATCGAGA CCGGCCAGCG CACGGTCTTG CAGTATATTA TAAAGCCCTT TAACCGGGCG
CGGGAGGCCT TGCGGGAGCG GTGA
 
Protein sequence
MHEGAVTGKR PLWRWALARL WHGPDRAAGA SAPAWSGGIG GLEPAEDARV ARRIGRFIGL 
VGLFVGAFVV WAYWAELAEV SSGQATVVPS RGTQVIQSLE GGILQELLVA EGEMVEPGQP
LARLDPTRTQ ADMDEVIARY QGALARKARL EAELAGEGEI RFPEELDLAS EVVAAERTLF
EARRAHLERT EQDIRTSLQL VGNERSITEE LVRAGAASEV ELLQLRRSEA DLRRELNQLR
NEFRVRARQD LAETRTEVEA LRSSLRGHED TRQRQTLRSP VRGRVQNLAV TTIGGVLAPN
GELMEIVPQD GELRIEARIS PRDIAYIHPG QRAQVKITAY DYAIYGGLEG EVVNISPDTE
RDEINPEEVY YKVFIHTDSD ELVVENGQRF PISPGMVAEV DIETGQRTVL QYIIKPFNRA
REALRER