Gene Mlg_0653 details


Gene Information

Locus tag: Mlg_0653
Symbol:
ID: 4268265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 711031
End bp: 712260
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 74%
IMG OID: 638125402
Product: RND family efflux transporter MFP subunit
Protein accession: YP_741497
Protein GI: 114319814
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
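
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(711031, 712260)
print(length)                  # 1230
print(protein_length(length))  # 409
```

Both results agree with the Gene Length and Protein Length fields of this record.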


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.356164
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.176029
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCCG CCCTGATAAA GGCCCTACGC TGGTGGCTGC CCGCGTTGCT TATCACCGCC 
CTCGGGCTCT ACGCCCTGCA CCTCACCGGC CCCGACGGCG CCGGCCCCGG TCTGACCGGC
GAGCGTGCGG CGCAACCGGT GGCAGTCGAG ACCGCCCCGC TGACCCGCGG CCCGCTGGAG
GATGTGCGCC GTTTCACCGG CAGCCTGGAG GCGGCTAATC AGTTCGACCT GGCGGCGCGC
ACCGGCGGCC GGCTGCGCCA ATTGCGAGTG GACATCGGCG ATACGGTGGA GCACGGCGAA
CTCATCGCCC GTCTGGACAG CGAGGAGCAG GAACAGGCCG TGGCCGAGGC CCTGGCCGCC
CGTGACGTGG CCCGGGCCCA GCTCGCCGAG ACCCGCGCCG CGCTGGCCTC CGCGCGCAAG
GAGCTGGACC GTACCCGCGC CCTGCGCGAG CGTCAGGTGG CCTCCCAGGC GGAGCTGGAG
GCCGCCGAGG CGCGGGTGGC CGCCGAGCAG AGCCGCGAGC AACTGGCCCG GGCCCAGATT
GCCCAGCGGG AGGCCGCCCT GGCCGCCGCC CGGGTACGCC TGTCCTGGAC CGAGATCCGC
GCCGACTGGG AGGGCGGCGG CGAGACCCGG GTGGTGGGCG AGCGCTATCG GGACGAGGGC
GCCGCGCTGA ACGCCGGCGA CCCGGTGGTC TCGCTGATGG ACACCCGCAC CCTGCGCGCC
GTGGGCTTCG TCACCGAACG GGACTACGCC CACCTGAACC CCGGCCAGGC CGCCCGTCTC
CGGGTGGACA CCCATCCCGG CGAGGACTTC CCCGCCACCG TCCACCGGCT GGCACCGCGC
TTCAGTCCCG GCAGCCGCCA GGCCCGGCTG GAGCTGACCG TCCCCAACCC GGAGGGCCGG
CTGCAGCCCG GACTCTTCGC CCGTCTCCAC ATCACCGTCG GCGAGACCCG GGACGCCCTC
TGGGTGCCCC GCGACGCCTT GGTGCGACGT GGCGATGAGG TGGGTATCTT CCTGGTGGAT
GAGGATGTGG GCGACGACCA GCCGCCGCGG GCCCGTTACC ACACCGTCAC CACCGGGGTG
CGGGACGGCG ACCGGGTACA GATCCTCAGC CCGGCGTTGC AGGGCAATGT GGTCACCCTC
GGCCAGCACC TGATCCGGGA CGGCAGCCCG CTGCGGCCGG AACGGCTGAC CGATGCGCTG
GCCCGCCAGG ACGAGGAGCA GGAAGGGTGA
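
The GC content reported for this gene can be recomputed from the sequence above. This is a minimal sketch, assuming GC content is defined as the fraction of G and C among the unambiguous bases (A, C, G, T); the function name `gc_fraction` is hypothetical, and the spaces and line breaks used in the listing are ignored.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A, C, G, T; whitespace and other symbols are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base group of the gene sequence: 4 of 10 bases are G or C.
print(gc_fraction("ATGAAATCCG"))  # 0.4
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field of the record.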
 
Protein sequence
MKSALIKALR WWLPALLITA LGLYALHLTG PDGAGPGLTG ERAAQPVAVE TAPLTRGPLE 
DVRRFTGSLE AANQFDLAAR TGGRLRQLRV DIGDTVEHGE LIARLDSEEQ EQAVAEALAA
RDVARAQLAE TRAALASARK ELDRTRALRE RQVASQAELE AAEARVAAEQ SREQLARAQI
AQREAALAAA RVRLSWTEIR ADWEGGGETR VVGERYRDEG AALNAGDPVV SLMDTRTLRA
VGFVTERDYA HLNPGQAARL RVDTHPGEDF PATVHRLAPR FSPGSRQARL ELTVPNPEGR
LQPGLFARLH ITVGETRDAL WVPRDALVRR GDEVGIFLVD EDVGDDQPPR ARYHTVTTGV
RDGDRVQILS PALQGNVVTL GQHLIRDGSP LRPERLTDAL ARQDEEQEG
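
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a hedged illustration, not the database's own pipeline: it builds the codon table by hand rather than using a library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces and newlines of the listing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAAATCCG CCCTGATAAA GGCCCTACGC"))  # MKSALIKALR
```

Applied to the full gene sequence, the result can be compared residue by residue with the protein sequence listed above.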