Gene Mlg_1057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1057 
Symbol 
ID4270530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1229387 
End bp1232494 
Gene Length3108 bp 
Protein Length1035 aa 
Translation table11 
GC content67% 
IMG OID638125808 
Productacriflavin resistance protein 
Protein accessionYP_741899 
Protein GI114320216 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.828249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.914688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCT GGTTTGCCGC GCACCCCACC GCCGCCAACC TGGTGCTGGT CATTCTGGTG 
GCGGTCGGCC TGTTCGCGGC CCCCACCCTG CAGCGGGAGA CCTTCCCGGA CTACCGTCCC
GGCGAGGTCA GCATCGAGGT GGCGTACCGG GGGGCCAGCG CCGCCGACGT GGAGGACAAC
ATCTGCCGGC GTCTGCACGA TGCCGTCAAA GGCGTGGAAC ACCTGGACGA ATTCGTCTGT
ACCGCCCAGG ACAACCGCGC CTCGGCCACC GCCAGCATGG CCCCCCGCGG CACCATGGCG
CGGTTCCTGG GTGACATCGA CACCGAGGTC AAGGCCATTA CCGACTTCCC GGAGCGCGCC
GACCCGGCGG TGGTCCGGGA ACTGCACCGC ACCGACCTCG TGGCCGCTGT CGCGATCACC
GGGGACCTCT CCCTGGTCGA TCACGAGGCC TACGCCCAGC GGCTGGAGGA TCGCCTGATC
GCCCTGCCCG GGGTGGCCGA CGTCATCACG CATGGGCTCT CCCAGCGCCA GTGGCAGGTG
GAGGTAGGCC ATGACGCCCT CGCACAGCAT GGTCTGTCCG CCATTGAGTT GGCCGCCCGC
GTTGCCGCCC AGAATATCGA CAGCCCGCTG GGCACGCTGG AAACCCCTGA CCGGGAGATC
CTGCTGCGCT TCACCGACGA ACGCCACGAC CTCGTCGGCC TGGGGGAGCT GGTCGTGCTC
TCCGATGAAG GTGGTGGGGA ACTGCGATTG TCCGAAATCG CCCGACTGAG CGAGCACCGG
GAACACACGG AGGAGAAGGT GCTCATGAAT GGGCGACGCG CCGTCCTGCT GGAGGTGCAC
AAGGCCACCC ACGCCGATGC CCTGGATGTC ATGGAGGCTA TGCAAGGGTT GCTGGAAACG
GAGCGCGCAC GCTGGGGCGA GGCCCTGACG CTGACCGTCA CCCAAGATGC CACTTCGATC
GTCCGTGACC GGCTGCAAAT GCTGGTCTCC AACGGGCTGC TCGGGCTGAT CCTCGTATTG
CTAGTGCTCA GCCTGTTCTT CCGCCCGCGT CTGGCCCTTT GGGCCGGCAT CGGGCTGCCT
GTGGGGTTCC TGGGCGCGTT CACGGTCATG GCCTTGACCG GACTATCGCT CAACATGATC
ACCCTGGTCG CGCTGTTGAT GGCCATCGGC CTGGTAATGG ACCACGCCAT CGTGATCACC
GACAACATTT CCGCCCGCGC CCGAGCCGGC AGCCGTGCGC TGGAGGCCGT GGTGGAAGGT
GCCCGTCAGG TGCTGCCGGG GGTCATCTCC TCTTTCCTGA CCACGGTAGT CGTATTCACC
CCGCTGTCCT TCCTGGCCGG TGAGCTGGGC GCGGTTTTAC AGGTCTTGCC GGTGGTGCTC
ATCGCCGCAC TCACCGCCAG CCTGATCGAG GCATTCCTGG TGTTGCCCCA CCACCTCCGG
GGCGGTCTCG ACCGGATTCA GTCCAAAGGC AACTCACGGC TTCGGCAAGG TTTTGACCGC
GCCTTTGATC GCTTCCGGGA AGGTGTCGGC AGCCTGGCCG ACCGCGCCAT ACGCTTCCGC
CATGGGGTGC TGGCCGGCAC CCTGGTGATA CTGCTGGCGT CAGCCGGCTA CCTGGTCGGT
GGCCACCTGG GTACCGAGGC CATGCCCGAC ATCGACGGCG ATGTGCTGGA GGCCCGCATC
CTGATGCCCC AGGGAACCCC CCTGTCCCGC ACCGAGGCGG TGGCCAAACA GGTGAGCGAG
GCCATGGCAG AACTGAATGA GCGTCTCACA CCGGAGCAGC CGGACGGTGC CTCGCTGGTG
CGCAACCTGC AGGTGCGATT CAATCACAAC CCCAGTGCGG GTGAGCCCGG CCCCCATGTC
GCCACCCTCA GCGTCGACCT GCTCACCGCG GAGCGCCGCA CGGTGGACCT GGAAACTCTG
ACCGCTGCCT GGCGTGAGGC CATCGGCCCC ATCGCGGGCG TACACAGCCT GGTCATTCAG
GAGCCTGGCT TCGGCCCAGC CGGCACCCCC ATTGAGGTTC GCCTGAGCGG CGACGACCTC
GACGAACTTC AGCAGGCGTC GGAGCACCTG AAGGAGACCC TGGGCACGTA CCAAGGCGTG
TACAACGTGT TGCACGACCT GCGGCCCGGC AAGCCGCAAT ACCAGTTCCG CATGGCGGAA
GGCGCGCATG GGCTGGGGCT CACGGCGGAG GCAACCGCCC GCCAGCTACG CGCCGCCATA
CTGGGGGAAC TGGGGGGCAC ACTGCGCATC GGCGCCCACG ATGTGGAAAT CCTGGTCCGC
CACACCGAGC GGGACCGCAA CCGCCTGGAT GCACTGGAGG AGCTCACCGT GCTCGGGCCG
GACGGCCAAC GCATACCGCT GGCGGTGGCG ACGGAACGTA AGGCAGCCCG CGAGTGGGCG
CGCATTACCC GGGTGGACGG CCAACGCACA GTCACCGTGG AGGCCAACGT GAACCCCCGG
GTGGCGAATG CCCAGGCCAT CGTCAATGAC CTACAAAACC ATTGGCTGCC GGACTTCAAG
GCCGCTCACC CCACGCTGGC CGTGGGCTTC GAGGGCCAGG TGGCCCGTTC GGCGGAGACG
GGAGGGTCGG TGCGGCGGGC CCTGCTGATC GGGCTGATCG GGATCTACGT GATCCTCTCC
TTCCAGTTCC GCAGCTATGT GGAGCCGCTG CTGGTCATGG TGGCCATTCC CCTAGCCTTC
ATGGGCGCGC TCTGGGGCCA TGTGCTGATG GGCTACTACC TGTCCATGCC CTCGCTGGTC
GGCGCCGCCT CCCTGGCCGG CATCGTCGTG AACAACGCCA TCCTGCTCAT CCTGTTCATC
AAGGCCCACC GGGACGCGGG GCTGTCCGCC GTGCGTGCCG CCGGCCAGGC CAGCCGCGAC
CGGCTGCGGG CCATCCTGAT TTCGTCCGGC ACCACCACCG CCGGAGTGTT GCCCCTGCTG
GCGGAGTCCA GCACCCAGGC GGACGCCATC AAACCTTTGG TCATCTCTGT GGTCTTTGGC
CTGGCCACGT CCACCGTGCT GGTCCTGTTT GTCATCCCGG CCCTGTACGT GATCTTCGAC
GAGCACCGGG CCCGACGCCT CAGGTCATCC GCTGAAGAAC ATCCGTAG
 
Protein sequence
MIRWFAAHPT AANLVLVILV AVGLFAAPTL QRETFPDYRP GEVSIEVAYR GASAADVEDN 
ICRRLHDAVK GVEHLDEFVC TAQDNRASAT ASMAPRGTMA RFLGDIDTEV KAITDFPERA
DPAVVRELHR TDLVAAVAIT GDLSLVDHEA YAQRLEDRLI ALPGVADVIT HGLSQRQWQV
EVGHDALAQH GLSAIELAAR VAAQNIDSPL GTLETPDREI LLRFTDERHD LVGLGELVVL
SDEGGGELRL SEIARLSEHR EHTEEKVLMN GRRAVLLEVH KATHADALDV MEAMQGLLET
ERARWGEALT LTVTQDATSI VRDRLQMLVS NGLLGLILVL LVLSLFFRPR LALWAGIGLP
VGFLGAFTVM ALTGLSLNMI TLVALLMAIG LVMDHAIVIT DNISARARAG SRALEAVVEG
ARQVLPGVIS SFLTTVVVFT PLSFLAGELG AVLQVLPVVL IAALTASLIE AFLVLPHHLR
GGLDRIQSKG NSRLRQGFDR AFDRFREGVG SLADRAIRFR HGVLAGTLVI LLASAGYLVG
GHLGTEAMPD IDGDVLEARI LMPQGTPLSR TEAVAKQVSE AMAELNERLT PEQPDGASLV
RNLQVRFNHN PSAGEPGPHV ATLSVDLLTA ERRTVDLETL TAAWREAIGP IAGVHSLVIQ
EPGFGPAGTP IEVRLSGDDL DELQQASEHL KETLGTYQGV YNVLHDLRPG KPQYQFRMAE
GAHGLGLTAE ATARQLRAAI LGELGGTLRI GAHDVEILVR HTERDRNRLD ALEELTVLGP
DGQRIPLAVA TERKAAREWA RITRVDGQRT VTVEANVNPR VANAQAIVND LQNHWLPDFK
AAHPTLAVGF EGQVARSAET GGSVRRALLI GLIGIYVILS FQFRSYVEPL LVMVAIPLAF
MGALWGHVLM GYYLSMPSLV GAASLAGIVV NNAILLILFI KAHRDAGLSA VRAAGQASRD
RLRAILISSG TTTAGVLPLL AESSTQADAI KPLVISVVFG LATSTVLVLF VIPALYVIFD
EHRARRLRSS AEEHP