Gene Mlg_1518 details

Gene Information

Locus tag: Mlg_1518
Symbol:
ID: 4269074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1731293
End bp: 1732531
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 72%
IMG OID: 638126276
Product: toxic anion resistance family protein
Protein accession: YP_742357
Protein GI: 114320674
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3853] Uncharacterized protein involved in tellurite resistance
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
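
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the record lists the gene as unspliced). The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1731293, 1732531)
print(length)                  # 1239
print(protein_length(length))  # 412
```

Both values agree with the Gene Length and Protein Length fields of this record.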


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.150292
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.25035
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAGA CCCCCGAGCC GCCCCAGGGC GGCGAGTTTC ACCTGGCATT GCCGCCGGTC 
GATGAGATCG CCGAGTCGGT GAAGCGCGGC GCCGAGGAGG CCGCCGCCGC CGACCGGGAG
GACCTGGCGC GGCAGGCCGA CACCTTTGTC CGCGACCTGC TCGAGGCGCT GCCCGATGAG
GACGCCCCGG CCGGCGCCGC TCGCCAGCGC GAGGTCATCG ACTACATGGG GATCGAGGTG
CAGCGCCAGG CGGCGCGGCG CAGCGCCATG CTGCAGGAGC CCATCCGTAA ACTCGCGCAT
CAGGGGGAGG AGGGCGGCCC GGTGGCCCGC ACCCTGCTGG ATCTGCGCCA GCAGATGTCG
GCACTGGATC CGCGGGGGCG GGACCTCGCC CCGGGGCTGC TGGACCGCCT GCTGGCGCGG
ATTCCGGGCG TGGGCACCAA GGTGCAGCGC TACTTCCGTC AGTTCGAGAC CGCCCAGCAG
GCGCTGGACG CCATTATCCG CGACCTGGAG ACGGGGGCCC GGATGCTGCG CCGGGACAAC
CTCACCCTGT CCGACGATCA GGCCGAGCTG CGGGAGGTGC TGGCCGAGCT GGCGAGCCAT
ATCGAGCTGG GCAAGCTGAT CGACGCCCGG CTGGTGGCGG CGGCGGAGGC GCTGCCGGAG
ACCGCCCCAC GCCGGGCCTT TATCGAGGAG GAGCTGCTCT TCCCGCTGCG CCAGCGTATC
GTCGACCTGC AGCAGCAACA GGCGGTGAGC CAGCAGGGCG TACTGGCGCT GGAGGTGGTA
ATCCGCAACA ACCGGGAACT GATCCGCGGG GTGGACCGGG CGATCAATGT CACTGTCTCG
GCGCTCAATG TGGCGGTGAC CGTGGCGCTG GGGTTGGCCA ATCAGCGCTT GGTGCTGGAC
CGGGTGGAGG GCCTGAACCG GACCACCTCG GACATGATTG CCGGCACCGC CCAGGCCCTG
CGGCGCCAGG GCGCTGAGAT CCAGACGCGC GCCGCGGCCA CCATGCTGGA CATGGAGCAG
CTCGAGGCGG CCTTCGAGGA TGTGCTGGGC GCCATTGACG CCCTGTCCCG CTACCGGCAG
GAGGCCCTGC CGCGCCTGGA TGAACAGATC GACCGCCTGG ATACCCTGGC GCGCCGGGGC
CAGGGGGCGA TCGAGCGGCT GGAGCAGGGC AACCAGGCGT GGTCGGAGGA TGAGGCGCCG
GACGCCGGCG AGGGGGAGGG CGGCCGTCCC CGCGGTTGA
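
The GC content field can be recomputed from a sequence listed in 10-base blocks like the one above. This is a minimal sketch: the helper names are hypothetical, and the example call uses only the first 10-base block, so the full gene sequence would have to be passed in to compare against the 72% reported in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 10-base block of the gene sequence only, not the whole gene.
print(gc_fraction("ATGGCGAAGA"))  # 0.5
```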
 
Protein sequence
MAKTPEPPQG GEFHLALPPV DEIAESVKRG AEEAAAADRE DLARQADTFV RDLLEALPDE 
DAPAGAARQR EVIDYMGIEV QRQAARRSAM LQEPIRKLAH QGEEGGPVAR TLLDLRQQMS
ALDPRGRDLA PGLLDRLLAR IPGVGTKVQR YFRQFETAQQ ALDAIIRDLE TGARMLRRDN
LTLSDDQAEL REVLAELASH IELGKLIDAR LVAAAEALPE TAPRRAFIEE ELLFPLRQRI
VDLQQQQAVS QQGVLALEVV IRNNRELIRG VDRAINVTVS ALNVAVTVAL GLANQRLVLD
RVEGLNRTTS DMIAGTAQAL RRQGAEIQTR AAATMLDMEQ LEAAFEDVLG AIDALSRYRQ
EALPRLDEQI DRLDTLARRG QGAIERLEQG NQAWSEDEAP DAGEGEGGRP RG
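
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard NCBI codon ordering (bases in the order T, C, A, G) and the amino acid assignments shared by translation tables 1 and 11. It does not model the alternative start codons of table 11, which does not matter here because the gene starts with ATG. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGCGAAGA CCCCCGAGCC GCCCCAGGGC"))  # MAKTPEPPQG
print(translate("CGCGGTTGA"))                          # RG
```

The two outputs match the first ten and the last two residues of the protein sequence listed above.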