Gene Mlg_0199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0199 
Symbol 
ID4269645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp231092 
End bp233365 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content69% 
IMG OID638124923 
Productorganic solvent tolerance protein 
Protein accessionYP_741044 
Protein GI114319361 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0836973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACGTC TGATTCCCAT TGCCATTACC GGTTCCCTGC TGTGGGGGGC GGCGGTCCAG 
GCCCAGGGGC CCACCGCCGC CGAGCGCGAG GCCTACTTCG CCGAGCGCCA GCGGGCCCTG
TGTGGCCCAC CGCTGGTGAT GCCGCTGGAT GCGGTGGACA CCGCCCTGCG CCACCGGCCC
GAGACCCCTG CCACCGTGGA TGCGGACGCT ATTTACTACG ATGGTGCGGC CGGGCGGTAC
CGCTTCCGTG GCGATGTCCT GATGCAGCGC CTGGATCAGC GGCTGCGCAG CGAGGAGGTG
CGCTACGATC ACGCGAGCGG TCGGGTCGAT CTGCCCTTCC CCTTCGTGTA CGAGGAGGCC
GGGCTGGCGC TGACCGGCGA GAGCGGCTGG CTGCAGTTGC GCGAGGACCG CGGCGAGGTG
GTGGCCGGCG AGTTCATGCT GGATGAGCGC AACATCCGCG GGCGGGCCGA GCGGCTGGAA
CTGGCGGACG CCCAGCGCTC CCGTTACGAG GATGTGGGCT ACACTACCTG CCGGCCCGGT
AACGAGGACT GGTGGCTGCA GGCCCGCGAG CTGGAGCTGG ATCGCGAGGA GGGGCTGGGT
ACGGCCCGCC ACGCCTGGTT CACCTTCCTC AACGTGCCGT TGTTCTACAC CCCCTGGATC
ACCTTCCCCA TCGACGACCG GCGCCGGACC GGGTTGCTGG CGCCGGGGTT TGCCACCTCG
GACCGCCATG GCACGGACAT CACCGTGCCC GTCTATTGGA ACATCGCCCC CAACTACGAC
GCCACCCTGG TGCCGCGCTG GATCGAGCGC CGTGGCGCCC TTCTGGGTGG TGAGTTCCGC
TACCTGCAGG AGGCCTTCTC CGGCGAGCTC TACGGTGAAT ACCTGCCCAA TGACAGCCTC
GCCCGGGACG ACCGCTGGCT GCTCGGCATC GACCACCGGG GGCGGTTGCC CCGGGGCTGG
CGCTATGACG CCGATATCAA CCGGGCCAGC GACGGCGACT ACCTGCGGGA TTTCGGCAGT
GGCCTGCTGG AGACCAGCTC CAGTCACCTG CAGAGCCGGG GGCGCCTGCG CAATCGCTGG
AACGACTGGG CGGTGGCGGC CGAGGTCCAG CACTGGCAGA CCCTGGACGA CGACCTGCGC
AATCCCTACC GGCGCGAGCC GCGCCTGACC GCGGACTACC AGGGCCCCTT CCGTGCCGGG
CAACCGCGCT ACCGGCTGAA CACCGAATAC ACCCGCTTCG CCCTGCCCGA CACCGATGCC
GACCGGCCCG AGGGTGAGCG CATGGACATT GCTCCGCGGG TGGAGTGGCG GTTGCACCGG
CCCTGGGGCT ATCTGACACC GGCGGCCGCG CTGCGCCACA CCCAGTACCG GCTGGACGAC
CCGGTACCGG GCGCGGACGA CCGCAGCCCC CGGCGCACCG TGCCCACCTT CAGTGTCGAC
TCCGGTCTGT TCTTCGATCG CCCCTTCGAC TGGGACGGAC GCCCCATGGT GCAGACCCTG
GAGCCACGGG TGTTCTACGT CTACACCCCG GAGCGCCGGC AGGACGACCT GCCGGTGTTC
GACACCTCCC GCCGGGATTT CTTCTTTGAT GGCCTGTTTC GCGAGGACCG CTTCAGTGGC
GCCGACCGGG TGGGGGATGC CGACCAGGTC ACCGTCGCAC TGACCACCCG CTTCGTCGAC
CTGGGGGGCG GTCGGGAGTG GCTGCGTGCC AGCCTCGGCC AGATCCATTA CCGGCGCGAC
CGCCAGGTGA CGTTGTTCCC CGAGACCGAC CGCGCGGCGG ACCGGCGTAG TCGGTCCGAT
TATATGGCCG AGATGCGTAG CGAGTTACCG GGCGGGGTGC TGGCCCAGGG CGAGTACCGG
TACAATCCCT ATGACAGCCG CTCCGAGCAG GGGGCGTTCC GGCTGGGCTG GCACCCGCGG
CCGGACCTGT TGGTTGGCGC CGGCTACCGG ATGCGCTACG GCGATGAGGG CCGGGACGTG
GAACAATCGG ACCTGGCCGC GGTCATCCCG TTGGGCCCCC GTTTCAGTCT GATCGGCCGT
TGGCTCTATT CCCTCGCCGA CGACAACAGC CTGGAGACCG TCGGTGGGCT GGAGTACCGG
ACCTGCTGCT GGCGGGTGCG GGCCATGGGC CGGCGCAGTT TCGAGGGGGC CGGCGCCGAG
CCGGACACCT CTATTATGCT GCAGTTCGAG TTCACCGGCC TGGGGCAGGT GGACTCGGGC
AGCACCGATT TCCTGCAGGA CAGCATCTAC GGCTATGAGG GCGACCGCTT TTGA
 
Protein sequence
MRRLIPIAIT GSLLWGAAVQ AQGPTAAERE AYFAERQRAL CGPPLVMPLD AVDTALRHRP 
ETPATVDADA IYYDGAAGRY RFRGDVLMQR LDQRLRSEEV RYDHASGRVD LPFPFVYEEA
GLALTGESGW LQLREDRGEV VAGEFMLDER NIRGRAERLE LADAQRSRYE DVGYTTCRPG
NEDWWLQARE LELDREEGLG TARHAWFTFL NVPLFYTPWI TFPIDDRRRT GLLAPGFATS
DRHGTDITVP VYWNIAPNYD ATLVPRWIER RGALLGGEFR YLQEAFSGEL YGEYLPNDSL
ARDDRWLLGI DHRGRLPRGW RYDADINRAS DGDYLRDFGS GLLETSSSHL QSRGRLRNRW
NDWAVAAEVQ HWQTLDDDLR NPYRREPRLT ADYQGPFRAG QPRYRLNTEY TRFALPDTDA
DRPEGERMDI APRVEWRLHR PWGYLTPAAA LRHTQYRLDD PVPGADDRSP RRTVPTFSVD
SGLFFDRPFD WDGRPMVQTL EPRVFYVYTP ERRQDDLPVF DTSRRDFFFD GLFREDRFSG
ADRVGDADQV TVALTTRFVD LGGGREWLRA SLGQIHYRRD RQVTLFPETD RAADRRSRSD
YMAEMRSELP GGVLAQGEYR YNPYDSRSEQ GAFRLGWHPR PDLLVGAGYR MRYGDEGRDV
EQSDLAAVIP LGPRFSLIGR WLYSLADDNS LETVGGLEYR TCCWRVRAMG RRSFEGAGAE
PDTSIMLQFE FTGLGQVDSG STDFLQDSIY GYEGDRF