Gene Information |
Locus tag | Mlg_0199 |
Symbol | |
ID | 4269645 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 231092 |
End bp | 233365 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638124923 |
Product | organic solvent tolerance protein |
Protein accession | YP_741044 |
Protein GI | 114319361 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
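
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The function names `gene_length_bp` and `expected_protein_length_aa` are hypothetical and are not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(231092, 233365)
print(length)                              # 2274
print(expected_protein_length_aa(length))  # 757
```

Both results agree with the Gene Length (2274 bp) and Protein Length (757 aa) fields. The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11, where GTG can serve as an alternative start codon.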
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0836973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGACGTC TGATTCCCAT TGCCATTACC GGTTCCCTGC TGTGGGGGGC GGCGGTCCAG GCCCAGGGGC CCACCGCCGC CGAGCGCGAG GCCTACTTCG CCGAGCGCCA GCGGGCCCTG TGTGGCCCAC CGCTGGTGAT GCCGCTGGAT GCGGTGGACA CCGCCCTGCG CCACCGGCCC GAGACCCCTG CCACCGTGGA TGCGGACGCT ATTTACTACG ATGGTGCGGC CGGGCGGTAC CGCTTCCGTG GCGATGTCCT GATGCAGCGC CTGGATCAGC GGCTGCGCAG CGAGGAGGTG CGCTACGATC ACGCGAGCGG TCGGGTCGAT CTGCCCTTCC CCTTCGTGTA CGAGGAGGCC GGGCTGGCGC TGACCGGCGA GAGCGGCTGG CTGCAGTTGC GCGAGGACCG CGGCGAGGTG GTGGCCGGCG AGTTCATGCT GGATGAGCGC AACATCCGCG GGCGGGCCGA GCGGCTGGAA CTGGCGGACG CCCAGCGCTC CCGTTACGAG GATGTGGGCT ACACTACCTG CCGGCCCGGT AACGAGGACT GGTGGCTGCA GGCCCGCGAG CTGGAGCTGG ATCGCGAGGA GGGGCTGGGT ACGGCCCGCC ACGCCTGGTT CACCTTCCTC AACGTGCCGT TGTTCTACAC CCCCTGGATC ACCTTCCCCA TCGACGACCG GCGCCGGACC GGGTTGCTGG CGCCGGGGTT TGCCACCTCG GACCGCCATG GCACGGACAT CACCGTGCCC GTCTATTGGA ACATCGCCCC CAACTACGAC GCCACCCTGG TGCCGCGCTG GATCGAGCGC CGTGGCGCCC TTCTGGGTGG TGAGTTCCGC TACCTGCAGG AGGCCTTCTC CGGCGAGCTC TACGGTGAAT ACCTGCCCAA TGACAGCCTC GCCCGGGACG ACCGCTGGCT GCTCGGCATC GACCACCGGG GGCGGTTGCC CCGGGGCTGG CGCTATGACG CCGATATCAA CCGGGCCAGC GACGGCGACT ACCTGCGGGA TTTCGGCAGT GGCCTGCTGG AGACCAGCTC CAGTCACCTG CAGAGCCGGG GGCGCCTGCG CAATCGCTGG AACGACTGGG CGGTGGCGGC CGAGGTCCAG CACTGGCAGA CCCTGGACGA CGACCTGCGC AATCCCTACC GGCGCGAGCC GCGCCTGACC GCGGACTACC AGGGCCCCTT CCGTGCCGGG CAACCGCGCT ACCGGCTGAA CACCGAATAC ACCCGCTTCG CCCTGCCCGA CACCGATGCC GACCGGCCCG AGGGTGAGCG CATGGACATT GCTCCGCGGG TGGAGTGGCG GTTGCACCGG CCCTGGGGCT ATCTGACACC GGCGGCCGCG CTGCGCCACA CCCAGTACCG GCTGGACGAC CCGGTACCGG GCGCGGACGA CCGCAGCCCC CGGCGCACCG TGCCCACCTT CAGTGTCGAC TCCGGTCTGT TCTTCGATCG CCCCTTCGAC TGGGACGGAC GCCCCATGGT GCAGACCCTG GAGCCACGGG TGTTCTACGT CTACACCCCG GAGCGCCGGC AGGACGACCT GCCGGTGTTC GACACCTCCC GCCGGGATTT CTTCTTTGAT GGCCTGTTTC GCGAGGACCG CTTCAGTGGC GCCGACCGGG TGGGGGATGC CGACCAGGTC ACCGTCGCAC TGACCACCCG CTTCGTCGAC CTGGGGGGCG GTCGGGAGTG GCTGCGTGCC AGCCTCGGCC AGATCCATTA CCGGCGCGAC CGCCAGGTGA CGTTGTTCCC CGAGACCGAC CGCGCGGCGG ACCGGCGTAG TCGGTCCGAT 
TATATGGCCG AGATGCGTAG CGAGTTACCG GGCGGGGTGC TGGCCCAGGG CGAGTACCGG TACAATCCCT ATGACAGCCG CTCCGAGCAG GGGGCGTTCC GGCTGGGCTG GCACCCGCGG CCGGACCTGT TGGTTGGCGC CGGCTACCGG ATGCGCTACG GCGATGAGGG CCGGGACGTG GAACAATCGG ACCTGGCCGC GGTCATCCCG TTGGGCCCCC GTTTCAGTCT GATCGGCCGT TGGCTCTATT CCCTCGCCGA CGACAACAGC CTGGAGACCG TCGGTGGGCT GGAGTACCGG ACCTGCTGCT GGCGGGTGCG GGCCATGGGC CGGCGCAGTT TCGAGGGGGC CGGCGCCGAG CCGGACACCT CTATTATGCT GCAGTTCGAG TTCACCGGCC TGGGGCAGGT GGACTCGGGC AGCACCGATT TCCTGCAGGA CAGCATCTAC GGCTATGAGG GCGACCGCTT TTGA
Protein sequence | MRRLIPIAIT GSLLWGAAVQ AQGPTAAERE AYFAERQRAL CGPPLVMPLD AVDTALRHRP ETPATVDADA IYYDGAAGRY RFRGDVLMQR LDQRLRSEEV RYDHASGRVD LPFPFVYEEA GLALTGESGW LQLREDRGEV VAGEFMLDER NIRGRAERLE LADAQRSRYE DVGYTTCRPG NEDWWLQARE LELDREEGLG TARHAWFTFL NVPLFYTPWI TFPIDDRRRT GLLAPGFATS DRHGTDITVP VYWNIAPNYD ATLVPRWIER RGALLGGEFR YLQEAFSGEL YGEYLPNDSL ARDDRWLLGI DHRGRLPRGW RYDADINRAS DGDYLRDFGS GLLETSSSHL QSRGRLRNRW NDWAVAAEVQ HWQTLDDDLR NPYRREPRLT ADYQGPFRAG QPRYRLNTEY TRFALPDTDA DRPEGERMDI APRVEWRLHR PWGYLTPAAA LRHTQYRLDD PVPGADDRSP RRTVPTFSVD SGLFFDRPFD WDGRPMVQTL EPRVFYVYTP ERRQDDLPVF DTSRRDFFFD GLFREDRFSG ADRVGDADQV TVALTTRFVD LGGGREWLRA SLGQIHYRRD RQVTLFPETD RAADRRSRSD YMAEMRSELP GGVLAQGEYR YNPYDSRSEQ GAFRLGWHPR PDLLVGAGYR MRYGDEGRDV EQSDLAAVIP LGPRFSLIGR WLYSLADDNS LETVGGLEYR TCCWRVRAMG RRSFEGAGAE PDTSIMLQFE FTGLGQVDSG STDFLQDSIY GYEGDRF
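
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any computation such as GC content. The following is a minimal sketch of that cleanup, assuming the only non-sequence characters are whitespace; the helper names `clean_sequence` and `gc_fraction` are hypothetical. The demonstration uses only the first three blocks of the gene sequence, so its result is not the GC content of the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence from this record.
first_blocks = "GTGAGACGTC TGATTCCCAT TGCCATTACC"
seq = clean_sequence(first_blocks)
print(len(seq))          # 30
print(gc_fraction(seq))  # 0.5
```

Applied to the full gene sequence, the same two calls would give a value to compare against the GC content field of this record.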