Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2365 |
Symbol | |
ID | 4270704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2683824 |
End bp | 2685023 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 638127123 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_743195 |
Protein GI | 114321512 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0471008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.00367269 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGCGC GCCGCGTGCA ACTGCTCAAG GTGGTCTCGG GGCTGGATAT CGGGGGGGTG CGTAGCGCAG AGCTCGGTTT CACCGCCGAG TTGCAGGCGC GGGGGGTCGG CGTTGGCGGC GTAATCGTGA CCGGGGGCGC CATGGCCGCC CGTTACGCCC GGCTGTTTGA CGGCCACTGG CTGCTGGACA CCCGTCTGCC CGAGTTCCGA GGCGGACGGT GGGAGCGGCT GCGCCGAACG GTGGTAAACC TGGGGCGCAG CCGGCAGGGC GCCCGGCGGC TGGCCGGTGA CCTTCGCGGG ACCCGGTGGG CGTCGGCGGG CACGGTGGTG GCGGTGCGCC ATGCGCCGCT GCTGCCTCTG GCCGCGCTGC TGGCCCGCCG GCTCGACCTC CCGTTGCTCT GGCACATGCC CAACCCGGTC CACGACCGGC TCGGCCGGGC CTATTTCCAG GCCCTGCTGC GTCTGGCCGG CGGCACGGCG GTGGGCAATA GCCGCTACAC CCTGGCCAGC CTCGGCGCCG GCGATGGCCC GGTGATCTAT CCCGGCTTCT CCCCCGCCCG GGTGGCCCTC GACGAGACCG CCCCCGAGCT GCGGGCGTCG CTGGGCATCC CGGCCGCGGC GCCGGTATTC GCGGTGGTGG CCCGGCTCAC CCCGGACAAG GCCACTGACT GGGTGCTGGC GGCCTTTCTC GATTCGGCGG CCTTTCGCAA CGGGGCGCAT CTGCTGATCG CCGGCGGCCC GCTGGACTCC CGGTTCGCCC TTGACCTAAG GGCTCACGCC GGTGAGGCGG GCGACGGCCG GGTGCACTTT CTCGGCGAGG TCGAGCAGGT GGCGCCGGTT TACCGGGCCG CGGATGTGCT GGTGGCGGGC CGGCGCACGG TGGAACCCTT CGGGATCTCG CTGGTGGAGG CCATGGCCTC GGGACTGCCG GTCATCGCCC CGGGCGGCGG GGGGCCGGAC GAGGTCGTCA CCGACGGGGT GACCGGGTGG TTGCTGCCGG ACCGGGGCGT GGCAGCCTAT ACCCGGGCGC TGGACCGCGC CTGGTCGGAC CGCGCCCGCT GGCCGGTGAT GGGTGGGCAG GCGCGCACCG CCGCCGCGCC GTTCAGCCTG GCGCACCAGG CCGGCCGCTA CCTGCAACTG GTGCGGGCGG TGCTGGACCC GGACGGCGAC GACCCGGCGC CCATGGTGAG GGCGCGTTGA
|
Protein sequence | MTARRVQLLK VVSGLDIGGV RSAELGFTAE LQARGVGVGG VIVTGGAMAA RYARLFDGHW LLDTRLPEFR GGRWERLRRT VVNLGRSRQG ARRLAGDLRG TRWASAGTVV AVRHAPLLPL AALLARRLDL PLLWHMPNPV HDRLGRAYFQ ALLRLAGGTA VGNSRYTLAS LGAGDGPVIY PGFSPARVAL DETAPELRAS LGIPAAAPVF AVVARLTPDK ATDWVLAAFL DSAAFRNGAH LLIAGGPLDS RFALDLRAHA GEAGDGRVHF LGEVEQVAPV YRAADVLVAG RRTVEPFGIS LVEAMASGLP VIAPGGGGPD EVVTDGVTGW LLPDRGVAAY TRALDRAWSD RARWPVMGGQ ARTAAAPFSL AHQAGRYLQL VRAVLDPDGD DPAPMVRAR
|
| |