Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0137 |
Symbol | |
ID | 4269830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 157284 |
End bp | 158519 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638124861 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_740982 |
Protein GI | 114319299 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0776455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATGA AAGTGCTCCA CATCCTCGAC CATTCCCTGC CCCTGCACAG CGGCTACACC TTCCGCACGG CCGCCATCCT GCGCGAGCAG CACCGGCTGG GCTGGGAGAC CGTGCACCTG ACCAGCCCGA AACACGGGGT GGCGGCCGGC GCGGACGCCG ACCGCGAGGA GTGGGCGGAG GGGTTGCACT TCCACCGCAC CCCCCACACC CCCCTGCGGG TCCCCGGACT GGGCGAGTGG ACCCTGATGG ATGCGCTCAC CCGACGCCTG CACCAGGTGG CCGCAGAGAC CCGACCGGAC GTCCTGCACG CCCACTCGCC GGCCCTGAAC GCCATCCCCG CCCTGCGGGT GGGCCGGCGG CTGGGCATCC CGGTGGTCTA CGAGGTGCGG GCCTTCTGGG AGGACGCCGC GGTGGACCAT GGCACCAGCC GCGATCAGGG GCTGCGTTAC CGGCTCACCC GCGGCCTGGA GACCCGCGCC CTGCGACGCG CCGACCATGT CACAACGATC TGCGAGGGGC TGCGCCAGGA CATCATCACC CGCGGCATCG CCCCGGGCCG GGTCACCGTC ATCCCCAACG CCGTGGACGC GGAGCGGTTC CAACTCGGCG GTACCGCCGA CCCGGCCCTG AAGGCGGAAC TGGGCCTGGA GGGCTGCCGG GTGCTCGGTT TCATCGGTTC CTTCTACGCC TACGAGGGGC TGGACCTGCT GCTGCAGGCC TTCCCACGCA TCCACGACCA GGCCCCCGAT GTCCGCATCC TGCTGGTGGG CGGGGGCAGC CAGGCGGAGG CGCTCAAGGC CCAGGCCCGG GACCTGGGCA TCGCCGACCA GGTGGTCTTC ACCGGCCGGG TCCCCCATGA CCAGGTCAAC CGCTACTATG ACCTGGTGGA CCTGCTGGTC TACCCGCGCC ATTCCATGCG CCTGACCGAG CTGGTCACCC CGCTCAAGCC GCTGGAGGCC ATGGCCCAGG GCCGGCTGCT GGTGGCCTCC GACGTGGGGG GCCACCGGGA GCTGATCCGC GATGGCGAGA CCGGCTGGCT GTTCCCGGCC GGCGACCCCA AGGCCCTGGC CGATACCGTC CTGCACACCC TCGCCCGCGC CGCGGACTGG CCGCAGGTGC GCGCCAATGG CCGCCGATTT GTCGAGGAGG AGCGCAACTG GCCGGCGAGC GTGGCCCGCT ATCAGGCCAT CTACCGCCGC CTGACAGGGC TCGGGGAGGC CGCCCGTGCC GGCTGA
|
Protein sequence | MPMKVLHILD HSLPLHSGYT FRTAAILREQ HRLGWETVHL TSPKHGVAAG ADADREEWAE GLHFHRTPHT PLRVPGLGEW TLMDALTRRL HQVAAETRPD VLHAHSPALN AIPALRVGRR LGIPVVYEVR AFWEDAAVDH GTSRDQGLRY RLTRGLETRA LRRADHVTTI CEGLRQDIIT RGIAPGRVTV IPNAVDAERF QLGGTADPAL KAELGLEGCR VLGFIGSFYA YEGLDLLLQA FPRIHDQAPD VRILLVGGGS QAEALKAQAR DLGIADQVVF TGRVPHDQVN RYYDLVDLLV YPRHSMRLTE LVTPLKPLEA MAQGRLLVAS DVGGHRELIR DGETGWLFPA GDPKALADTV LHTLARAADW PQVRANGRRF VEEERNWPAS VARYQAIYRR LTGLGEAARA G
|
| |