Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0139 |
Symbol | |
ID | 4269832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 160425 |
End bp | 161633 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638124863 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_740984 |
Protein GI | 114319301 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03088] sugar transferase, PEP-CTERM/EpsH1 system associated |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.211857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG CCCGCCCCCT GGCAACGGCG GCCCCGGGCG ACGAGCGCCC GCTGGTGGCC CATATCATCC ACCGCCTGGA CGTGGGCGGC ATGGAGAACG GCCTGGTCAA CCTGATCAAC CACATGCCGG CCGAGCGCTA CCGCCACGCC ATCGTCTGCA TGACCCGGTA CACCGACTTC AGCCAGCGCA TCCACCGCGA TGATGTGAGC CTGCACGCCC TGCACAAGCG CGAGGGCAAG GACCTGGGGG TGCATCGGCG CCTGCACCGG CTGCTGCGGT CGTTGCGCCC GGCCATCGTC CACACCCGCA ACCTCGCCAC CCTGGAGGCC CAGGCCACCG CCGCGGCGGC CGGCGTGCGG GCACGCATCC ACGGTGAGCA CGGCTGGGAT ATCGGCGATC TCGACGGCGC CCGCACCAAA CACCGCCTGA TGCGCCGCCT GGCCCGACCG TTGGTGGGGC GCTATATCGC CCTGTCGCGC CAGCAGCTGG ACTACCTGGC CGGTGCCATC GGCGTGCCGG AGGGGCGGTT GCACCACGTC TGCAACGGTG TGGACACCCA CCGCTTCAGG CCCCGCCGTC GGGACGAGGC CTCGCCACTG CCGGACGGCT TCGCGCCGGA GGGCAGCCTG GTGGTGGGCA GCGTGATGCG CATGCAGGCG GTCAAGGCCC CGGAGGATCT CGTTGATGCC TTCATCGCGC TGCGCGAACG GGCACCCGCC CGCTTCCCCC GCCTGCGGCT GGTGCTGGTG GGCGACGGCC CCCTGAGCGA GCGCGTCGCC CGGCGGCTGG CGGAGGCCGG GGTGGCGGAT CAGGCCTGGC TGCCCGGCGC CCGGGACGAT GTGGCGGCGG TGATGCGCGC CCTGGACCTG TTCGTGTTGC CGTCACTCGC CGAGGGCATC TGCAACACCG TCCTGGAGGC CATGGCCTGC GGGCTGCCAG TGGTCGCCAC CGAGGTGGGC GGCAACCCGG ACCTGGTGCG GCCCGGCGAG ACCGGCACGC TGGTCCCGGC AGGCGATCCG TCAACCCTCG CCCGGCACCT CCAGGCCTAC CTGGACGACC CGGAACGGCG GCAGCGCGAG GGCGAGGCCG CGCGGGCCCG GGCGGAGGCG GTATTCAGCA TGGAGGCCAT GGTGGAGGGC TACATGAGGG TCTACGATCA GGCGCTGGCC GAACACCCCT TGCCGGCCGT GCCGGGGAGG CGGGGCTAG
|
Protein sequence | MSAARPLATA APGDERPLVA HIIHRLDVGG MENGLVNLIN HMPAERYRHA IVCMTRYTDF SQRIHRDDVS LHALHKREGK DLGVHRRLHR LLRSLRPAIV HTRNLATLEA QATAAAAGVR ARIHGEHGWD IGDLDGARTK HRLMRRLARP LVGRYIALSR QQLDYLAGAI GVPEGRLHHV CNGVDTHRFR PRRRDEASPL PDGFAPEGSL VVGSVMRMQA VKAPEDLVDA FIALRERAPA RFPRLRLVLV GDGPLSERVA RRLAEAGVAD QAWLPGARDD VAAVMRALDL FVLPSLAEGI CNTVLEAMAC GLPVVATEVG GNPDLVRPGE TGTLVPAGDP STLARHLQAY LDDPERRQRE GEAARARAEA VFSMEAMVEG YMRVYDQALA EHPLPAVPGR RG
|
| |