Gene Information |
Locus tag | Mlg_0163 |
Symbol | |
ID | 4270724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 192774 |
End bp | 193769 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638124887 |
Product | UDP-galactose 4-epimerase |
Protein accession | YP_741008 |
Protein GI | 114319325 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
|
|
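The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene span includes the stop codon; the record does not state either convention, but both match the reported numbers. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop_codon else codons


# Values copied from the record for Mlg_0163.
length = gene_length(192774, 193769)
print(length)                  # 996
print(protein_length(length))  # 331
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields.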
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.507846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGTAC TGGTAACCGG CGGCGCCGGA TACATCGGCA GCCATGTGGT GCGCCAACTG CTGGCAGCGG GCCACGAGCC GGTGGTGTAC GACAATCTGA GCACCGGCTT TGCCTGGGCG GTGGGCGAGG CGCCGCTGGT GCGGGCCGAC CTGGCGGATA CGGCGCAGTT GGCGGAGACC CTGGCCCGCC ACGCCTTCGA TGCCGTGCTC CATTTTGCCG CCCATACCGT GGTGCCGGAG TCGCTGAGCG ATCCGCTTCG CTATTACGGC AACAACACCC GGAACACCTG GCAACTGCTC CAGGCCTGCC ACGAGGCTGG GGTGAAGCGG TTCGTCTTCT CCTCCACCGC GGCGGTCTAC GGCATGCCCG AGACCATGCC GGTGGCCGAG GACGCCCCGT TGGCGCCCAT CAACCCCTAT GGCGCCTCGA AGATGATGTC TGAGCGCATG CTCATGGACC TGGGGGCGGC CAGCGACCTG CGCTATGTGT GCCTGCGCTA TTTCAACGTG GCCGGGGCCG AGCCGTCGGG GCGACTGGGG CAGGCGACGC CCCAGGCCAC CCACCTGATC AAAGTGGCCT GCGAGGCCGC TGTCGGGCGG CGCGACGGGG TCACCGTCTT CGGCACCGAC TACGCCACGC CGGACGGTAC CTGTGTGCGG GACTTCATCC ACGTGGAGGA CCTGGCCCGG GCCCATATCC AGGCCCTGTC CCACCTAGTG GATGGCGGCG ACTCGCAAGT GTTGAACTGC GGCTATGGTC GTGGCTACAG CGTGCTGGAG GTGCTGGAGG CCGTGAAGCG GCTGAGTGGC GCGGACTTTC CCGTCACCCT GGGGACGCGG CGTGCCGGCG ATCCGGCGCA GGTGGTGGCG GACAACCGGC GCATCCTGCG CACTCTGGAC TGGAGCCCGC GCTACGCTGA CCTGGACACC ATCGTCGCCC ACGCCCTGGC CTGGGAGCGC GACGGGTTGC TGGCACGCCG CGGGCCCCGA CGGTAA
|
Protein sequence | MRVLVTGGAG YIGSHVVRQL LAAGHEPVVY DNLSTGFAWA VGEAPLVRAD LADTAQLAET LARHAFDAVL HFAAHTVVPE SLSDPLRYYG NNTRNTWQLL QACHEAGVKR FVFSSTAAVY GMPETMPVAE DAPLAPINPY GASKMMSERM LMDLGAASDL RYVCLRYFNV AGAEPSGRLG QATPQATHLI KVACEAAVGR RDGVTVFGTD YATPDGTCVR DFIHVEDLAR AHIQALSHLV DGGDSQVLNC GYGRGYSVLE VLEAVKRLSG ADFPVTLGTR RAGDPAQVVA DNRRILRTLD WSPRYADLDT IVAHALAWER DGLLARRGPR R
|
| |
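The sequence fields are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence and how the end of the coding sequence could be sanity-checked. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one assumed for translation table 11.

```python
# Stop codons assumed for translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Usage on a fragment only: the first two blocks of the gene sequence above.
fragment = "ATGCGGGTAC TGGTAACCGG"
print(clean_sequence(fragment))
print(round(gc_percent(fragment), 1))
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the GC content field, and `ends_with_stop` confirms whether the span includes a terminal stop codon.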