Gene Information |
Locus tag | Mlg_2342 |
Symbol | |
ID | 4269098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2653230 |
End bp | 2654645 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638127100 |
Product | glycosyl transferase family protein |
Protein accession | YP_743172 |
Protein GI | 114321489 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase |
TIGRFAM ID | |
|
|
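The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed 1416 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2653230, 2654645)
print(length)                           # 1416
print(expected_protein_length(length))  # 471
```

Both values agree with the Gene Length and Protein Length fields of this record.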
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTGG AGCTTGGTTT GCAAACGCTA ACAGATCCTG CCGTGACGGC GATCCTGACC TCCCTTCTCA TTGGTTGGGT GCTGGTCGCG GCGGAACCTG CGTTATCCCG TCTGACGCGA GACCGGAACG ACCTGGATGC CGTGCAGGCC TCGCACACCG GCGAGGTGCT GCGGTTGGGC GGGGTCGCGA TCTTTGGCGG GGTGCTGGCC GGCGCCCTGG TCCTGAGCGG GACGACCGAT ATCAGCTTCA CCATGCTCCT CCTGCTGACA GCCTTGCCGG TGCTGATGGC GGGGCTGGCG GAGGACTTGG GCTATCCTGT CTCACCCCGG GGCCGCTTGA TGGCCGCCGC CATATCGGCA GCGGCCTGCG TCCTGATCCT GGGCCTTTGG GTGCCGAGGG CCGACTTACC AGGCATTGAC CTGCTGATGA CCTTCATGCC CCTGGCGATC GTTCTCACGG TGCTGGGCGC GGCCGGTTTT TGCCACGCGG TCAATCTGAT TGACGGTATG AATGGCTTGG CGGCCTTCAC CGCACTCGTG GCGGCTGCCG GGCTAAGCGC GATCGCTTAC CAGGCTGGCG AGCCCGAGAT AAGCCTCTTT GCCATGCTGC TGGGGGCGGC TTGCCTGGGG TTCCTGGCGT GGAATTGGCC GCTGGGCAGG CTGTTCCTTG GGGATGCGGG CTCCTACGGC ATTGGGCACC TGCTGGCTTG GCTGGCCATC GCCCTGGTTA TGCTGGCCCC AGCGGTTGCG TTTGCTGCGG TGGTTCTCGT CCTTTTCTGG CCACTTGCAG ACACCCTGCA CACCATCCTC CGCAGGTTCC TGGCGCGGCA GCGCATTGCT GAACCCGACA AGATGCACCT GCACCAGAAG ATCCGGCGGT GCCTGGAGGT GGTCTGGTTC GGCTCCAACC GCCGTGAACT GACCAATCCA CTGGCGACGC TGGTGATGGC GCCGATAATT GCGCTACCGG TGGCCACCGG CGTTATACTC TGGAATCAGG CAGTGGCCGC GTACGTGGCG CTTGCCTTCT TCGCATTGGC CTTTGGCGGG CTGCATCTGA TGATCATGCG GCTTGCAACG CTCTACCGTC GTGCCAGGTG GCCCTTCAGC GCCCTCAACC GGAGACAGGA TGCGGTGGCT TCGCCCGACT CACTTAACCC TCCGCTGATC GCCGTGCGCG TCGACTCGGA CTATTCCGGA ATGTTCATTC AGGATGGTCT GGCCGTCGAC GTGCGAATCT TCCGGTACGC CAAGGACACC CACTGGACCC TGGAGACCTA TGATGGTGTA AACCCACCTG TTCAGTGGAG CCAGCAATTC GATACGGAAC GGGCCGCCTG GGACGCGTTC ATGCGGGCGG TTCGCGAAGA TACGATGGAC ACCCTGGCGA GGGGCTACCA GGTCCGGCCT CGCTAA
|
Protein sequence | MSLELGLQTL TDPAVTAILT SLLIGWVLVA AEPALSRLTR DRNDLDAVQA SHTGEVLRLG GVAIFGGVLA GALVLSGTTD ISFTMLLLLT ALPVLMAGLA EDLGYPVSPR GRLMAAAISA AACVLILGLW VPRADLPGID LLMTFMPLAI VLTVLGAAGF CHAVNLIDGM NGLAAFTALV AAAGLSAIAY QAGEPEISLF AMLLGAACLG FLAWNWPLGR LFLGDAGSYG IGHLLAWLAI ALVMLAPAVA FAAVVLVLFW PLADTLHTIL RRFLARQRIA EPDKMHLHQK IRRCLEVVWF GSNRRELTNP LATLVMAPII ALPVATGVIL WNQAVAAYVA LAFFALAFGG LHLMIMRLAT LYRRARWPFS ALNRRQDAVA SPDSLNPPLI AVRVDSDYSG MFIQDGLAVD VRIFRYAKDT HWTLETYDGV NPPVQWSQQF DTERAAWDAF MRAVREDTMD TLARGYQVRP R
|
| |
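The sequence fields above are printed in space-separated blocks of 10 characters. A minimal sketch of how they might be checked against the other fields: strip the spacing, compute the GC fraction, and translate the gene sequence. It assumes the gene sequence is already in coding orientation (as shown, it begins with ATG and ends with TAA, even though the gene is on the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. All helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
head = clean("ATGTCGCTGG AGCTTGGTTT GCAAACGCTA")
print(translate(head))  # MSLELGLQTL
```

Running `translate(clean(...))` on the full Gene sequence field should reproduce the Protein sequence field, and `gc_fraction` on the same string can be compared with the listed GC content of 64%.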