Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2368 |
Symbol | |
ID | 4270707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2687042 |
End bp | 2688079 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 638127126 |
Product | cytidine 5'monophosphate N-acetylneuraminic acid synthetase |
Protein accession | YP_743198 |
Protein GI | 114321515 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.275161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.00639534 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGAGC CCTGGCCGAA CAGCGGGGGT TGGCTGATCC GGGCGGATGC CGCCCCGGCC ATCGGCATCG GCCATGTCAA CCGCGCCCTG GCCCTGGCCC GGGCGCTGGC CCCCCGGCCG GTTCTGATCG CCACCCGCCG CGATGGGGAC TACCGCCTGG GCTACCAGTG GCTGGCGGAG TCAGGCCGGC CCCTGTGCCC ACTGGACGGC GAGGCGGACT TCCTCGACCT GCTGCGACGT TGCCGGCCCG AGACCCTGTG CCTGGATATC CTCGACACCG GGCGCCGGGA GATGGGGGTC TACCGTACCC TGGCACGGCG GGTGGTGAGT TTCGAGGACC TGGGGCCCGG GGCGGCATTG GCCGATGTGG TGATCAACGA CCTCTACGGC CCCGCGCCGG GGCAGGCCCA CGTGCTCGCC GGGGTGGAAC ACGCCCTGCT GTCACCGGCC TTTGACGACG CCTCGCCCGC CCCCGGGGCC ACCCCGGAAC GGGCGGAACG GTTGCTGCTG CTGTTCGGGG GCACCGACCC CGCCGGTCTG GTCCACCGCT GCCTGGACGC CCTGGGCCGG CTGGCGCTTC CGGTTCGGGT GGAGGTGGTG GTGGGGCCGG GCTGGCGCCG GCGGCGGATC CGGTTGGCGG ACTGGGGCCT GTGTGGCCGT GTCCACCGGG ACGTGCAGGA CATGCCGGCG GTGATGCGAA ACGCCGACCT GGCGCTCTCC AGCGCCGGGC GCACGGTCAC CGAGCTGATG GTGATGCGGG TGCCCACCCT GGTGCTCTGC CAGAATGAGC GCGAGTTGCG CCATACCCAC GCCAGCGCCC GCCACGGGGT CTGCAACCTG GGCCTGGGCC GGGCGGTGCC GGTGGACCGG CTGGCGCGGG AGATCGCGGC GCTGGTCGCG GACCGGGCGC GGCGCGAGCA GATGCGGGCC CTGGCGGACC GCGCCGTTCG CGGGCGCAGC AACCGTGCCA TTGTGGCGCG AATCGACGGC CTGCTGCAGG GCCGGTCACG ACGGCTGGAC ACGGGAGTGA CGCCATGA
|
Protein sequence | MAEPWPNSGG WLIRADAAPA IGIGHVNRAL ALARALAPRP VLIATRRDGD YRLGYQWLAE SGRPLCPLDG EADFLDLLRR CRPETLCLDI LDTGRREMGV YRTLARRVVS FEDLGPGAAL ADVVINDLYG PAPGQAHVLA GVEHALLSPA FDDASPAPGA TPERAERLLL LFGGTDPAGL VHRCLDALGR LALPVRVEVV VGPGWRRRRI RLADWGLCGR VHRDVQDMPA VMRNADLALS SAGRTVTELM VMRVPTLVLC QNERELRHTH ASARHGVCNL GLGRAVPVDR LAREIAALVA DRARREQMRA LADRAVRGRS NRAIVARIDG LLQGRSRRLD TGVTP
|
| |