Gene Mlg_2369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2369 
Symbol 
ID4270708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2688076 
End bp2689122 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content71% 
IMG OID638127127 
ProductN-acetylneuraminate synthase 
Protein accessionYP_743199 
Protein GI114321516 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2089] Sialic acid synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0796873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0101527 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAGC TGCATATCGA CGGCCGGGCC ATCGGCCCGA ACCACCCTCC TTACGTCATC 
GCCGAGGCAG GCGTTCATCA CTACGACAGC CTGGAGCTGG CCCGGGCCTA TGTGCTGGAG
GCCCGCAAGG CGGGCGCCGA CGCGATCAAG TTCCAGACCT ATACGGCGGA CGAGCTTGTG
ACCCGGTGGG CCCCGCTCTA CTGGACCGAC GCACGATTCG AGACCCAGCA CCAGGTCTTC
AGCACCAAGC GCGGCTTGAG CCCGACGGAG TACCGGTCGC TGTTCGACTA CGCCCGCGAG
CTGGGCATCA CGCCATTGTC CACGCCCTTC GACCCCGCCT CGGTGGACCT GCTGGACGGC
CTGGGGATGG CCGCCTTCAA GGTGGCCTCG GCGGACATCA CCCACCGGCC GCTGCTCCAG
GATATCGCCG GCAAGGGCAA GCCGGTGCTG TTGTCCACCG GTGCGGCGTC GATGGCGGAG
GTGCGTTCGG CGCTGGAGGT GCTGGAGTCC GAAGGGGTTC CGGTGGCCCT GCTGCACTGC
TCGCTGGCCT ACCCCACCCC GGTGGACCAG GCGAACCTGT CGCGGCTGGG GCTGCTCGCC
GAACAGTTCC CCGGGCGGGT GCTGGGCTAT TCCGACCACA CGCCGCCCCG GGATTCGGCA
CTGCCCTGCC CGGCCAGCGT CCTGCTGGGC GCCCGGGTGA TCGAAAAGCA CTTCTCGCTG
AACCGGCACC TGGCCGGGGA CGATCACTAC CACAGCGTGG ACCCGGACGG TCTGGCGCGG
CTGGTGCGCG ACTGCCGGGA TGCCTGGGCG ATGAGCCGGC CAGCGGCAGA GATCACCGCC
GCCGAGGAGA GCGCCCGCAC CCAGGCCCGC CGCAGCGTGG TGGCCGCCAC CGATCTGCCC
GCGGGCACCA CGCTCGCAGC CGGGCACCTG GCCTACAAGC GGCCCGGCAC GGGGGTGCCC
CCGACGCAGG CGGAGGACCT GATCGGGCGA CGGCTGGCCG TGGACCTGGC CCACGATGAG
CTGATCACGC CGGACAAGCT GGCCTGA
 
Protein sequence
MTELHIDGRA IGPNHPPYVI AEAGVHHYDS LELARAYVLE ARKAGADAIK FQTYTADELV 
TRWAPLYWTD ARFETQHQVF STKRGLSPTE YRSLFDYARE LGITPLSTPF DPASVDLLDG
LGMAAFKVAS ADITHRPLLQ DIAGKGKPVL LSTGAASMAE VRSALEVLES EGVPVALLHC
SLAYPTPVDQ ANLSRLGLLA EQFPGRVLGY SDHTPPRDSA LPCPASVLLG ARVIEKHFSL
NRHLAGDDHY HSVDPDGLAR LVRDCRDAWA MSRPAAEITA AEESARTQAR RSVVAATDLP
AGTTLAAGHL AYKRPGTGVP PTQAEDLIGR RLAVDLAHDE LITPDKLA