Gene Mlg_2368 details


Gene Information

Locus tag: Mlg_2368
Symbol:
ID: 4270707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2687042
End bp: 2688079
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 74%
IMG OID: 638127126
Product: cytidine 5'monophosphate N-acetylneuraminic acid synthetase
Protein accession: YP_743198
Protein GI: 114321515
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase
TIGRFAM ID:
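
The start, end, gene length and protein length values in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2687042
end_bp = 2688079
gene_length_bp = 1038
protein_length_aa = 345

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1038 0 345
```

Under those assumptions the span matches the listed gene length, is a whole number of codons, and yields the listed protein length.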


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.275161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.00639534
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTGAGC CCTGGCCGAA CAGCGGGGGT TGGCTGATCC GGGCGGATGC CGCCCCGGCC 
ATCGGCATCG GCCATGTCAA CCGCGCCCTG GCCCTGGCCC GGGCGCTGGC CCCCCGGCCG
GTTCTGATCG CCACCCGCCG CGATGGGGAC TACCGCCTGG GCTACCAGTG GCTGGCGGAG
TCAGGCCGGC CCCTGTGCCC ACTGGACGGC GAGGCGGACT TCCTCGACCT GCTGCGACGT
TGCCGGCCCG AGACCCTGTG CCTGGATATC CTCGACACCG GGCGCCGGGA GATGGGGGTC
TACCGTACCC TGGCACGGCG GGTGGTGAGT TTCGAGGACC TGGGGCCCGG GGCGGCATTG
GCCGATGTGG TGATCAACGA CCTCTACGGC CCCGCGCCGG GGCAGGCCCA CGTGCTCGCC
GGGGTGGAAC ACGCCCTGCT GTCACCGGCC TTTGACGACG CCTCGCCCGC CCCCGGGGCC
ACCCCGGAAC GGGCGGAACG GTTGCTGCTG CTGTTCGGGG GCACCGACCC CGCCGGTCTG
GTCCACCGCT GCCTGGACGC CCTGGGCCGG CTGGCGCTTC CGGTTCGGGT GGAGGTGGTG
GTGGGGCCGG GCTGGCGCCG GCGGCGGATC CGGTTGGCGG ACTGGGGCCT GTGTGGCCGT
GTCCACCGGG ACGTGCAGGA CATGCCGGCG GTGATGCGAA ACGCCGACCT GGCGCTCTCC
AGCGCCGGGC GCACGGTCAC CGAGCTGATG GTGATGCGGG TGCCCACCCT GGTGCTCTGC
CAGAATGAGC GCGAGTTGCG CCATACCCAC GCCAGCGCCC GCCACGGGGT CTGCAACCTG
GGCCTGGGCC GGGCGGTGCC GGTGGACCGG CTGGCGCGGG AGATCGCGGC GCTGGTCGCG
GACCGGGCGC GGCGCGAGCA GATGCGGGCC CTGGCGGACC GCGCCGTTCG CGGGCGCAGC
AACCGTGCCA TTGTGGCGCG AATCGACGGC CTGCTGCAGG GCCGGTCACG ACGGCTGGAC
ACGGGAGTGA CGCCATGA
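
A GC percentage like the one in the Gene Information fields can be recomputed from a nucleotide listing such as the one above. The following is a minimal sketch; `gc_content` is a hypothetical helper, not part of any database tooling, and how the source database rounds its value is not known. Only the first 60 bases are used here to keep the example short; the full listing would be pasted in the same way.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove whitespace so the 10-base groups of the listing can be pasted as-is.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence as listed above (60 bases).
first_line = "ATGGCTGAGC CCTGGCCGAA CAGCGGGGGT TGGCTGATCC GGGCGGATGC CGCCCCGGCC"
print(f"{gc_content(first_line):.1f}% GC over the first 60 bases")
```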
 
Protein sequence
MAEPWPNSGG WLIRADAAPA IGIGHVNRAL ALARALAPRP VLIATRRDGD YRLGYQWLAE 
SGRPLCPLDG EADFLDLLRR CRPETLCLDI LDTGRREMGV YRTLARRVVS FEDLGPGAAL
ADVVINDLYG PAPGQAHVLA GVEHALLSPA FDDASPAPGA TPERAERLLL LFGGTDPAGL
VHRCLDALGR LALPVRVEVV VGPGWRRRRI RLADWGLCGR VHRDVQDMPA VMRNADLALS
SAGRTVTELM VMRVPTLVLC QNERELRHTH ASARHGVCNL GLGRAVPVDR LAREIAALVA
DRARREQMRA LADRAVRGRS NRAIVARIDG LLQGRSRRLD TGVTP
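
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the table is the standard genetic code, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons). The sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 0, stopping at the first stop codon."""
    # Remove whitespace so the 10-base groups of the listing can be pasted as-is.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence listed above.
print(translate("ATGGCTGAGC CCTGGCCGAA CAGCGGGGGT"))  # MAEPWPNSGG
print(translate("ACGGGAGTGA CGCCATGA"))               # TGVTP
```

The two outputs match the first ten and last five residues of the protein sequence above.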