Gene Mlg_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2800 
Symbol 
ID4269143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3182914 
End bp3184182 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content63% 
IMG OID638127562 
ProductO-antigen polymerase 
Protein accessionYP_743630 
Protein GI114321947 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTTC TGAGCCTGCC CGTCCATCGC CTCCAGGTTT TCTTTGACGG TCAGCGGTGG 
CGCCGGCTTG CGAAGGAGAC CCATTGGGTC TCCTGGGGGC TCGCTCTGTT CTGGGCAGGG
TTTATTTTTG CCTGGAGCAC CCGGGTGCAC CGGGACAGCC TGTATTTTCT CGTCTTTCTG
CCCTTCCTTA TGGTCCTGGG CCGGCAGCAG TTTCAGTGGC TGCTCAGCAG CCGGGTTATC
CAGTGTCTGG GGTTGTTCCT GGGCTATCTG TTGTTATCGG TGAGCTGGAG TCCGGATGCG
TCGTGGGCGC TCTTTGGTAG CAAGTTGCGC TACGGGCTCA TTATTTTCGC AGCGGTCCTG
GCCACGGCTT ACATGGTGGC CCGCGATGAG CAATGGGCGG AGCGCCTTTT CTGGTTTTTA
GGCCTGGCTG CCTGCATCGT CTTTTTCTAC TCTGTTTACC ATTATTATCA GGCGCATCCG
TTTCCCGCTG CGCGGCTGTC GAACCTCGTC TATTACCACA CCAATCCGAA CCCGGACGCG
GTTGGATTCC TGCTGGCGTT CACCTTCGCC CTGTGCTTTG TGTTGTCGGA CCGATCACCC
CGCTGGCGGG TTGTCGCCGC GGTGCCTCTG ATGTGTGCCG GGGCGTTCCT GCTGTTGGCC
CAGAGCCGCG GGCTGATTCT GGGGGCCGCG TTGGTGTCAG TCTTCCTGCT CTTGCGCCTC
CGTTACTGGA AAACCCTGGC GCTTTTCCTG GTGGTCGCCT CGGCCGCTCT GGTGGCGGTC
GAGACGGTGG AATGGGGAGG GCGCGGGTTG ATCGAACGCG CGGATGCCCA ACGTATCGGG
ATCTGGCAGG TCGCCCTGGC GCGCATCGCC GAGGCGCCCT GGTTCGGGGC GGGCCTGGCC
AGTGACACCT CCATCACCTA TGGTGACCGA ATCCATATCA GCCCGCACAA TCTGTGGTTG
ATGACCCTAA TGGCGGGTGG AGTACTGGGG GGTGCCCTGC TGGTGGCCCT GTATGGGTTG
GCGCTCCGGA TCGCGTACTG GGACCGCGAT GACCGCTCGC CTGGTGCCAT CCTGGCTGTT
GCCCTGCTGG TGGCGGGCCT GGTGCCGCTG GGCGTCGATG GCCACCAGAT CATTACCCGT
ATCCATCCGC ATATCTGGGT GGCCTTGTGG TTGCCCATGG GTGCGCTCGC CGGTCGGGAA
CTTCTGCAGC GGCAAAGGGG GCAGACGGAA GGCCCGGGTG CCAGGGTGAG TCGATCCCGC
TCTGGATGA
 
Protein sequence
MAVLSLPVHR LQVFFDGQRW RRLAKETHWV SWGLALFWAG FIFAWSTRVH RDSLYFLVFL 
PFLMVLGRQQ FQWLLSSRVI QCLGLFLGYL LLSVSWSPDA SWALFGSKLR YGLIIFAAVL
ATAYMVARDE QWAERLFWFL GLAACIVFFY SVYHYYQAHP FPAARLSNLV YYHTNPNPDA
VGFLLAFTFA LCFVLSDRSP RWRVVAAVPL MCAGAFLLLA QSRGLILGAA LVSVFLLLRL
RYWKTLALFL VVASAALVAV ETVEWGGRGL IERADAQRIG IWQVALARIA EAPWFGAGLA
SDTSITYGDR IHISPHNLWL MTLMAGGVLG GALLVALYGL ALRIAYWDRD DRSPGAILAV
ALLVAGLVPL GVDGHQIITR IHPHIWVALW LPMGALAGRE LLQRQRGQTE GPGARVSRSR
SG