Gene Mlg_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0119 
Symbol 
ID4268205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp131646 
End bp133046 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content56% 
IMG OID638124843 
ProductO-antigen polymerase 
Protein accessionYP_740964 
Protein GI114319281 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID[TIGR03097] probable O-glycosylation ligase, exosortase system type 1-associated 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGACA TCCTCCTTGC ACTTATATTC GCTGGCTTAG TGCCTGTAAT CCTGCGTCAT 
GCCTGGGTGG GGATACCCAC GCTGTTCTGG GTGAGCTTAT TTACGCCGCA GCTCCAGACC
TGGACCTTTA TGCACGGTTT TCCGGCGGCG ATGCTGTTCA CAGTAGTCAC CCTGGCTGCC
ATCGCGTTTT CTAAGGATCG AAAACCGTTT CCTTGGACGA GGGAAACGGT CATGTTGCTG
ATTCTCGCAG CATACTTCGC AATGACCAGC CACTTCGCCG TGAACTCCAG TGGCGCGTGG
GATTTCTGGG TGCACTTTAT GAAGATCCTG TTGGTGACCT TCCTCACGCC GATACTGATC
CACGGTGAGC GACGTATCCT GATCACGCTC TTGGTGATCA CCGGGGCGCT GGCGTTCTTT
GGCCTGAAGG GAGGGATCTT CGCGGTTAAC ACCGGAGGGG CCCACATGGT CCTTGGCCCA
TCGGGCTCCT ACCTCTCCGG GAATACCTAT ATCGGGCTGG CGATGCTCAT GGTATTGCCG
CTTATTCTAA CCAGCGCGCG ACTGTTCCAT CGACAATGGG TGGATTTTAG CATTCCACTT
ATCAATCGGT TTGCTGTCCC CATCGGTTGG GCCGGGTACG CTGTGTTTTG GTTCACCTGT
GCAGCCATAC TGGCGACCCA CTCCCGGGGG GCCTTCGTCG GAATGGTGGT GATCACGCCA
TTTTTATTCT TGCACATGAG AAAAAAGTGG CTTTTGGTGC TGGTAGCCTT TATCGGAGTT
GGCGTGATTG GAGTGACGGC GCCGGAGCCG TTGGTCGAGC GCTGGCAAAC GATAAAGACG
TACGAAGAAG ACCAATCGGC TATGCAGCGA ATCCAGAGTT GGGGTGTGGC GTGGAATATG
GCCATGGAGC GACCCCTGAC GGGTATGGGG TTCAGAAATA CCGCTTTGGG TTACGATTGG
TGGATTACCT ATGCGGAGTT TGAAGGCGGC TGGCGGCATG TTCTGTCACC GCACAGCATT
TATTTCGGGT TGCTGGGACA GCACGGATTT GGCGGGCTGG CCGTGTATCT TTTTCTCGGA
GCGTTTACAT TTTTGACGTT GAATCGGGTG CGGCGGACCG CGAAGCGAAG AACTGGGAAG
ATCTGGTTGT CGGAGTATGC ATGGGCCTTG CAGGTTGGCT TGGCCGGATA TTTCATTGCA
GGGATATTTT TGGACGTGGC CTACTTTAAC CTCTATTATG CGTTCATAGC CATGTCAGTG
ATTATGCGGC GTGAGCTCGA GGAGGCGCCC AAACCGGCGG AAGCGACCGT GCCGACGCCA
GCTCAGCCCT TGCCCCAGGA TACTCGACCA AGCCTCTCCA ATCCTTCCCT GGGGGCCAGA
TCGGGGGTGG AGCGCAGGTG A
 
Protein sequence
MRDILLALIF AGLVPVILRH AWVGIPTLFW VSLFTPQLQT WTFMHGFPAA MLFTVVTLAA 
IAFSKDRKPF PWTRETVMLL ILAAYFAMTS HFAVNSSGAW DFWVHFMKIL LVTFLTPILI
HGERRILITL LVITGALAFF GLKGGIFAVN TGGAHMVLGP SGSYLSGNTY IGLAMLMVLP
LILTSARLFH RQWVDFSIPL INRFAVPIGW AGYAVFWFTC AAILATHSRG AFVGMVVITP
FLFLHMRKKW LLVLVAFIGV GVIGVTAPEP LVERWQTIKT YEEDQSAMQR IQSWGVAWNM
AMERPLTGMG FRNTALGYDW WITYAEFEGG WRHVLSPHSI YFGLLGQHGF GGLAVYLFLG
AFTFLTLNRV RRTAKRRTGK IWLSEYAWAL QVGLAGYFIA GIFLDVAYFN LYYAFIAMSV
IMRRELEEAP KPAEATVPTP AQPLPQDTRP SLSNPSLGAR SGVERR