Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0119 |
Symbol | |
ID | 4268205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 131646 |
End bp | 133046 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638124843 |
Product | O-antigen polymerase |
Protein accession | YP_740964 |
Protein GI | 114319281 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | [TIGR03097] probable O-glycosylation ligase, exosortase system type 1-associated |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGACA TCCTCCTTGC ACTTATATTC GCTGGCTTAG TGCCTGTAAT CCTGCGTCAT GCCTGGGTGG GGATACCCAC GCTGTTCTGG GTGAGCTTAT TTACGCCGCA GCTCCAGACC TGGACCTTTA TGCACGGTTT TCCGGCGGCG ATGCTGTTCA CAGTAGTCAC CCTGGCTGCC ATCGCGTTTT CTAAGGATCG AAAACCGTTT CCTTGGACGA GGGAAACGGT CATGTTGCTG ATTCTCGCAG CATACTTCGC AATGACCAGC CACTTCGCCG TGAACTCCAG TGGCGCGTGG GATTTCTGGG TGCACTTTAT GAAGATCCTG TTGGTGACCT TCCTCACGCC GATACTGATC CACGGTGAGC GACGTATCCT GATCACGCTC TTGGTGATCA CCGGGGCGCT GGCGTTCTTT GGCCTGAAGG GAGGGATCTT CGCGGTTAAC ACCGGAGGGG CCCACATGGT CCTTGGCCCA TCGGGCTCCT ACCTCTCCGG GAATACCTAT ATCGGGCTGG CGATGCTCAT GGTATTGCCG CTTATTCTAA CCAGCGCGCG ACTGTTCCAT CGACAATGGG TGGATTTTAG CATTCCACTT ATCAATCGGT TTGCTGTCCC CATCGGTTGG GCCGGGTACG CTGTGTTTTG GTTCACCTGT GCAGCCATAC TGGCGACCCA CTCCCGGGGG GCCTTCGTCG GAATGGTGGT GATCACGCCA TTTTTATTCT TGCACATGAG AAAAAAGTGG CTTTTGGTGC TGGTAGCCTT TATCGGAGTT GGCGTGATTG GAGTGACGGC GCCGGAGCCG TTGGTCGAGC GCTGGCAAAC GATAAAGACG TACGAAGAAG ACCAATCGGC TATGCAGCGA ATCCAGAGTT GGGGTGTGGC GTGGAATATG GCCATGGAGC GACCCCTGAC GGGTATGGGG TTCAGAAATA CCGCTTTGGG TTACGATTGG TGGATTACCT ATGCGGAGTT TGAAGGCGGC TGGCGGCATG TTCTGTCACC GCACAGCATT TATTTCGGGT TGCTGGGACA GCACGGATTT GGCGGGCTGG CCGTGTATCT TTTTCTCGGA GCGTTTACAT TTTTGACGTT GAATCGGGTG CGGCGGACCG CGAAGCGAAG AACTGGGAAG ATCTGGTTGT CGGAGTATGC ATGGGCCTTG CAGGTTGGCT TGGCCGGATA TTTCATTGCA GGGATATTTT TGGACGTGGC CTACTTTAAC CTCTATTATG CGTTCATAGC CATGTCAGTG ATTATGCGGC GTGAGCTCGA GGAGGCGCCC AAACCGGCGG AAGCGACCGT GCCGACGCCA GCTCAGCCCT TGCCCCAGGA TACTCGACCA AGCCTCTCCA ATCCTTCCCT GGGGGCCAGA TCGGGGGTGG AGCGCAGGTG A
|
Protein sequence | MRDILLALIF AGLVPVILRH AWVGIPTLFW VSLFTPQLQT WTFMHGFPAA MLFTVVTLAA IAFSKDRKPF PWTRETVMLL ILAAYFAMTS HFAVNSSGAW DFWVHFMKIL LVTFLTPILI HGERRILITL LVITGALAFF GLKGGIFAVN TGGAHMVLGP SGSYLSGNTY IGLAMLMVLP LILTSARLFH RQWVDFSIPL INRFAVPIGW AGYAVFWFTC AAILATHSRG AFVGMVVITP FLFLHMRKKW LLVLVAFIGV GVIGVTAPEP LVERWQTIKT YEEDQSAMQR IQSWGVAWNM AMERPLTGMG FRNTALGYDW WITYAEFEGG WRHVLSPHSI YFGLLGQHGF GGLAVYLFLG AFTFLTLNRV RRTAKRRTGK IWLSEYAWAL QVGLAGYFIA GIFLDVAYFN LYYAFIAMSV IMRRELEEAP KPAEATVPTP AQPLPQDTRP SLSNPSLGAR SGVERR
|
| |