Gene Emin_1087 details


Gene Information

Locus tag: Emin_1087
Symbol:
ID: 6263213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1178573
End bp: 1180411
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 36%
IMG OID: 642611567
Product: O-antigen polymerase
Protein accession: YP_001875976
Protein GI: 187251494
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
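
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1178573
end_bp = 1180411

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases. Assumption: the last codon is a stop codon
# and contributes no residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1839 0 612
```

Under these assumptions the result agrees with the listed gene length (1839 bp) and protein length (612 aa).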


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 95
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAG TAAAACAAAT ATTAAATTTT GTTTTTTACG CGGGCGCTTT TGTTATCGCC 
CCTTTGTTTT TCTTTACGGA TTTAACCCAA AATCCGTTTC AAATACAAAC AAATGTTTTG
ATGTTTTCTT TAACGGGGAT TTTTATAATT AACGCTAAAG ACTTTCTTGT AAATAAGCAA
GATAAAGCGT TTTTCTTTTT TATAGCCGTA TTGTTTTTAA CGTGGTTTAT TTCCTTGCTT
TTGGCTAAAA ATTATTATGA GACAATAAAA TACTCTATTC TTTCAAACGG TTTTATTTTG
TTTGTATGGG CGGCATCCTA CGTGGCGGGC AAGAGTATAA AAGAAAACGG CTCGAACTTA
AAATTAAAAA CTGTTATTAA CACTCTTCTT ATAACCGGGT TTATAGCCGC GTTTTACGGG
TTAGCGCAAA AAGCAGGGGG GGAGGTAATA TGGCCGGGTA ATATCCGCGC CGGGGTTATA
AGCACATTCG GAAACCCAAA CTTTTTATCT TCTTTTTTAG TTGTACTGTT TTTCCCCGCT
TTGTATTTGT TTTTAGAAAA TAATAAAAAA GCTTTTTACG GCGTTGTTTT GCTTGTTTAC
GCTTTGTTTA TTATATTATG CGGGGCAAGG TCCTCTTTAC TCGCTTTAGC CGGCGGTATG
GTTTTATTTC TGGTTTACGC GCCTTTCAGA AGCTATATAA AACAAAATAA AAAACAACTT
GGTATTTTTG CGCTTATTTT GGTTGTTATT TTGACTGCTT TCCCGGCGCA AAATAAATTT
TCCAAAATAA ATGAAGTTAA AGATATTTTA AAAACCGAAA GGCCCATGGT TCAAAGCTAT
GACCAGCGTA TAATGCTTTG GAAGGGCGCT TTTAAAATTT TTACCTCAAA CCCCGCGGCC
GGGGCGGGGT GGGGCAACTT TCAGCTTTTT TACGCCGTAA AGCAGGGGGA ACTCCTTGCC
CAAAAGCCTG ATTTATATGT ATTTAAAGTG CAGGGTAATG CGGCGCACAA TTTTATTTTT
CAACTGCTTG CCGAAAGCGG CGTTTTAGGC CTTGCAACTT TTATATTTTT TGTTGTTATC
TTTGGTAAAA GAAGTGTTAG TTACTTTACT AAAAAAACTA AAAATAGAGA TATGGTTTTT
GCTTTGCTTG TGTCTTTGGC GGCGATGTTT GCTGATAATA TGCTTAACAT TACGCTTTTT
ATAACAATGC CCGCTTTTTT ATTTTTCTTT ATCTTAGGCA TTCTGTCTTC TGAAATGGAG
GAAGGCAAAC CGGCCCCGGT TATTTGCTGT ATATTTATAT TTATTTTCAC CGCCGCTTTG
TTTTTTGACA TAAAAATATT TATATCATCC GTAAAAGAAC ATAAGGCGGT AAGGGTTTTT
AATAAAAACA ATTACGTCTT GGCAAAGGAA TATTTTACTT CCGCCCATAA CGCTTACGGC
GGTAATTATA ACGCCCTTCT TTTACGGGGT AAAATAAACG CCGTGTTTAA AGAAAATAAA
GCCGCTTTTG AGGATTTTGC CGCTGCTTCC GTCATAAACT CAGCTTATGA CGAGCTTTTT
TATAACGCGG CTTTGATGGC CTATTCTTTA GAAAAGTACC AAGACTCTTA CCAAAATACC
ATTGCGGCCA TTGAGCTTAA CCCCGTAAAA AGCGATTATT ATGTACTTTT ATTAAACATC
TTGCAGCGTG ATAAAAAAAC CGTAAACGCG GATTCTAAAA AAATATTTTT AACGCTTGAA
AAAATATTGA AAAACACTTC TGAAGAAAGT GAAAATAAAG AAATTATAAA AGCCGTTCTT
GCCGAAATAA AAAATAAACA AATATTTGAC AAAGCATAA
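
The GC content field can in principle be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch, not the database's own method; the helper name `gc_content` is hypothetical, and only the first sequence line is used as demonstration input, so the printed value is not expected to equal the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First line of the gene sequence above, used only as a small example input.
first_line = "ATGAATAAAG TAAAACAAAT ATTAAATTTT GTTTTTTACG CGGGCGCTTT TGTTATCGCC"
print(round(gc_content(first_line), 1))
```

To check the full gene, pass all sequence lines joined into one string.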
 
Protein sequence
MNKVKQILNF VFYAGAFVIA PLFFFTDLTQ NPFQIQTNVL MFSLTGIFII NAKDFLVNKQ 
DKAFFFFIAV LFLTWFISLL LAKNYYETIK YSILSNGFIL FVWAASYVAG KSIKENGSNL
KLKTVINTLL ITGFIAAFYG LAQKAGGEVI WPGNIRAGVI STFGNPNFLS SFLVVLFFPA
LYLFLENNKK AFYGVVLLVY ALFIILCGAR SSLLALAGGM VLFLVYAPFR SYIKQNKKQL
GIFALILVVI LTAFPAQNKF SKINEVKDIL KTERPMVQSY DQRIMLWKGA FKIFTSNPAA
GAGWGNFQLF YAVKQGELLA QKPDLYVFKV QGNAAHNFIF QLLAESGVLG LATFIFFVVI
FGKRSVSYFT KKTKNRDMVF ALLVSLAAMF ADNMLNITLF ITMPAFLFFF ILGILSSEME
EGKPAPVICC IFIFIFTAAL FFDIKIFISS VKEHKAVRVF NKNNYVLAKE YFTSAHNAYG
GNYNALLLRG KINAVFKENK AAFEDFAAAS VINSAYDELF YNAALMAYSL EKYQDSYQNT
IAAIELNPVK SDYYVLLLNI LQRDKKTVNA DSKKIFLTLE KILKNTSEES ENKEIIKAVL
AEIKNKQIFD KA
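
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first and last codons of the gene. It assumes the codon-to-residue assignments of NCBI translation table 11, which match the standard genetic code (the tables differ in permitted start codons, not in residue assignments). The names `CODON_TABLE` and `translate` are hypothetical, and input containing bases other than A, C, G, T would raise a `KeyError`.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (the usual NCBI table layout).
# Assumption: table 11 shares these assignments with the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
print(translate("ATGAATAAAG TAAAACAAAT ATTAAATTTT"))  # MNKVKQILNF
# Last 12 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GACAAAGCATAA"))  # DKA
```

Both outputs match the first and last residues of the protein sequence shown above.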