Gene Emin_1088 details


Gene Information

Locus tag: Emin_1088
Symbol:
ID: 6263588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1180404
End bp: 1182617
Gene Length: 2214 bp
Protein Length: 737 aa
Translation table: 11
GC content: 37%
IMG OID: 642611568
Product: O-antigen polymerase
Protein accession: YP_001875977
Protein GI: 187251495
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
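
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1180404
END_BP = 1182617
GENE_LENGTH_BP = 2214
PROTEIN_LENGTH_AA = 737


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def encoded_residues(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert encoded_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```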


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 98
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAA AAAATAAAAC AACAGTAGTG CCGCAAAACA CAGGCGCCGC TTTTTTGCAA 
AAGGCGTGCG GGCATGTTAT CGGCTGGGCT GTTTTTTTTA TTTCGCTAAG CGTTTATGTA
AGAACTTATG ACATTGCCGC TGTTAAAATA TCTTTATTTT TTTGCGCTTT GGCCGTTATG
TTTAGCGTAT GGGCAAGTTT TAAATCTTTA TCTGAAAAAC CTTTTAACAA GCAGCTTTTT
TTAAGTTTTT TACCTTTTAT TTTCTTTGCT TTATGGCTGT TTATCTCTTT TATTATTAAT
CCCTATAAAA TAAGTTCTTT AGAAGATTTT TTGAAGCAAT TTCTTTATAT TTTTATTCCT
TTTTTTATAG CCACATCATT TAGTTTAAAA GAAGTTGGCA CGGTAGTAAA ATTTTTAACG
GCTTCCGCGG TAATTTGCTT TGCTTACGGA TTTTTACAGA TAACAGGCCT AGATATTATG
CCTTGGGCGG ATTTTTTTGG CAAAAGAATA TTTTCCGCCT TAGGCAACCC CAACATGTTT
GCCGATTTTG TTATTTTTAT GAATTTTATA GTCTTGGCGC TTTATCTTAG AAAGTTTGAA
AAAAAATATT TACTTTTGTT TGCTGCGGGG CTTGTTAACT TATATTTTAC GGAGTCCAAG
GGCGCGTGGC TCAGTTTTGG CGTGACTTTG GTTTTTTTCG TGTTTTTATA TGTAAAATAT
TTCCCGTCAA ATTTTATTAA AAAGCATTCA AAGTTTATAT GGGTTGTTGG GGTTGTGTTG
GCGCTGGCGT CGGTAATTTT GGTTGTTGTG TTCGCTTCTA AAAGAATGCA GTCCGTGGAT
TTTAGGGCAA TTACATGGCG CGGCATAGTT CAAATGTCTG CGGATAAAGC TTTTACGGGC
TATGGTACCG GCAGCTTTGC GACAGTGTAT CCGACGTACC GCCGGGCCGA AATATTTTAT
ATTGAGAAAA TACATAATAA TGAAACCCAG CACGCCGAAA ATGAATTTTT GGAAGTTCTT
ACAGACAACG GAATAATAGG GCTTACTCTT TTTTTATGGC TTTTATATTT TGTTTTTTAT
TTAGCTTTTA AAAAATTTAA GGAATTGCGC CTTGACCCTA AATCCCAAGG GCCGCCGGCG
TATTATTTGC TTGCTTTTGT TTCGGCCGCC GCGGCGATTT TAATACACAG TATTTTTGAC
GTAAGCATGC GTTTTGTATC CACGGGGTTT TTGTTTTGGG TTTTTATAGG CCTGATACTT
GTGTTAAGCG GCTATGAAAC GGTTGTCCGC CCTAAAGAGT TGGCGGGTAA GGGTAAATGG
CTGACTGTGG CAGGGGCGGC GGTTTGGCTT ATTTCAGCAG TTGCTTTGTG CTTTTTGGTA
TATATCTTTT CTGAAGTAAT AGGCAGGGAA AGCATGCTCA ATACAGGGCG CTTATTACTT
AAAATACTTG CCTGGGCGGG TGTGACTACT TTATGTGGGG CGCTTTTATA TTTATATTAC
AGGATTTTAA AAATCAGGCA AAGTGTTTTA CCTTGCCTTT TTATACTTAT AACACTGCCT
TTCTTTTTTG CGGCCGAACG TTTCGTAAGG GCTGATTATT ACACTAATTT GGGCAATTAT
TACGCCGGGC TAAATAATTG GAGCACGTCT ATTAAATACT ATATTAAATC CTTTAACACC
AACCCTTTTA ATCCTTCCGT TCGACAATTT ATAGCTAACA TAGCCATTAA CCGTTGGAAT
AAATATAAAA CGCAAGAGCC GGGTCTCGAA GATAAAACAG CCCTTTACCA GGATGATTTT
GAGCGCGCTC TTTATATGTT TAACCTTGTT TATAAAGCGG CTCCCAACCA TGCGCAGCTT
CACCACCAAT GGGGGGAATT ATATTATAAA AAAGGCTATG ATTTATATGA TGATTACTTT
ATAAACAAAA AAGATAAAAG TGTATATGAG GAAGCGTCAA AATATTTTGA CCTGGCTAAA
GAAAAATTTG AATATTCCCT TTTAATAGAC CCTGTTAACC CCGACACTTA TTTTTATATG
TCTAATATAG CTCTTATGCA GGGCAATCCG CAGGGCGCTT TGGAATGGGT TGATAAATAT
ACCCGCGGCC CGTCAATGGT AAAGAATGAT GAATATTTAA AAATAAATAA AAATAACGCC
AAAGCGGCCC AAATGAGGCA AAATATTTTA AGGCAGGCGG GCCGTTATGA ATAA
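
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. As a minimal sketch, the helpers below strip the whitespace and compute a GC fraction, which is one way the GC content field could be recomputed. The helper names are hypothetical, and how the database itself derives its GC content value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence given in the page's block layout."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAAGAAAA AAAATAAAAC AACAGTAGTG CCGCAAAACA CAGGCGCCGC TTTTTTGCAA"
print(len(clean_sequence(first_line)))  # number of bases on this line
print(gc_fraction(first_line))          # GC fraction of this line only
```

Passing the full gene sequence to `gc_fraction` gives the value for the whole gene.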
 
Protein sequence
MKKKNKTTVV PQNTGAAFLQ KACGHVIGWA VFFISLSVYV RTYDIAAVKI SLFFCALAVM 
FSVWASFKSL SEKPFNKQLF LSFLPFIFFA LWLFISFIIN PYKISSLEDF LKQFLYIFIP
FFIATSFSLK EVGTVVKFLT ASAVICFAYG FLQITGLDIM PWADFFGKRI FSALGNPNMF
ADFVIFMNFI VLALYLRKFE KKYLLLFAAG LVNLYFTESK GAWLSFGVTL VFFVFLYVKY
FPSNFIKKHS KFIWVVGVVL ALASVILVVV FASKRMQSVD FRAITWRGIV QMSADKAFTG
YGTGSFATVY PTYRRAEIFY IEKIHNNETQ HAENEFLEVL TDNGIIGLTL FLWLLYFVFY
LAFKKFKELR LDPKSQGPPA YYLLAFVSAA AAILIHSIFD VSMRFVSTGF LFWVFIGLIL
VLSGYETVVR PKELAGKGKW LTVAGAAVWL ISAVALCFLV YIFSEVIGRE SMLNTGRLLL
KILAWAGVTT LCGALLYLYY RILKIRQSVL PCLFILITLP FFFAAERFVR ADYYTNLGNY
YAGLNNWSTS IKYYIKSFNT NPFNPSVRQF IANIAINRWN KYKTQEPGLE DKTALYQDDF
ERALYMFNLV YKAAPNHAQL HHQWGELYYK KGYDLYDDYF INKKDKSVYE EASKYFDLAK
EKFEYSLLID PVNPDTYFYM SNIALMQGNP QGALEWVDKY TRGPSMVKND EYLKINKNNA
KAAQMRQNIL RQAGRYE
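
The protein sequence can be related to the gene sequence by translating codons with the genetic code named in the Translation table field. The sketch below builds the codon table from the compact amino-acid string used for NCBI translation table 11, whose codon-to-amino-acid assignments match the standard code; the special handling of alternative start codons in table 11 is deliberately left out, which does not matter here because the gene starts with ATG. The `translate` function is illustrative and is not the database's own tool.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the display spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first 21 bases and the last 12 bases of the gene sequence above
# give the first 7 and last 3 residues of the listed protein.
assert translate("ATGAAGAAAA AAAATAAAAC A") == "MKKKNKT"
assert translate("CGTTATGAATAA") == "RYE"
```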