Gene Emin_0892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0892 
Symbol 
ID6262684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp983966 
End bp984994 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content40% 
IMG OID642611373 
Productglycosyl transferase family protein 
Protein accessionYP_001875784 
Protein GI187251302 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0000759355 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAAATT TATCGACTTA TGTGTTTTGT ACTTTATGCG CGCTTTTAAT TTCCTCAATA 
CTGGCGCCCG CGCTTGTGTT TATAACAAAC GGCAGATTTA CCGATAAACC CGGCGGCATT
AAAAATCACA TCGGAAACAT ACCGCTTGTA GGCGGTACTG CTATAGTGGC GGGTTTTTTT
ATAAGTCTTT TAATAATCCG TTTTACTACT GACTTCCCGA GCGGCACTTT GCACAGCTTA
AGAGGCATTT TTCTTGGCGG TTTTATAATT TACTTTACTG GGATAGTTGA CGATTTAAAA
AAGCCCAAAG GCATAAACCC CGGCCTTAAG CTGCTTGGAC AGGCGGCAGC GGCTTACATA
TTAATACATT ACGGCATTAA AATTAATTTT ATCGAAAATG TTTGGATTGC AAACACTCTT
ACTCTTATAT GGATAATAGG TCTTACAAAT GCTTTTAACT TATTAGATAT TATGGACGGC
CTTTCCGTAA GCCAGGCGGC GTGCGCTTCT TTATTTTTCA TTATAATCGC GCTGCCTTCC
GAACATATTT ACGTTAATTT TACGGCGGCG GCGCTTTTAG GTGCGGCTTT AGGTTTTTGG
CCGTATAACC ACAGCAAAAG GCAAAAAATA TTTATGGGCG ACGGCGGAAG CATGTTTTTG
GGTTTTGTTT TAGCGGCGGT ATCTTTGGGA ACGGAATATT CCGCAAAAAA CCCTATGGCC
GTTTTAGCGC CTATTCTTAT TTTAGCCGTG CCTTTGTGGG ACACGGGCTT TGTGTTTTTA
GTAAGGACAA TACAGGGTAA AAACCCTTTT TTAGGTTCAC CCGACCACGC TGTTATTCTT
TTAAGAAATA AAGGCTTAAC GCCAAATACC ATACTGGCTT TATTTTTAAC AGCATCAATA
GGTTACGGTG CGCTGGCTTT GATAGTAATA AACGTTTCCG ATTTTTGGAC ATACATTATC
TTCGCCTTTT CAATGGTTGA TATGACCGCC GCCGCGCATA TGATTTATAA ATTTAAAGGA
TTAAAATGA
 
Protein sequence
MQNLSTYVFC TLCALLISSI LAPALVFITN GRFTDKPGGI KNHIGNIPLV GGTAIVAGFF 
ISLLIIRFTT DFPSGTLHSL RGIFLGGFII YFTGIVDDLK KPKGINPGLK LLGQAAAAYI
LIHYGIKINF IENVWIANTL TLIWIIGLTN AFNLLDIMDG LSVSQAACAS LFFIIIALPS
EHIYVNFTAA ALLGAALGFW PYNHSKRQKI FMGDGGSMFL GFVLAAVSLG TEYSAKNPMA
VLAPILILAV PLWDTGFVFL VRTIQGKNPF LGSPDHAVIL LRNKGLTPNT ILALFLTASI
GYGALALIVI NVSDFWTYII FAFSMVDMTA AAHMIYKFKG LK