Gene Emin_0073 details

Gene Information

Locus tag: Emin_0073
Symbol:
ID: 6263984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 76689
End bp: 79007
Gene Length: 2319 bp
Protein Length: 772 aa
Translation table: 11
GC content: 41%
IMG OID: 642610535
Product: outer membrane protein assembly complex, YaeT protein
Protein accession: YP_001874977
Protein GI: 187250495
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4775] Outer membrane protein/protective antigen OMA87
TIGRFAM ID: [TIGR03303] outer membrane protein assembly complex, YaeT protein
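
The coordinates and lengths listed above agree with each other if the start and end positions are read as 1-based inclusive coordinates. The record does not state this convention, so it is an assumption here. A minimal sketch of the arithmetic, with illustrative function names that are not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(76689, 79007)
print(length, protein_length(length))  # 2319 772
```

The subtraction of one codon reflects that the terminal stop codon (TAA in this gene) is part of the CDS but is not translated.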


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00110643
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 5.06207e-17
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAA TACTATTTTT ATTATTAATT TTAACATCCG CAGGCCTTTT CGCGCAGGAA 
ACGGAAATGA ACGCAACCGG CCCGTGGATG GTGTGTGAAG TAGCCGTATC CGGGTTAAAA
AACGTAGCTA AAAAAACAGT AACCAAAGCG GTGCACGCCA AAAAAGGCGT TATGTATGAA
CGAGGTTTTG TTTACGACGA TATGCAGGCA ATCATTTCTT TGGGAAGTTT TGATAATGCG
GAAGTAGACA TAAGCCCCAT GGAAGGTGAG CGCAAAAACA AAGAAGATAA AGAATTTCAC
CCCTGTTTTA AAGTTACCTA CATTGTAAAA GAAAAACCTA TTTTTGACGC TATTACCTAT
GAAGGAAGGA AAAGATTAAG CCGAACCGCG ATTACCGAGG CTATGACTTT AAAAATTAAA
GATCCTTTTA ATGAAACAAA GCTGGTTTCC GATTTAGAAC GCATTAAAGC CAAATATGCG
GAAAAAGGAT ATATTAACGC CGACATTAAA TATGAAACAG AAGTTAATGA AAAATTAAAT
ATTGTTACCG TAAAGTTTAT TATTGACGAA GGCCAAAGGG CGAGGGTTAA AGAGGTTGGT
ATCGAGGGTG CGAATTTAAT ACCTTCTAGA AAACTTGTTA AAAAGACCGC AAACAGGCCC
GGAAAAGTTT TCAAACCACA AAAATTGCAG CAAGATTATG TAAAGATGAC TCTTTACGGC
CGCAACAAAG GTTTTAGCGA GTATGAAATA ACCCCGCCGC AGATTGACAT GAATGATGAA
AAAAGCGAGA TTACTATTAA CTATGATGTA ACGGAAGGAG CAAAAGCGCA ATACGGAACA
GCTGCTTTTG ACGGCAATAC TGTCTTTACC GATGAAGAAT TACAAAAGCA AATTTTTTTC
AGGGAAGGCA AAACATATAC CCAAAAAAGC TTTGATATGA CTATGCGTGA TTTGCAGGAA
CAATACGCTA ACAAAGGGTA TTTAAACGCA AAAATTAATC CCATAAGAAC AATTGACGAC
GCGGGCAGAC TTAATATCCT TTTTGATATA AGCGAAAGCC ATATTTTTTA CATTGACCAT
GTTGACGTTA CCGGGTATGA GACTACAAGA AGAAACGTTC TTGCCCGTGA AATAACGGTT
AAACCCGGCG ATTTATTTGA TTATTCTAAG ATACGCAGAT CGCAAACAAG GCTTTTAAAC
TTAGGGTTTA TTAACGATGT GCAGCTTGAT ATTTCGCCTA CGGCGTATCC CGACAGGGTA
GACGTAGGCT TTAACGTAGT TGAAGGCCGT CCCGGCATGT TTACTGCCGG TGTCGCCATG
TCTTCTTTAG ACGGTTTATA CGGTGAAGTC AGCGTCAGCC ACATGAATTT ATTCGGGCGG
GCACAAAGGC TTAATTTAAG AACGCAGTTT GGTAAAAATT TACTTGACTA TACGATAGGC
TGGTCTACGC CGTGGGTTTT TGACAGGCCT GTTTCTTTCG GAGTGGATGC TTTTAACACA
AGGCGTTACC GCCCTTTTAG GAGTGAATCG CGCGCGTATA CGGATAGAAG GATTGGCGGA
AGGGTAAGAG TCGGGCCTAG ATTTTCAGAT GATATTTACC AATTGGCTTT TTCCTATACA
TTCCAAAACA TTGATATTTA TGACATAGAT GACCAGTTTA AAGGGGACAT TGACAGCGAA
AGGTTAAACT CTTCCTCTTT CAGCGCGGAT TTCGCAATAG ACACGCGTGA TAATATTTGG
GACCCTACCA CCGGTTGGCG TAACTCCATC GGGCTTGAAC TTACCGGCGG CCCTTTAATG
GGAGATTTGG ATTTATGGAC AATAAATTTA CGCTCAATTT TTAACCGTAC TTTAATAAAT
ATCGGCGGTA ACTATCCTAT AGTTTTTGTG TTGTCTAATA AATTCGCGTC AACAAATGCT
TACGGAAGGA CGGGAGAGGT GCCCGTGTTT GAAAGATTTT TTATAGGCGG CGCCGATACA
ATAAGAGGTT ATGACCATAA CGGACAAGTT GGGCCGCAGG ACGGCGGTAA TATGTATTTT
GTATCTTCGG CGGAAGTTCG TCTTCCTCTC GCAAGAGAGG GCAGAAGAAG CATTGCCCAG
CTTGCGGCGT TTTTTGATAT AGGAAACTCA TGGAAAAGCG CAAGCGATGT AAGGTTTAGA
ATGGGCCCCG AAGAGGACGA GTTTAAAGCC GGCGTAGGTT TGGGGTTAAG GTTTGCCACT
CCGCAGCTTC CCATACGTAT AGACTGGGGG TATGGTTTGA ACCACAGGCC GGGTGAATCA
AGAACCAAGT TCTATTTTAA TATGTCTAAC GCGTTTTAA
 
Protein sequence
MKKILFLLLI LTSAGLFAQE TEMNATGPWM VCEVAVSGLK NVAKKTVTKA VHAKKGVMYE 
RGFVYDDMQA IISLGSFDNA EVDISPMEGE RKNKEDKEFH PCFKVTYIVK EKPIFDAITY
EGRKRLSRTA ITEAMTLKIK DPFNETKLVS DLERIKAKYA EKGYINADIK YETEVNEKLN
IVTVKFIIDE GQRARVKEVG IEGANLIPSR KLVKKTANRP GKVFKPQKLQ QDYVKMTLYG
RNKGFSEYEI TPPQIDMNDE KSEITINYDV TEGAKAQYGT AAFDGNTVFT DEELQKQIFF
REGKTYTQKS FDMTMRDLQE QYANKGYLNA KINPIRTIDD AGRLNILFDI SESHIFYIDH
VDVTGYETTR RNVLAREITV KPGDLFDYSK IRRSQTRLLN LGFINDVQLD ISPTAYPDRV
DVGFNVVEGR PGMFTAGVAM SSLDGLYGEV SVSHMNLFGR AQRLNLRTQF GKNLLDYTIG
WSTPWVFDRP VSFGVDAFNT RRYRPFRSES RAYTDRRIGG RVRVGPRFSD DIYQLAFSYT
FQNIDIYDID DQFKGDIDSE RLNSSSFSAD FAIDTRDNIW DPTTGWRNSI GLELTGGPLM
GDLDLWTINL RSIFNRTLIN IGGNYPIVFV LSNKFASTNA YGRTGEVPVF ERFFIGGADT
IRGYDHNGQV GPQDGGNMYF VSSAEVRLPL AREGRRSIAQ LAAFFDIGNS WKSASDVRFR
MGPEEDEFKA GVGLGLRFAT PQLPIRIDWG YGLNHRPGES RTKFYFNMSN AF
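
Both sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAAAAA TACTATTTTT ATTATTAATT TTAACATCCG CAGGCCTTTT CGCGCAGGAA"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Running the same two functions on the complete gene sequence would give a value to compare against the reported GC content.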