Gene Emin_0333 details


Gene Information

Locus tag: Emin_0333
Symbol:
ID: 6262844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 354832
End bp: 359640
Gene Length: 4809 bp
Protein Length: 1602 aa
Translation table: 11
GC content: 44%
IMG OID: 642610799
Product: outer membrane autotransporter
Protein accession: YP_001875229
Protein GI: 187250747
COG category:
COG ID:
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
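
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(354832, 359640)
protein = expected_protein_length(length)
print(length, protein)
```

Under these assumptions the computed values agree with the listed Gene Length (4809 bp) and Protein Length (1602 aa).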


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATA TATTTAAAAT AACGGCAATC TTTATTTTCT TATCAAGCGG CGCAAATGCG 
GCACAGCTTG TACTGACCTC GGCGGACCAT ACTCTTACCG ACAACTCCGC CTATGACGGC
GGCACCGCCG GCATTATTTT TAACAGCTCG GTTACAAACC GCGTTTTGAA CCTTGACGGT
TTTACAATTT CGGCGTCTAA TAATACAGCG CAGGGATTTC AGCTTACAGG CGGCGGGGTT
GTTACGGTTA ACGGTAACGG CGGCGCTATA AATTTAACGG ATAACAGAAA CGCCGCAGGC
ACATCAGGTA CGGGGTTAAA TCTGACAGGC GCGACAACGC TTAACATAAA CAATGCGGAT
GTAAATATCA GCGGCAGCGT TCTCGGTATT TCAAACGCCG CAGGTTCTAA ACTCAATTTT
ATCGGCGGCG GGACAAATAC GATTAACTCC AGCAATAATA AAAATGCGAC AACAACAGGC
GGCACGGGTA TTCTTTCTAC CGGCGCTGGC TCAGAGGTAT CAATAACGGA TTACCAAAAC
ATTATTTTTA ACGGTAACAC TAACATTGGC ATGCAGGCTG CAAGCGGCGG TTCTATTACA
ATTACCGGTA AAGCGGACGG CACCACAACA TTCACCGCCT CTGGCAACGG CCGCCAGGAC
GGTTCAGGCA ACCTTACCGC TGGGCAAGGT ATATATATCA GCACCGCCGG CTCCGCCGTA
ACAATGACAA ATATGCAGTC CGTAGATATT AACGGCAATT ATACCGGCGT ACAAAATGTG
GCATCGACAA ATTTAACTTT GTCAGGCAAC GGCAGTGCGA CATCCTCTTT AAGCATAAGC
AACAACACTT ATACGGGCCT TATTAACAGC GGCAGTACGG ATATAAGCGG TTTTTCGACT
ATAAATATTA ATAATAACGG CAATATAGGC GTTTCCGTTA CGGGCGCTTC GGCTTCTTTA
GATATAAGCG GCGTAGAAAC AATGAATGTA AATGGCAATG CCAGCGGTAT CACCACAGCG
GCAAACACTA CATTGAATTT AACGGGCACC GGATCTGCCG CGTCTAAACT TTCGGTCAAC
AATAATACGG GCACCGGGTT TTCTTCTGCG GGGCTTGTCA CAATTAAGGA CTTCAGAGAA
ATTGAAGCCA GCGGCAACGG CGGCAACGGT TTTGTCATCT CAGGCGCAAG CACTATAGAG
ACAGCGGCGG GGCAGAATCT TACCATAAAA ACAAATAATA ATACAAGCGT GGGAACTTTC
GGCTTTTTAT CAACTGCAGC AAATCTTGTT TTGGACAGGG TTGATATATT TGAATCCACC
GGTAACGGAA GGTACGGGTT TGCCGTAAAC AACGGTAAAA CAACTATGAA TCTTACCGAT
AACGGTGTGG TAACTTTGGA CGGCAACGGG GTTTCTGCTA CGGCGGCTGC AGCGGGGCTT
TTGGTTAACG GGGCAACGGC GGAACTTGAA ATAAACGGCG GCCTTAACTC AACTTTAAGC
GCCTCAGGCA ATAAAAACAG TACGGGATCA GGGGGCGTCG GTATACAGGC GGCTTCAGGC
TCTTTAAAGC TCAATAACAT GGAAACTATT ACCGCAGAAG GAAACATTAC CGGTATATCC
ACATCCGCAA ACGGCTCTCT TCTTTTAGAT GGAAGCGGCA GCCTAACCTC TAAATTAAGC
GTAAATAATA ATATCGGCGC GGGCATAAGC GCTGCACAGT CAACTTCTTT TACAATTCAG
GATTTTGCAA TCATAGAGGC AAACGGCAAC GGCGGTACAG GGCTTAATTT AGGCGTAAAC
TCAACAATAA CAAATATTGC GGGTGAGCAG ATTTCAATAA GTTCAAGCAA TAATACGGGA
GCTGCCAGCG ACGGCATATA TTCCAGCGCG GCTAATTTGG AGCTTAAGGA TTTAGCTTCG
CTCCAGCTTA ATAATAACGG CAGATACGGG CTGCACGTTT ACTCGGGCAG CGTATCTATA
GACGCGGCAG GCAGGGTCTT CGGCGTGGGT GATTTAAGCG TAACCGGCAA CGGAAACCAC
GGCGTATATT TAAGGCTGGA TAATTTAGAA ATAGCCAATG TCGATGGGAT AAACATTTCC
GGCAACGGCG GAACGGGAAT GTTGGCAAGC GGAGGAACTC TTTCCTTAAA AAGTAATAAC
ACCGAAGGGA AGATGATCTT TTCAAACAAC GGCGGAGACG GCGCAAAATT TACAGGTTCA
ACCCAGCTTA TAAACTTTAC GCAGATCAAC GCTTTTAATA ACGGCGGAAC CGGCTTTAGC
TTGGCGGGTA ATATTGACTG GCAGACACAG GTCGATTCTG AATTTAGCTT TGTAGCAAAA
GACAACGGCG CTTCAGGCAT AGTATTAAAC GGCTCATCTT TGGAAGATCC GCTGGTTTTT
AACGTTAAAG AACTTGTGCT AACAAATAAC GGCACAAAGG TTACGTCTAA CTCAGGCACG
GGCCTTTCAT TAGGTAGAAA TATAACTTTT AATGTTACGG ACACTGATAT TACCGCGACG
GATAACGGCG CAATAGGACT TTATGTCTCA GGCGGCGGTG TTTTAAACGC AATATCCTCA
ACAGGCACTA ACATTATCCG CTTGCGAAAT AACGGTGATA AAAGTTTAAG CGGCAGCTCT
ATGGGTATGG ACGTGCACGG CGGGAGATTC TACGCCGAAA ATATGAACGT TATAAATATT
GAGAACCAGA ACTTCGGCAT AATAGCCTGG TACAACGGCG AAGCCATAAT GAAGGATTCC
AATATAATCG TTACCGATAA CGCTAACGGA ATTCTTGCTT CACGCGCTAC GATTGATATA
AACAGCACAT CGGGGTTGCA TACAATATAT GGTTCCGGCA ATACCTATGC TCTTATTACC
CAGGAGAACG CGCAGTATGA AGGCAGCGAG TATTTTAATA TAACCAATAT GGATATATAC
CTTGAAAATA ACAGCTATGC CATTGCCTCT ACCTTTGGCG GCGGCGCAAT GAATATAAAA
GGTAACGGAT CTAATATTTT ATCCTTTAAA GATAATTTGC AATCAGGCCT TTTTGCGCAA
TGGAACGTTA ACAGGCCGTC TGCGGGTACA AAAGTGGCTT TATCACAGGT TAATGTTGAA
GGCATGTATA TACAAACCGA AAATGATGAT TTAGCAATAG CCGAATCAAG CCAAGAGGGT
GGCGCGGAAA TTAATATAAA AGATTCAACC ATAAACATGC TTACGCAAGA TACTTTGTTC
CAAACCATAG GTTATGGCGT TTATAATATA GAGAATGTTA CGGCAACAGG TAACAGCGCG
CTTTTGGTTA ATACGGCTAT AGCTGACCAT AATGATCCAA GCGTAATGGT TGGCAAGTCT
GAGTTTAACG CAAAAGATGC CTTCCTAAGC GGCGTTATAA CAACAGCCGC GGATACCGCC
GCACATGTAT CGTTTACGGA CAGCGCGTGG AATATGATTA ATTCTTCCAA TATGACCTCG
CTTACTAATA GCTCTTCTGA TATTTTTATA GGTTCTCAAA CGTCCGGGGC TTATAATACT
CTGACGCTTG AAAATTATAT GGGTCATAAT GGCAATATCT ATTTAAATAC AGACGTTAAC
GGCGGCAGCA CGGATATGAT TTCCGTAAGC GGCTCAGCTA ACGGTTCTAC CACTTTGTTT
ATAAATGATA CAAGCACAAC TCTTCCCGGT ACGGACCCTC TTGAAATAAA AATAGTAGAG
GCTAAGGCCG GCGCGCACAT TTCCGCCAAT TCATTTGTGT TAAACGGCGG CAGCATTGAT
ACAGGCGCTT TTAAATATTT CCTTTTACAG GGAAATAAAG ACCAGTCGGA CAATCAGTCT
TTCTTCCTTC GCTTATCTAA CGCCATGACA GACACTTTTA AAACAATGTT AAACATACCG
CAGCTTAATG TTGTTATGGC ACAAACAGGC ATGAATTCCC TTCAAAAACG TTTAGGCGAC
TTACGTAATT TCGGCAACTC ATCTAAACAA CACGGCGTTT GGACAAGATC ATACGGAAAG
CAAATTACCG TAGATGACCT GATAGAAACA GACATGTCAA TATTTGGTTT TGAGGCCGGG
TATGATTTTT TAATGAGAGA AGGCGACGAA AACAGAATTT ATCTCGGTTT AATGGGTGGT
TATATGACTG CTGGTAATGT AAAAACAAAA CAGGATAACG GTTTTTACAG TAAAGGAAAC
GGCGATGCCC CCAGCGTAGG CGCTTATATG ACTTTTATAA CCCCTTCTTC ATATTTTTTA
GATTTAACCA TAAGAAATTT TTGGACAAAC TTAGATATGG TTAACTATTC TTCTTTAGGA
ACCGAGCTTA AATATTCCCC CAGAAGAAAC CTGTTTACTA TGAGCGCGGA ATTCGGTAAA
GAGTTTCAGA TTATGTCTTG TCGCGGTACA CAGTGGCGGG TTGAACCTAA AGCCGAATTG
GCATTTTTAA ACGCTTCGGG AGACAGTACA AATGTTGAAA ACGGCAACGG TTCTTTAAAA
TATGAAAATG CCTCATATAT AATAACAAAA GCCTCGGTAA TGCTTGCGTA TATGCATAAA
AGGCAAACAG GTCTTTTTAC ACAACCTTAC GCAGAGCTTG CTTACAGCCA AACGTTTATG
GGAAAGGGCG ATGTCAGCTA CTCCGGCGCG GTTAGCGAAA CAAACCTTTC AGGCGGCGCT
TTTGAAACGG CCTTGGGTCT GAATTATCAG TTTAACAAAG ATATCTATGT ATATGGCCAG
GCTACTTTTG AAACAGGAAG CAAGATACAA GCCTTCGGCG GCAATTTAGG CTTCCGTTTG
GCTTTTTAA
 
Protein sequence
MKNIFKITAI FIFLSSGANA AQLVLTSADH TLTDNSAYDG GTAGIIFNSS VTNRVLNLDG 
FTISASNNTA QGFQLTGGGV VTVNGNGGAI NLTDNRNAAG TSGTGLNLTG ATTLNINNAD
VNISGSVLGI SNAAGSKLNF IGGGTNTINS SNNKNATTTG GTGILSTGAG SEVSITDYQN
IIFNGNTNIG MQAASGGSIT ITGKADGTTT FTASGNGRQD GSGNLTAGQG IYISTAGSAV
TMTNMQSVDI NGNYTGVQNV ASTNLTLSGN GSATSSLSIS NNTYTGLINS GSTDISGFST
ININNNGNIG VSVTGASASL DISGVETMNV NGNASGITTA ANTTLNLTGT GSAASKLSVN
NNTGTGFSSA GLVTIKDFRE IEASGNGGNG FVISGASTIE TAAGQNLTIK TNNNTSVGTF
GFLSTAANLV LDRVDIFEST GNGRYGFAVN NGKTTMNLTD NGVVTLDGNG VSATAAAAGL
LVNGATAELE INGGLNSTLS ASGNKNSTGS GGVGIQAASG SLKLNNMETI TAEGNITGIS
TSANGSLLLD GSGSLTSKLS VNNNIGAGIS AAQSTSFTIQ DFAIIEANGN GGTGLNLGVN
STITNIAGEQ ISISSSNNTG AASDGIYSSA ANLELKDLAS LQLNNNGRYG LHVYSGSVSI
DAAGRVFGVG DLSVTGNGNH GVYLRLDNLE IANVDGINIS GNGGTGMLAS GGTLSLKSNN
TEGKMIFSNN GGDGAKFTGS TQLINFTQIN AFNNGGTGFS LAGNIDWQTQ VDSEFSFVAK
DNGASGIVLN GSSLEDPLVF NVKELVLTNN GTKVTSNSGT GLSLGRNITF NVTDTDITAT
DNGAIGLYVS GGGVLNAISS TGTNIIRLRN NGDKSLSGSS MGMDVHGGRF YAENMNVINI
ENQNFGIIAW YNGEAIMKDS NIIVTDNANG ILASRATIDI NSTSGLHTIY GSGNTYALIT
QENAQYEGSE YFNITNMDIY LENNSYAIAS TFGGGAMNIK GNGSNILSFK DNLQSGLFAQ
WNVNRPSAGT KVALSQVNVE GMYIQTENDD LAIAESSQEG GAEINIKDST INMLTQDTLF
QTIGYGVYNI ENVTATGNSA LLVNTAIADH NDPSVMVGKS EFNAKDAFLS GVITTAADTA
AHVSFTDSAW NMINSSNMTS LTNSSSDIFI GSQTSGAYNT LTLENYMGHN GNIYLNTDVN
GGSTDMISVS GSANGSTTLF INDTSTTLPG TDPLEIKIVE AKAGAHISAN SFVLNGGSID
TGAFKYFLLQ GNKDQSDNQS FFLRLSNAMT DTFKTMLNIP QLNVVMAQTG MNSLQKRLGD
LRNFGNSSKQ HGVWTRSYGK QITVDDLIET DMSIFGFEAG YDFLMREGDE NRIYLGLMGG
YMTAGNVKTK QDNGFYSKGN GDAPSVGAYM TFITPSSYFL DLTIRNFWTN LDMVNYSSLG
TELKYSPRRN LFTMSAEFGK EFQIMSCRGT QWRVEPKAEL AFLNASGDST NVENGNGSLK
YENASYIITK ASVMLAYMHK RQTGLFTQPY AELAYSQTFM GKGDVSYSGA VSETNLSGGA
FETALGLNYQ FNKDIYVYGQ ATFETGSKIQ AFGGNLGFRL AF
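
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage for a nucleotide string. The helpers `join_blocks` and `gc_percent` are hypothetical names, and the example uses only the first line of the gene sequence rather than the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence from the record above.
first_line = "ATGAAAAATA TATTTAAAAT AACGGCAATC TTTATTTTCT TATCAAGCGG CGCAAATGCG"
seq = join_blocks(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applying `join_blocks` to the complete gene sequence and passing the result to `gc_percent` would give a figure that can be compared with the GC content field, bearing in mind that the record reports a rounded value.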