Gene Emin_0579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0579 
Symbol 
ID6262746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp633989 
End bp635719 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content43% 
IMG OID642611050 
Producttype II secretion system protein E 
Protein accessionYP_001875471 
Protein GI187250989 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0000642239 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAGAA AACTAGATGA TATTTTAATC GATTCCGGCA TTATAAACGC TGAGCAGCTT 
AAAAAGGCCA CATTGTACGC CAACCAAAAC AATGTTTCTT TGGGCGACGC TACTATTAAA
CTCGGCTTCG CTACGGAAGA GCAAATAACA ATTGCGCTCT CAAAACACTT TTCCGTTCCT
TACGCTTCCA AAGAAAATAA CATTTTAATA CCTGAAAAGG AACAAAACCT GCAAGATGTT
GTAAATGAAA AATTTGCCAG GGAAAACATG GTTCTTCCTT TATTTTTGGA AGAAGGCGTT
TTGGCTATAG CAATGTATGA CCCTTCAAAC GTTTTTTTAG TTGATAACGT AAAAATGATG
ACCGGCTATG ATATCCAACC GTTTATCGCC AGTAAATCCC AAATTTTAAC CGCTATTGAC
GTTTTTTACG GTGGTAAAGA CTTAATTGAA GAAGTTGTTG TAGGCAACGC CGCCGTTAAA
GAGGAAGAAG GGGACGACAT TGAAGTTATT TCCGTAGAAG GAAAGCTTGA CTTAGATACT
TTAAAAGGCA GCGGATCGCA CTACATTAAA CTTGTTAACG CTATTTTAAA GCAAGCTATT
TCGGAAAGAA CTTCCGATAT TCACTTGGAA ATGTTTGACG AAATGGTCAG CTTGCGTTTC
AGAATCGACG GCGCGCTTTA TGAAAGAACC CCGCCTCCTA AGGAAAGCGT TGCGGCTATT
ATTTCCCGTA TTAAAATTTT ATCTAAGTTA GACATTGCCG AAAAGCGTTT ACCGCAAGAC
GGCAGCTTCG CCATAAAGTA CCAGAACAGA ACCATTGAAG TCCGTGTTTC GGTTTGTCCC
ACGGTGTTTG GCGAAAAGCT GGTTCTTCGT ATTTTGGATA AAGGTACCAG CGTTCTTACT
ATTGAAAGGC TTGGCTTTGA GCCCAGACAA AGGGAAGACT TTTTATCTGC GGCTAACTTA
CCGCACGGGT TGATATTTCT TACAGGGCCT ACAGGTTCGG GTAAATCAAC CACGCTTAAC
GCCGTTTTAT CAACAATTAA AACCACTGAA CTTAACTTTA TGACGCTGGA AGATCCTGTA
GAATATAAAC TTCAGGGTAT AAGCCAGGTG CAGGTTAAAC CACAAATCGG TTTAACTTTT
GCCGCGGGCC TGCGTTCTTT TCTCCGCCAA GATCCTGACG TTATCTTGGT AGGTGAAGTC
CGCGATAATG AAACGGCCGA ATCATGTTTA AAAGCCGCTC TTACAGGCCA CTTAGTGCTT
TCAACGCTGC ACACTAACGA AGCTTTGGGT GCGATTCCGC GTCTTATTGA TATGGGTATG
GAGGCGTTCT TGCTTTCCAG CTCTTTAGCT TTAGTGGCGG CGCAGCGTCT TATCAGAAAG
CTTTGTCCGC ATTGCAGGCG TCCTTTTACG CCTTCACCTG AGCTTGTTGA ACAGGCTTTA
AGGGAATCAA AGCTTCCTCC CGGGGATAAA GACACGTGGA CTTTTTACCA AAAGGTAGGC
TGTCCCAAAT GCAGCGAAAC CGGGTATTTA GGCCGTATGG CTATTTATGA AGTTTTTAAA
ATCAATGAGG AAATGAGAAA TATTATTTAT AAAACGCAGG ACCTTATTGA CCTTAACCGG
GCTGCTGAAA GGTCCGGCGC GTGGAACCTT CGCGCGAGCG GTTGGAGAAA GGCCGTTAAA
GGGTTAACAA CACATGAGGA AATTTTATCC GTCACCACGT TGGAAGAATA G
 
Protein sequence
MSRKLDDILI DSGIINAEQL KKATLYANQN NVSLGDATIK LGFATEEQIT IALSKHFSVP 
YASKENNILI PEKEQNLQDV VNEKFARENM VLPLFLEEGV LAIAMYDPSN VFLVDNVKMM
TGYDIQPFIA SKSQILTAID VFYGGKDLIE EVVVGNAAVK EEEGDDIEVI SVEGKLDLDT
LKGSGSHYIK LVNAILKQAI SERTSDIHLE MFDEMVSLRF RIDGALYERT PPPKESVAAI
ISRIKILSKL DIAEKRLPQD GSFAIKYQNR TIEVRVSVCP TVFGEKLVLR ILDKGTSVLT
IERLGFEPRQ REDFLSAANL PHGLIFLTGP TGSGKSTTLN AVLSTIKTTE LNFMTLEDPV
EYKLQGISQV QVKPQIGLTF AAGLRSFLRQ DPDVILVGEV RDNETAESCL KAALTGHLVL
STLHTNEALG AIPRLIDMGM EAFLLSSSLA LVAAQRLIRK LCPHCRRPFT PSPELVEQAL
RESKLPPGDK DTWTFYQKVG CPKCSETGYL GRMAIYEVFK INEEMRNIIY KTQDLIDLNR
AAERSGAWNL RASGWRKAVK GLTTHEEILS VTTLEE