Gene Emin_0578 details


Gene Information

Locus tag: Emin_0578
Symbol:
ID: 6262745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 632290
End bp: 633873
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 43%
IMG OID: 642611049
Product: type II and III secretion system protein
Protein accession: YP_001875470
Protein GI: 187250988
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1450] Type II secretory pathway, component PulD
TIGRFAM ID: [TIGR02515] type IV pilus secretin (or competence protein) PilQ
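
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(632290, 633873)
print(length)                     # 1584
print(protein_length_aa(length))  # 527
```

Under these assumptions the computed values agree with the Gene Length (1584 bp) and Protein Length (527 aa) fields of this record.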


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.000285943
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAGGA GTTTAATTGC TGTTTGGGCT TTTGCAGCGG CGCTTTGTTT GACAAGCGCA 
AGTATAGCAT GGGCTCAAAA TGTTAATTTG GAGTTACCTA AGGATGAACC CGCCCCTAAA
TCCGGCGGTT TCGCCGCCGC TAATGCGGAG TCATCTCTGT ATAAAGATAC AGAGGCCAAC
TCACCGTTAG ACAGAAAAGT TTCTATCAGA GTGGCAAGCG TTCCGATAGC TACGTTTCTT
AACAGTATTT CAGCGCAGGC TAAAATAAAC TTTATTATGA GTGAGGAATT TGCCAATAAA
AAAGTTACAG CTTCACTTAA TAATATTACA GTGAGAGAGG CGCTTGACAC ATTACTTCGC
GTGCAGGGAC TTACTTACCA GCGTATAGGT AAAAGCGACA GCTATGTCGT TACCAAACGT
TCAAGTGATT CACCTAACAC AATTACAAAA GTTTATACAT TAAACTATAT TTCACTTCAA
GGCAGCAACA GCACCGGCAA TACAAACTCG CCACAATCAA CTTTTGCCTC CAGCCTTAAC
GTCAACCCCA ATGAACAAAG AAGCGACATG GAAAACCTTG CCGAAATGGT TACAGACAGC
GCGAGAAAAG GCGCCACCGG CAGCGATTTC TTAACAATCA TTCAAAGTGT AATGTCCTCA
CAGGGTAAAA TAGCTATTGA CCCGAGAACA AATAAACTTA TTGTTACCGA TATTCCTGAA
ATTTTTCCGC AGTTGGAAAA CATTTTAGCT GAACTTGATA TCAAACCTCC GCAGGTTTTG
ATTGAAGCGC AAATTATTGA AGTAAACAAA TCAAGCGGTT TGGAAATAGG TTTAAGCTAC
GGCGGTACGG ACGGCACAAT ATTTAAATTT ACAGGTCCCG CAAGAGGCGT TGATATTGAT
TATGTTAAAG GTAACGGCGT AAGCGGCTGG GGTTATATTT TCCCTCCCGC AGGTACGGGT
ACGGGCGGTA GCTCCGGCGG CGACAGCGGT GGTGGAAGCA CTGACGGCAG CGGCGGCAAC
AGCAACTCCC CTAAAGACGC TTTCTTAGAT TTTTCTTCAT TTTCAATCGT GCTTAAATCA
CTTTTGACAA GAGGCGAAGC AAAGTACCTT GGTAAGCCTA AAGTTGTTAC AATAAACAAC
CAACCCGCGG TTATCGAATC AACAAGAGAC GCGGCTGTGG GTTTCGCAAG TAATTTGTCA
GGCAACACAG GCAGCACAAA CGTTGTAAGC CAGACAGCTG AAAGAAAAAC TGTGGGTTTA
ACCCTTAAAG TTACGCCCCA GGTTAACAAA GAGGGGTATA TTACCCTTCT TATCGAACCT
TCTTATTCTT CAGTCGCAAA CTCTGCCGTA AGTGATGGTA CGAAAGATAC GCTTAACAGA
AGCGCAAGCA CTCTTGTACG CGTTAAAAAC GGACAGACAG TTGTATTGGG CGGCTTATTG
TCTTCAAGAG AAATCTTAGA AGACAGGAAA GTTCCTTTAC TTGGTGATAT TCCTTTAATA
GGTTGGTTAT TTACACAAAG ATCAACAACT AAAGAAACAA CCGATATGGT TATTTTTGTA
ACGCCTACAA TCTTGGCGGA CTAA
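
A GC content figure like the one reported for this gene can be computed from a sequence printed in 10-base blocks, as it is here. The sketch below is one straightforward way to do that, not necessarily how the database derived its value. The function name `gc_content` is hypothetical, and the example input is only the first two blocks of the gene sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGAAAAGGA GTTTAATTGC"))  # 30.0
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives the GC percentage for the whole gene.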
 
Protein sequence
MKRSLIAVWA FAAALCLTSA SIAWAQNVNL ELPKDEPAPK SGGFAAANAE SSLYKDTEAN 
SPLDRKVSIR VASVPIATFL NSISAQAKIN FIMSEEFANK KVTASLNNIT VREALDTLLR
VQGLTYQRIG KSDSYVVTKR SSDSPNTITK VYTLNYISLQ GSNSTGNTNS PQSTFASSLN
VNPNEQRSDM ENLAEMVTDS ARKGATGSDF LTIIQSVMSS QGKIAIDPRT NKLIVTDIPE
IFPQLENILA ELDIKPPQVL IEAQIIEVNK SSGLEIGLSY GGTDGTIFKF TGPARGVDID
YVKGNGVSGW GYIFPPAGTG TGGSSGGDSG GGSTDGSGGN SNSPKDAFLD FSSFSIVLKS
LLTRGEAKYL GKPKVVTINN QPAVIESTRD AAVGFASNLS GNTGSTNVVS QTAERKTVGL
TLKVTPQVNK EGYITLLIEP SYSSVANSAV SDGTKDTLNR SASTLVRVKN GQTVVLGGLL
SSREILEDRK VPLLGDIPLI GWLFTQRSTT KETTDMVIFV TPTILAD