Gene Emin_0681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0681 
Symbol 
ID6263350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp753871 
End bp755574 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content47% 
IMG OID642611152 
Producthypothetical protein 
Protein accessionYP_001875573 
Protein GI187251091 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2165] Type II secretory pathway, pseudopilin PulG 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000227256 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCAAA AAAAAGGGTT CACCTTAATA GAAATAGCTG TGGTAGTTTT AGTAATAGCT 
ATTTTAGCGG CTATAGCTTT GCCGCAGTAC AGAAAGTCGT TAGAACGCTC AAGAGCTGCC
GAAGCTTTTG ACATCCTTAC AGAGATAAGG AACAAACAAG AATCCAGGGA TTTACTTGGC
ACGGGCACCG CCAAAGGCTA TACTGTAAAG TTTAGCGATT TGGGGGAAGT AATAGCGGGC
AAAACCTCCA CAACAAACAC GTTAGACACA AAACTTTTTT CATATGTTCT TTCAGATAAT
CCTTACCCTC AGGCGTACGC CAAAAGAAAA GATTTAGATT ATTCAATAGT TCAAACCAAA
GGCTACCAAG ACAGCGCATT GTGTTGTATA GGCAAAGACT GCGATATGGT AGATAATGTT
TTAAAAGGGT GTGAGAAGAC AGCGTGTCCT ACAACATGCG CGGCTGGATA TAAAAGAACA
GGGTACTTTT TTTCGGAAGA CGGGCCTTGC TGTGAAGCTA AAACGTCGTG TCCCGCAACA
TGTCCTGCAG GGCAAAAAAG AAGCAGCGTG CAGTATACGG AAGATGGAGC GTGCTGCGTA
GCCAAAACAT CATGCCCGAC AACATGTCCT ACAGGCCAGC AGAGGACAAG CGTACAATAT
AGTGAAGACG GAGCGTGCTG CGTAGCCAAA ACATCATGCC CGACAACATG TCCTACAGGC
CAGCAGAGGA CAAGCGTACA ATATAGTGAA GACGGAGCAT GCTGCGTATC AAAAACGTCG
TGTCCCGCAA CATGTCCTAC AGGTCAAAAG AGGACAAGCG TACAATACAG TGAAGACGGA
GCATGTTGCA CGGCTAAAAC GGCTTGTCCC GCATCATGTC CTGCCGGAGA AGAAAGAACA
AGCGTGCAAT ACAGTGAAGA CGGAGCGTGT TGCACGGCTA AAACGGCTTG TCCCGCGACA
TGTCCCACGG GCCAGGAAAG AACAAGCGTA CAATACAGTG AAGATGGAGC GTGCTGCCAA
ACAAAAACCT GCGGCAGCGG ACAAACCCTT GTTGGGGGAG TATGTAAAAC AGCGTGTCCT
GCTACATGTC CTACAGGTCA AAAGAGAACA AGTTCCCAGT ATTCGGAAGA TGGAGCGTGC
TGCGTGGCTA AAACAGCGTG CCCCGCTACA TGTCCTACGG GCCAGGAAAG AACAAGCGTG
CAATACAGTG AAGACGGCGA CTGTTGTAAA ACTTCAAGCG GATGTCCTGT CGGTACTTCA
ATAGGAGCTA ATGGAAAATG TTGTACTCCT GAATTAATGG GTATTGACGA ACGGGACGGA
GGCTGTTGCG TAGAATTTTC AAACTGCGGT CTTACCGGAC CCGGAGGACT TGCCTTTCCT
TGCAGATGTG CAAGAACAGG CACAGGCACT ATATCTTGTT CTGGTAACAG TACGCAATCC
TGCGGTCAGT GCGGCACCCA AACAAGAACA TGTAATACTT CAACAGGAGT ATGGAGCTCA
TGGAGTTCGT GCAATGAATC TCCAAATCCT TTAAGCCCTT CGGACCAGGA TATGTGCTTA
AGCTGCGGCG GTACATTAAC ATGTAAAGGC TGCGATTGCG GAACTTATTA CGACTTTAGC
AGCGGCTCGG GTATTTTATC ATATTGCTCT TTTAACAGCC GTTACGGATG CGTATGCGGA
CAGACGTACA GATCATGCAG ATAA
 
Protein sequence
MSQKKGFTLI EIAVVVLVIA ILAAIALPQY RKSLERSRAA EAFDILTEIR NKQESRDLLG 
TGTAKGYTVK FSDLGEVIAG KTSTTNTLDT KLFSYVLSDN PYPQAYAKRK DLDYSIVQTK
GYQDSALCCI GKDCDMVDNV LKGCEKTACP TTCAAGYKRT GYFFSEDGPC CEAKTSCPAT
CPAGQKRSSV QYTEDGACCV AKTSCPTTCP TGQQRTSVQY SEDGACCVAK TSCPTTCPTG
QQRTSVQYSE DGACCVSKTS CPATCPTGQK RTSVQYSEDG ACCTAKTACP ASCPAGEERT
SVQYSEDGAC CTAKTACPAT CPTGQERTSV QYSEDGACCQ TKTCGSGQTL VGGVCKTACP
ATCPTGQKRT SSQYSEDGAC CVAKTACPAT CPTGQERTSV QYSEDGDCCK TSSGCPVGTS
IGANGKCCTP ELMGIDERDG GCCVEFSNCG LTGPGGLAFP CRCARTGTGT ISCSGNSTQS
CGQCGTQTRT CNTSTGVWSS WSSCNESPNP LSPSDQDMCL SCGGTLTCKG CDCGTYYDFS
SGSGILSYCS FNSRYGCVCG QTYRSCR