Gene Emin_0692 details


Gene Information

Locus tag: Emin_0692
Symbol:
ID: 6263203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 765625
End bp: 767271
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 45%
IMG OID: 642611164
Product: hypothetical protein
Protein accession: YP_001875584
Protein GI: 187251102
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2165] Type II secretory pathway, pseudopilin PulG
TIGRFAM ID: [TIGR02532] prepilin-type N-terminal cleavage/methylation domain
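
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names gene_length and protein_length are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(765625, 767271)
print(length, protein_length(length))  # 1647 548
```

Under these assumptions the record's 1647 bp and 548 aa values follow directly from the listed coordinates.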


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000140435
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 7.26108e-19
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAGAAAAA GAAATCTGCA GTCAAAAGGA TTCACATTAA TTGAAATAGC TGTTGTTGTT 
CTAGTAATAG CTATTTTAGC TGCGGTGGCG CTGCCGCAGT ACAAAAAGTC GTTAGAACGC
TCAAGAGCTG CCGAAGCTTT TGACATTCTT ACCGAGATAA GGAATAAACA AGAGGCAAGA
GAACTCCTTG GCACGGGGAC GGCCAAAGGC TATACTGTAA AGTTTAGCGA TTTGGGGGAA
GTTATAGAGG GTAAAACATC TACAACAAAT ACATTAGACA CAGACCTCTT TACATATACA
CTTTCAAACA ACCCATATCC GCAGGCGTAT GCAAAAAGAA AGGATATGGA TTATTCAATA
GTGCAAGCTA ATGGGTACCA GGACAGTGCG TTATGCTGTT TGGGTAAAGA TTGTAAGGTT
GTAGACAGCG TTTTAAAAGG GTGCGAGAAG ACTGCGTGCC CCACAACCTG CGCGACTGGG
CATAAAAGAA CAGGATATTT TTTCCAGGAA GACGGGCCTT GCTGTGAGGC AAAAACAGCG
TGTTCTGCAA CATGTCTTGC GGGAGAAGAA AGAACAGCCG TACAATATAT TGAAGACGGC
GCCTGCTGCC AAACAAAGAC TTGCGGTGCC GGGCAGACGC TTGTAGGCGG TATATGTAAA
ACAACGTGTC CCGCAACATG CTCAGCAGGG GAAGAAAGAA CAGCCGCGCA ATATAGCGAA
GACGGCGCTT GCTGCCAAAC AAAAACTTGC GGCGCCGGAC AGACGCTTGT AGGCGATATA
TGTAAAACAA CGTGCCCCGC AACATGTTCC GTAGGAGAAG AAAGAACAGC CGCGCAATAT
AGCGAAGACG GCCCGTGTTG TGAAACAAAA TGCGATTTTA ATCAAATTAT TGTAAACGGG
GTTTGTAAAA CACCTTGCGC GTCTCTATCT ATTCCGCAAG GATATAAGAA AACTTCAAAT
AATTACTTAG AGGAAGAAGG CGGTTGTTAT ACTAAAGATA ACTCCTGCCG CGCGCTTCCA
AAATATTTTT GCGGCGGCAA TAAACATGGC GGATATTGTG TACATGGCGG TTACTGGTCA
GGCGGTACGG AATATCTTAC CGCCGCGCCT GATTATATAG TCGGGTATGA ATGTAAAAAT
TCCGTTACAG GAGAAGCTTG TGACGGCTGC GTTTTACCTT TGGAAAACAA ATGTTCTGAC
TCGGGCTTTT TTGAAAAAAA TATGAATTTT TGCTGTAACA GCAGCCTTGG CGATCAACCT
AAATGTTGTA TAAATGTTAT CTCCAGCAGC TGGCCGACTG AATGCGAGCC TCAAAACTTG
CCGCAAGGCC AGCATTGTTC AATGGATAAA ATGGTTATGG TATGTAAAAA TCTTTTAGGA
AACGCGTGTG TCTGCGCCAG TAAAGGCGGC GCGCTGCCGG CGGCTCCTTC TGATTATTGC
AATAAAGGAC AAGTGTATGA CGACGGTGTT TGCAAAACCA TCTGTCCTAA CTTCTGTCCT
TCCGGTAAAG TAAGAAGCGG CTTGTATTTT AAAGAAGAGG GCGCTTGCTG CGTAGATGGG
GATGGGGAAG ATGATTCTTC TGATTCTATT GTCTTACCGG GCGGAGGAGG CTCTTTAGGC
GACGTGGAAA TTATCACTAA GTTTTAG
 
Protein sequence
MRKRNLQSKG FTLIEIAVVV LVIAILAAVA LPQYKKSLER SRAAEAFDIL TEIRNKQEAR 
ELLGTGTAKG YTVKFSDLGE VIEGKTSTTN TLDTDLFTYT LSNNPYPQAY AKRKDMDYSI
VQANGYQDSA LCCLGKDCKV VDSVLKGCEK TACPTTCATG HKRTGYFFQE DGPCCEAKTA
CSATCLAGEE RTAVQYIEDG ACCQTKTCGA GQTLVGGICK TTCPATCSAG EERTAAQYSE
DGACCQTKTC GAGQTLVGDI CKTTCPATCS VGEERTAAQY SEDGPCCETK CDFNQIIVNG
VCKTPCASLS IPQGYKKTSN NYLEEEGGCY TKDNSCRALP KYFCGGNKHG GYCVHGGYWS
GGTEYLTAAP DYIVGYECKN SVTGEACDGC VLPLENKCSD SGFFEKNMNF CCNSSLGDQP
KCCINVISSS WPTECEPQNL PQGQHCSMDK MVMVCKNLLG NACVCASKGG ALPAAPSDYC
NKGQVYDDGV CKTICPNFCP SGKVRSGLYF KEEGACCVDG DGEDDSSDSI VLPGGGGSLG
DVEIITKF
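
The gene and protein sequences above are printed in space-separated blocks of ten, so they need whitespace stripped before any analysis. The following is a minimal sketch of that clean-up plus a GC-content calculation and a translation step. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code (table 11 differs in its permitted start codons; this gene starts with ATG, so that difference does not matter here). The helper names clean, gc_percent and translate are illustrative only.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First printed row of the gene sequence above.
first_row = "ATGAGAAAAA GAAATCTGCA GTCAAAAGGA TTCACATTAA TTGAAATAGC TGTTGTTGTT"
print(translate(clean(first_row)))  # MRKRNLQSKGFTLIEIAVVV
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to gc_percent and translate would allow the GC content and Protein Length fields to be cross-checked in the same way.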