Gene Emin_0536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0536 
Symbol 
ID6262732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp588143 
End bp589744 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content46% 
IMG OID642611006 
Producthypothetical protein 
Protein accessionYP_001875428 
Protein GI187250946 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2165] Type II secretory pathway, pseudopilin PulG 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000617435 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC GAGGATTCAC CTTAATAGAA ATAGCTGTAG TAGTTTTAGT AATAGCTATT 
TTAGCTGCGG CAGCTTTGCC GCAGTACAGA AAGTCGTTAG AACGCTCAAG AGCTGCCGAA
GCTTTTGACA TCCTTACAGA GATAAGGAAT AAACAAGAAG CCAGGGATTT ACTTGGCACG
GGGACGGCCA AAGGCTATAC TGTAAAGTTT AGCGATTTGG GGGAAGTAAT AGCGGGTAAA
AATTCCACAA CAAACACATT AGACACAAGC CTGTTTACTT ATACGCTTTC AAATAACCCA
TATCCGCAGG CATATGCCAA AAGGAAGGAT ATGGATTATT CGATAGTTCA GACAAAGGGG
TATCAAGACA GTGCGTTATG TTGTTTGGGA AGGGATTGTA AAGTTGTAGA CAGCGTTTTA
AAAGGGTGTG AGAAGACAGC GTGCCTTACA ACATGCGCAA CGGGATATAA AAAAACGGGG
TACTTTTTTT CGGAGGACGG GCCTTGCTGT GAAGCAAAAA CGTCGTGTCC GACAACATGT
CCCGCAGGGC AGCAGAGAAC AAGTGCACAA TATAGCGAAG ACGGAGCGTG TTGCGTATCA
AAAACGTCGT GTCCCGCAAC ATGTCCTACG GGGCAGAAGA GGACAAGCGT ACAATATTAT
GAAGACGGAG CTTGTTGCAC GGGTAAAACA GCCTGTCCTA CAACGTGTCC TACAGGCCAG
CAGAGGACAA GCGTACAATA TAGTGAAGAT GGAGCCTGCT GCGTAACAAA AACAGCGTGT
CCTGCCACAT GTCCCACAGG TCAGGAAAGA ACATCCATGC AATACAGTGA AGACGGAGCG
TGCTGTCAGT CAAAATCTTG CGGTAGCGGA CAAACCCTTG TAGGCGGGGT ATGTAAAACA
GCGTGTCCCG CAACATGCCC TTCCGGTCAG GAAAGAACCT CAAGCGGGTA TTCTGAAGAC
GGCGTTTGTT GCCAAACAAA AACATGTACC ACTGGTCAGA CGCTTGTTAA CGGAGTATGT
AAAACAGCGT GTCCCGCAAC ATGCCCCTCC GGTCAGGAAA GAACCTCAAG CGGATATTCT
GAGGACGGAG CGTGCTGTAA AACAAAATCA TGTCCTTCTG GCCAATATTT AACAAACGGT
ATATGCTGTC TTAACGCACA GGTGTCTAAA GACGGTAAAA CATGTATATA TCTTTATAAA
CCCGAGGTTA TTAAAGTGGG TATACTGGTT GACTGTCATA ACAATTACAG CTTTGTAGAT
AAAAAGAAAA CACAATGTTA CAGGACTGGG GCCCCCAGAT ACTTTGAGGG GGGGAAATTG
CCGTACGTAG CCGCAGGCGG CGGATGCACC GCTCATTATT CATACTATAA TGGCGGTAAT
TGGTGGGAAG GCGGAATAGC GCACGGTACG CCGTCAGACT GTAATAACAG TATTTCCGAC
CAACAGGCTT GCGATAATAA CTGCAAAGGC GCGACATGTT CCTTTAAATG TATTAAAACC
AAGTCTTACG GGGACAGGTG CGGCACATAT AAATGTTCCA ACGGTATGCA TTGCGATAAT
GCGGAAGGTT CGGGTACTAT GCTAAGGTGT GTAAGGAAAT AA
 
Protein sequence
MKKRGFTLIE IAVVVLVIAI LAAAALPQYR KSLERSRAAE AFDILTEIRN KQEARDLLGT 
GTAKGYTVKF SDLGEVIAGK NSTTNTLDTS LFTYTLSNNP YPQAYAKRKD MDYSIVQTKG
YQDSALCCLG RDCKVVDSVL KGCEKTACLT TCATGYKKTG YFFSEDGPCC EAKTSCPTTC
PAGQQRTSAQ YSEDGACCVS KTSCPATCPT GQKRTSVQYY EDGACCTGKT ACPTTCPTGQ
QRTSVQYSED GACCVTKTAC PATCPTGQER TSMQYSEDGA CCQSKSCGSG QTLVGGVCKT
ACPATCPSGQ ERTSSGYSED GVCCQTKTCT TGQTLVNGVC KTACPATCPS GQERTSSGYS
EDGACCKTKS CPSGQYLTNG ICCLNAQVSK DGKTCIYLYK PEVIKVGILV DCHNNYSFVD
KKKTQCYRTG APRYFEGGKL PYVAAGGGCT AHYSYYNGGN WWEGGIAHGT PSDCNNSISD
QQACDNNCKG ATCSFKCIKT KSYGDRCGTY KCSNGMHCDN AEGSGTMLRC VRK