Gene Emin_0535 details


Gene Information

Locus tag: Emin_0535
Symbol:
ID: 6262726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 585831
End bp: 587735
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 47%
IMG OID: 642611005
Product: hypothetical protein
Protein accession: YP_001875427
Protein GI: 187250945
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4968] Tfp pilus assembly protein PilE
TIGRFAM ID: [TIGR02532] prepilin-type N-terminal cleavage/methylation domain
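
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for Emin_0535.
length = gene_length_bp(585831, 587735)
print(length, protein_length_aa(length))
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of the record.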


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000025366
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.911782
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAG GGTTCACCTT AATAGAAATA GCTGTGGTAG TTTTAGTAAT AGCTATTTTA 
GCGGCTATAG CTTTGCCGCA GTACAGAAAG TCGTTAGAAC GCTCAAGAGC TGCCGAAGCT
TTTGACATCC TTACAGAGAT AAGGAACAAA CAAGAAGCAA GAGACTTGCT TGGCACCGGC
ACCGCCAAAG GCTATACTAT AAAGTTTAGC GATTTGGGGG AAGTAATAGC GGGCAAAACC
TCCACAACAA ACACGTTAGA CACAAAACTT TTTTCATATG TTCTTTCAGA CAACCCCTAC
CCTCAGGCGT ACGCCAAAAG AAAAGATTTA GATTATTCAA TAGTTCAAAC CAAAGGCTAC
CAAGACAGCG CATTGTGTTG TATAGGTAAA GACTGCGATA TGGTAGATAA TGTTTTAAAA
GGGTGTGAGA AGACAGCGTG TCCTACAACA TGCGCGGCTG GATATAAAAG AACAGGGTAC
TTTTTTTCAG AAGACGGGCC TTGCTGTGAA GCCAAAACGT CGTGCCCGAC AACATGTCCT
ACGGGACAAA AAAGAAGCAG CGTGCAGTAT ACGGAAGATG GAGCGTGCTG CGTAGCCAAA
ACATCATGCC CGACAACATG TCCTACAGGC CAGCAGAGGA CAAGCGTACA ATATAGTGAA
GACGGAGCAT GCTGCGTATC AAAAACGTCG TGTCCCGCAA CATGTCCTAC AGGTCAAAAG
AGGACAAGCG TACAATACAG TGAAGACGGA GCATGTTGCA CGGCTAAAAC GGCTTGTCCC
GCATCATGTC CTGCCGGAGA AGAAAGAACA AGCGTGCAAT ACAGTGAAGA CGGAGCGTGT
TGCACAGCTG TAAAATGTGC TGATGATAAT AAAAGTCTTT ACTTAAACTC TTCAAACGGT
TTTTGGTCTG ATACACTTTG TAAAGGTATT TGCTGCGGGG TGGGGTATTA TCCCGTTGAT
AAAGGAACAT ATTTAACTTG TTATAACGGA GTTATGTACG GCAGTGAGGC GGTTTGCGAA
GATAACCCAA TACCTTGCGG CAGCGGGCAA ATACTTGTTG GTGGAGTGTG TAAAACGGCG
TGCCCTTCTA CGTGTCCCAC AGGCCAAAAG AGAACAAGTT CCCAGTATTC TGAAGACGGA
GCGTGCTGCG TGGCTAAAAC AGCGTGTCCC GCCACATGTC CCACGGGCCA GGAAAGAACA
AGCGTGCAAT ACAGTGAAGA CGGAGCGTGC TGCCAAACAA AAGCATGTCC CACAGGACAA
ACCCTTGTTG GGGGAGTATG CAAAACAGCG TGTCCTGCGA CATGTCCTAC GGGCCAGGAA
AGAACATCCA TGCAATACAG TGAAGACGGA GCGTGCTGCC AAACAAAAAC ATGTCCCACA
GGACAAACCC TTGTTGGTGG AGTATGTAAA ACAGCGTGTC CCGCCACATG TCCCACGGGC
CAGGAAAGGA CATCCGCGCA ATACAGTGAA GACGGAGCGT GCTGCAAAGC AAAAACATGT
CCCACAGGGC AAACGCTTGT CGGAACTGTT TGTAAAACCA ATTGCCCGTC AGCCTGTCCG
ACAGGTTATG CAAGAAGTTT GGTTAATTAT ACTGAAGACG GCGCCTGCTG TAAGCCTCAG
GTAGTGGCAT ATAATTGCCA ACCTGTTCCG GGAGCCGGTT TAACTGGCGG ACCCGGTGAC
ATTTGGATAG GAGCCACTAT GGCAAACAAT GGAACAGCTA CTTCAAGTCA TGTTGTTTCA
ACCAGAGTTG ATTACCAAAC AGGCTCCGGT TATTCAGGCT CAGCATATCA GGATATTGTA
ATACCCCAGG GAGCTAAGTA CGGAATTTTG GAATTTTCTG CTTACACGCC TTCAGGAGAC
ACGGGTGTTA CAAATTGTAC AGTTACTGTG ATTTCAGTTA ACTAG
 
Protein sequence
MKKGFTLIEI AVVVLVIAIL AAIALPQYRK SLERSRAAEA FDILTEIRNK QEARDLLGTG 
TAKGYTIKFS DLGEVIAGKT STTNTLDTKL FSYVLSDNPY PQAYAKRKDL DYSIVQTKGY
QDSALCCIGK DCDMVDNVLK GCEKTACPTT CAAGYKRTGY FFSEDGPCCE AKTSCPTTCP
TGQKRSSVQY TEDGACCVAK TSCPTTCPTG QQRTSVQYSE DGACCVSKTS CPATCPTGQK
RTSVQYSEDG ACCTAKTACP ASCPAGEERT SVQYSEDGAC CTAVKCADDN KSLYLNSSNG
FWSDTLCKGI CCGVGYYPVD KGTYLTCYNG VMYGSEAVCE DNPIPCGSGQ ILVGGVCKTA
CPSTCPTGQK RTSSQYSEDG ACCVAKTACP ATCPTGQERT SVQYSEDGAC CQTKACPTGQ
TLVGGVCKTA CPATCPTGQE RTSMQYSEDG ACCQTKTCPT GQTLVGGVCK TACPATCPTG
QERTSAQYSE DGACCKAKTC PTGQTLVGTV CKTNCPSACP TGYARSLVNY TEDGACCKPQ
VVAYNCQPVP GAGLTGGPGD IWIGATMANN GTATSSHVVS TRVDYQTGSG YSGSAYQDIV
IPQGAKYGIL EFSAYTPSGD TGVTNCTVTV ISVN