Gene Emin_0933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0933 
Symbol 
ID6262599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1036165 
End bp1037748 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content47% 
IMG OID642611412 
Producthypothetical protein 
Protein accessionYP_001875823 
Protein GI187251341 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4968] Tfp pilus assembly protein PilE 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000356044 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000476431 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAG GGTTCACCTT AATAGAAATA GCTGTGGTAG TTTTAGTAAT AGCTATTTTA 
GCGGCTATAG CTTTGCCGCA GTACAGAAAG TCGTTAGAAC GCTCAAGAGC TGCCGAAGCT
TTTGACATCC TTACAGAGAT AAGGAACAAA CAAGAATCCA GGGATTTACT TGGCACGGGC
ACCGCCAAAG GCTATACTGT AAAGTTTAGC GATTTGGGGG AAGTAATAGC GGGCAAAACC
TCCACAACAA ACACGTTAGA CACAAAACTT TTTTCATATG TTCTTTCAGA TAATCCTTAC
CCTCAGGCGT ACGCCAAAAG AAAAGATTTA GATTATTCAA TAGTTCAAAC CAAAGGCTAC
CAAGACAGCG CATTGTGTTG TATAGGCAAA GACTGCGATA TGGTAGATAA TGTTTTAAAA
GGGTGTGAGA AGACAGCGTG TCCTACAACA TGCGCGGCTG GATATAAAAG AACAGGGTAC
TTTTTTTCGG AAGACGGGCC TTGCTGTGAA GCTAAAACGT CGTGCCCGAC AACATGTCCT
ACGGGACAAA AAAGAAGCAG CGTGCAGTAT ACGGAAGATG GAGCGTGCTG CGTAGCCAAA
ACATCATGCC CGACAACATG TCCTACAGGC CAGCAGAGGA CAAGCGTACA ATATAGTGAA
GACGGAGCAT GCTGCGTATC AAAAACGTCG TGTCCCGCAA CATGTCCTAC AGGTCAAAAG
AGAACAAGTG CCCAATATTA TGAAGACGGA GCATGCTGCG TATCAAAAAC AGCGTGTCCT
ACAACGTGTC CTACAGGACA GCAGAGGACA AGCGTACAAT ATTATGAAGA CGGAGCCTGC
TGCGTAACAA AAACAGCGTG TCCCGCAACA TGTCCTACGG GTCAAAAAAG AACAAGTGCC
CAATATTATG AAGACGGAGC ATGCTGCACG GCTAAAACAG CGTGTCCTGC GACATGTCCT
ACGGGCCAGG AAAGAACAAG CGTGCAATAC AGTGAAGACG GAGCGTGTTG CCAAACCAAA
ACCTGCGGCA GCGGACAAAC CCTTGTTGGT GGAGTATGTA AAACAGCGTG TCCCGCCACA
TGCCCCACGG GCCAGGAAAG GACATCCGCG CAATACAGTG AAGACGGAGC GTGCTGCAAA
GCAAAAACAT GTCCCACAGG GCAAACCCTT GTTGGTGGAG TATGTAAAAC AGCGTGTCCC
GCCACATGTC CCACGGGCCA GGAAAGGACA TCCGCGCAAT ACAGTGAAGA CGGAGCGTGC
TGCAAAGCAA AAACATGTCC CACAGGGCAA ACCCTTGTTG GTGGAGTATG CAAAACATCA
TGTCCCATTA CATGTCCTAT AGGCCAGCAG AGGACAACCG TCCAATATAC CGAAGACGGA
GCATGCTGTA AAAGTAAACC ATGCCCGTTA GTCCCGCAAT CGATAATTGA TGATTGTAAC
AATGCTTCTT ATGCCGTGTG GAACGAAAGT GAATGCGGTT GCAGATGTTG TAATAAGCAA
CATGGGAATC CATATTTAGA ACCTTCTACA GGTATTGTAA GATGTCTCCC CACAGGTGTA
GCAGCATATT TATGTGCGGT GTAA
 
Protein sequence
MKKGFTLIEI AVVVLVIAIL AAIALPQYRK SLERSRAAEA FDILTEIRNK QESRDLLGTG 
TAKGYTVKFS DLGEVIAGKT STTNTLDTKL FSYVLSDNPY PQAYAKRKDL DYSIVQTKGY
QDSALCCIGK DCDMVDNVLK GCEKTACPTT CAAGYKRTGY FFSEDGPCCE AKTSCPTTCP
TGQKRSSVQY TEDGACCVAK TSCPTTCPTG QQRTSVQYSE DGACCVSKTS CPATCPTGQK
RTSAQYYEDG ACCVSKTACP TTCPTGQQRT SVQYYEDGAC CVTKTACPAT CPTGQKRTSA
QYYEDGACCT AKTACPATCP TGQERTSVQY SEDGACCQTK TCGSGQTLVG GVCKTACPAT
CPTGQERTSA QYSEDGACCK AKTCPTGQTL VGGVCKTACP ATCPTGQERT SAQYSEDGAC
CKAKTCPTGQ TLVGGVCKTS CPITCPIGQQ RTTVQYTEDG ACCKSKPCPL VPQSIIDDCN
NASYAVWNES ECGCRCCNKQ HGNPYLEPST GIVRCLPTGV AAYLCAV