Gene Emin_0699 details


Gene Information

Locus tag: Emin_0699
Symbol:
ID: 6263622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 774092
End bp: 775783
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 42%
IMG OID: 642611171
Product: TPR repeat-containing protein
Protein accession: YP_001875591
Protein GI: 187251109
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
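
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes exactly one terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 774092
END_BP = 775783
GENE_LENGTH_BP = 1692
PROTEIN_LENGTH_AA = 563


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for the values listed in this record.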


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000000888687
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTAAAAA AAACTTTTAC TTTTATTTTC CTTTGTCTTT TTTGCGTGTG TTGCTTTGGG 
CAAAACCTTA TAGGCCGCAA AAAAGAACTT AACCCCGACT ACTCCAAATC AGTTTGGGAT
AATGTTATGG AAACCCAGTC TCTTAACGAC GTACGCAAAG GGTTCTATTT CATGTCCGTG
GCAAAATATG AGGACGCGGT TACCGCGTTT GCAAAAGCTG TAGTAAAAAA CCCCAAAGAA
GCAAATTATT ATCTTTTTTT AGGCCGAGCG CTTTATTGGT CCGGTAAAGT AGATTCCGCT
ATGGCTGAGT TTCGCACCGC TATGGAAATC AACCCTAAAA ATGGCGATGC TTACCAGCTT
TTAGGTATAG GTTACGGCTG GAAAGGCGAT ATCAGGCAAG CCCAAAAGAA TTTTGAGAAA
GCGGAAAGAC TTATGCCAAA CAGGCCTGAC GTAAAAATGA ATTTAAGTTC CGTTTACGCA
AGCCAAAACA AATTGGAGCT GGCGCTTGAT TATATAAGAA TGGCGGTGGC GTTGTCGCCG
AAAGATCCGC TTCTTTACCA CCAGTTAGGG CTTATAAGCG AAATGCTTGG GCGGGACTCC
TCAGCTGAAG AGGCTTTTAA AACCTCAATT AAACTCTATC CCCGCTACGA GGATTCCATG
CTGGCTCTGG CAGCTACTTA TGAAAAGCGA AATGATGATA AAGACGCTTT GTCTTACTAT
AAAAAAGCCT TAAAAATTAA ACCTGAGGAT TATGTGGCAA GGCTGCGTTA CGCTAATTTA
CTTTTCACGT CGGCTTTTGA AAAAGAAGCA AAAGAAGTTG TTGAGAAAGC TTTTTCAATC
AGTTCACGCG AAGGCAAAGG TCTTGCGTTT AATGTTTCCT ACAGCGCGGT GCAGAACCAA
ACTGCCCAAT CCTCTTTTCC TCCCGAGCTT TCCGCTTTAA AAAAAGTGCT TGAAGAAACT
GATTTGGCAG ATGATATTAT TGTTGACGCG GAGGTAAACT ATTCTTTACC GCACGAATTT
AAAGAAGCAA AAGAAAAAAG CCTTCTTGAA CGTGAAATGC TGCGCGCTTT GGAGCAATCC
CGCGCCGCCG CTGCCGGTTC CCAGACGTTC CGCCGCAGTT TTATTATAAA CGGCGCTAAT
AAAGAAGAGC GTAATATACA GATAGAAACT ATTATAAACA CCCTTAGCGA GGCGCTTCAA
AACTCGCCTG AAAATACACA GTCAAAACTA AATGTAAAAA CTGAAAACGC AAAAAAAACC
GTGCCTGTGG AAGGAACAGG GGGGAACAGT TCCCAAAAAA CCGCTTATGA CCCTAGAAAT
GTAGGTAATG ACATGGGGCT TTGGGTAGCG GGTAAAAGCT GGGTTCGTTT TGTAGCCGAA
ACTTTGCCTG ATATAGAAAA CCGTATTTTT GATAAGGAAC AGCAAAAGAT GGAGCCGGAT
TCTTTTGATA ATGTTTTAAT GGGGTTGGCC TATCTTACTT TAGGTAAAGG AAATGAGGCG
TTAAATTATT TTGACGACGC CTTGAAAACA GAGCCCGCAA ATGAACTTGC TCTTTTAGGC
AAAGGAACCG CATGGATAGT TTTAGGACGT GAAGACAATG CGCAAAATAT ATATAGGCAA
GTTTTGGAAA TTAACCCTAA AAACAAAACG GCTAAAAAAA ATCTTACATT TTTAGAAAAA
AGAGGAGCCT GA
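
The GC content reported in Gene Information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in as displayed here (blocks of ten bases separated by spaces and line breaks). The helper names `clean_sequence` and `gc_percent` are hypothetical, and the way the database rounds its reported percentage is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Only the first displayed row of the gene sequence, as a short example;
    # paste the full sequence to compare against the reported GC content.
    first_row = "ATGTTAAAAA AAACTTTTAC TTTTATTTTC CTTTGTCTTT TTTGCGTGTG TTGCTTTGGG"
    print(f"{gc_percent(first_row):.1f}%")
```

A single row is not representative of the whole gene, so its value should not be expected to match the gene-wide figure.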
 
Protein sequence
MLKKTFTFIF LCLFCVCCFG QNLIGRKKEL NPDYSKSVWD NVMETQSLND VRKGFYFMSV 
AKYEDAVTAF AKAVVKNPKE ANYYLFLGRA LYWSGKVDSA MAEFRTAMEI NPKNGDAYQL
LGIGYGWKGD IRQAQKNFEK AERLMPNRPD VKMNLSSVYA SQNKLELALD YIRMAVALSP
KDPLLYHQLG LISEMLGRDS SAEEAFKTSI KLYPRYEDSM LALAATYEKR NDDKDALSYY
KKALKIKPED YVARLRYANL LFTSAFEKEA KEVVEKAFSI SSREGKGLAF NVSYSAVQNQ
TAQSSFPPEL SALKKVLEET DLADDIIVDA EVNYSLPHEF KEAKEKSLLE REMLRALEQS
RAAAAGSQTF RRSFIINGAN KEERNIQIET IINTLSEALQ NSPENTQSKL NVKTENAKKT
VPVEGTGGNS SQKTAYDPRN VGNDMGLWVA GKSWVRFVAE TLPDIENRIF DKEQQKMEPD
SFDNVLMGLA YLTLGKGNEA LNYFDDALKT EPANELALLG KGTAWIVLGR EDNAQNIYRQ
VLEINPKNKT AKKNLTFLEK RGA
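
The protein sequence corresponds to a codon-by-codon translation of the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below illustrates this on the first and last few codons of the gene. It builds the codon table from the compact NCBI-style listing; for table 11 the amino-acid assignments are the same as in the standard code, and the two tables differ only in which codons may act as start codons. The function `translate` is a hypothetical helper written for this example, not a library call.

```python
from itertools import product

# Amino acids in NCBI order: bases iterate as T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence in frame 1; '*' marks a stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    # First three blocks and the final line of the gene sequence in this record.
    print(translate("ATGTTAAAAA AAACTTTTAC TTTTATTTTC"))  # MLKKTFTFIF
    print(translate("AGAGGAGCCT GA"))                      # RGA*
```

The two printed fragments match the first ten and last three residues of the protein sequence listed above, with the final TGA read as the stop codon.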