Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0535 |
Symbol | |
ID | 6262726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 585831 |
End bp | 587735 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642611005 |
Product | hypothetical protein |
Protein accession | YP_001875427 |
Protein GI | 187250945 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4968] Tfp pilus assembly protein PilE |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000025366 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.911782 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG GGTTCACCTT AATAGAAATA GCTGTGGTAG TTTTAGTAAT AGCTATTTTA GCGGCTATAG CTTTGCCGCA GTACAGAAAG TCGTTAGAAC GCTCAAGAGC TGCCGAAGCT TTTGACATCC TTACAGAGAT AAGGAACAAA CAAGAAGCAA GAGACTTGCT TGGCACCGGC ACCGCCAAAG GCTATACTAT AAAGTTTAGC GATTTGGGGG AAGTAATAGC GGGCAAAACC TCCACAACAA ACACGTTAGA CACAAAACTT TTTTCATATG TTCTTTCAGA CAACCCCTAC CCTCAGGCGT ACGCCAAAAG AAAAGATTTA GATTATTCAA TAGTTCAAAC CAAAGGCTAC CAAGACAGCG CATTGTGTTG TATAGGTAAA GACTGCGATA TGGTAGATAA TGTTTTAAAA GGGTGTGAGA AGACAGCGTG TCCTACAACA TGCGCGGCTG GATATAAAAG AACAGGGTAC TTTTTTTCAG AAGACGGGCC TTGCTGTGAA GCCAAAACGT CGTGCCCGAC AACATGTCCT ACGGGACAAA AAAGAAGCAG CGTGCAGTAT ACGGAAGATG GAGCGTGCTG CGTAGCCAAA ACATCATGCC CGACAACATG TCCTACAGGC CAGCAGAGGA CAAGCGTACA ATATAGTGAA GACGGAGCAT GCTGCGTATC AAAAACGTCG TGTCCCGCAA CATGTCCTAC AGGTCAAAAG AGGACAAGCG TACAATACAG TGAAGACGGA GCATGTTGCA CGGCTAAAAC GGCTTGTCCC GCATCATGTC CTGCCGGAGA AGAAAGAACA AGCGTGCAAT ACAGTGAAGA CGGAGCGTGT TGCACAGCTG TAAAATGTGC TGATGATAAT AAAAGTCTTT ACTTAAACTC TTCAAACGGT TTTTGGTCTG ATACACTTTG TAAAGGTATT TGCTGCGGGG TGGGGTATTA TCCCGTTGAT AAAGGAACAT ATTTAACTTG TTATAACGGA GTTATGTACG GCAGTGAGGC GGTTTGCGAA GATAACCCAA TACCTTGCGG CAGCGGGCAA ATACTTGTTG GTGGAGTGTG TAAAACGGCG TGCCCTTCTA CGTGTCCCAC AGGCCAAAAG AGAACAAGTT CCCAGTATTC TGAAGACGGA GCGTGCTGCG TGGCTAAAAC AGCGTGTCCC GCCACATGTC CCACGGGCCA GGAAAGAACA AGCGTGCAAT ACAGTGAAGA CGGAGCGTGC TGCCAAACAA AAGCATGTCC CACAGGACAA ACCCTTGTTG GGGGAGTATG CAAAACAGCG TGTCCTGCGA CATGTCCTAC GGGCCAGGAA AGAACATCCA TGCAATACAG TGAAGACGGA GCGTGCTGCC AAACAAAAAC ATGTCCCACA GGACAAACCC TTGTTGGTGG AGTATGTAAA ACAGCGTGTC CCGCCACATG TCCCACGGGC CAGGAAAGGA CATCCGCGCA ATACAGTGAA GACGGAGCGT GCTGCAAAGC AAAAACATGT CCCACAGGGC AAACGCTTGT CGGAACTGTT TGTAAAACCA ATTGCCCGTC AGCCTGTCCG ACAGGTTATG CAAGAAGTTT GGTTAATTAT ACTGAAGACG GCGCCTGCTG TAAGCCTCAG GTAGTGGCAT ATAATTGCCA ACCTGTTCCG GGAGCCGGTT TAACTGGCGG ACCCGGTGAC ATTTGGATAG GAGCCACTAT GGCAAACAAT GGAACAGCTA CTTCAAGTCA TGTTGTTTCA ACCAGAGTTG ATTACCAAAC AGGCTCCGGT TATTCAGGCT CAGCATATCA GGATATTGTA ATACCCCAGG GAGCTAAGTA CGGAATTTTG GAATTTTCTG CTTACACGCC TTCAGGAGAC ACGGGTGTTA CAAATTGTAC AGTTACTGTG ATTTCAGTTA ACTAG
|
Protein sequence | MKKGFTLIEI AVVVLVIAIL AAIALPQYRK SLERSRAAEA FDILTEIRNK QEARDLLGTG TAKGYTIKFS DLGEVIAGKT STTNTLDTKL FSYVLSDNPY PQAYAKRKDL DYSIVQTKGY QDSALCCIGK DCDMVDNVLK GCEKTACPTT CAAGYKRTGY FFSEDGPCCE AKTSCPTTCP TGQKRSSVQY TEDGACCVAK TSCPTTCPTG QQRTSVQYSE DGACCVSKTS CPATCPTGQK RTSVQYSEDG ACCTAKTACP ASCPAGEERT SVQYSEDGAC CTAVKCADDN KSLYLNSSNG FWSDTLCKGI CCGVGYYPVD KGTYLTCYNG VMYGSEAVCE DNPIPCGSGQ ILVGGVCKTA CPSTCPTGQK RTSSQYSEDG ACCVAKTACP ATCPTGQERT SVQYSEDGAC CQTKACPTGQ TLVGGVCKTA CPATCPTGQE RTSMQYSEDG ACCQTKTCPT GQTLVGGVCK TACPATCPTG QERTSAQYSE DGACCKAKTC PTGQTLVGTV CKTNCPSACP TGYARSLVNY TEDGACCKPQ VVAYNCQPVP GAGLTGGPGD IWIGATMANN GTATSSHVVS TRVDYQTGSG YSGSAYQDIV IPQGAKYGIL EFSAYTPSGD TGVTNCTVTV ISVN
|
| |