Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0681 |
Symbol | |
ID | 6263350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 753871 |
End bp | 755574 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642611152 |
Product | hypothetical protein |
Protein accession | YP_001875573 |
Protein GI | 187251091 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2165] Type II secretory pathway, pseudopilin PulG |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00000000227256 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCAAA AAAAAGGGTT CACCTTAATA GAAATAGCTG TGGTAGTTTT AGTAATAGCT ATTTTAGCGG CTATAGCTTT GCCGCAGTAC AGAAAGTCGT TAGAACGCTC AAGAGCTGCC GAAGCTTTTG ACATCCTTAC AGAGATAAGG AACAAACAAG AATCCAGGGA TTTACTTGGC ACGGGCACCG CCAAAGGCTA TACTGTAAAG TTTAGCGATT TGGGGGAAGT AATAGCGGGC AAAACCTCCA CAACAAACAC GTTAGACACA AAACTTTTTT CATATGTTCT TTCAGATAAT CCTTACCCTC AGGCGTACGC CAAAAGAAAA GATTTAGATT ATTCAATAGT TCAAACCAAA GGCTACCAAG ACAGCGCATT GTGTTGTATA GGCAAAGACT GCGATATGGT AGATAATGTT TTAAAAGGGT GTGAGAAGAC AGCGTGTCCT ACAACATGCG CGGCTGGATA TAAAAGAACA GGGTACTTTT TTTCGGAAGA CGGGCCTTGC TGTGAAGCTA AAACGTCGTG TCCCGCAACA TGTCCTGCAG GGCAAAAAAG AAGCAGCGTG CAGTATACGG AAGATGGAGC GTGCTGCGTA GCCAAAACAT CATGCCCGAC AACATGTCCT ACAGGCCAGC AGAGGACAAG CGTACAATAT AGTGAAGACG GAGCGTGCTG CGTAGCCAAA ACATCATGCC CGACAACATG TCCTACAGGC CAGCAGAGGA CAAGCGTACA ATATAGTGAA GACGGAGCAT GCTGCGTATC AAAAACGTCG TGTCCCGCAA CATGTCCTAC AGGTCAAAAG AGGACAAGCG TACAATACAG TGAAGACGGA GCATGTTGCA CGGCTAAAAC GGCTTGTCCC GCATCATGTC CTGCCGGAGA AGAAAGAACA AGCGTGCAAT ACAGTGAAGA CGGAGCGTGT TGCACGGCTA AAACGGCTTG TCCCGCGACA TGTCCCACGG GCCAGGAAAG AACAAGCGTA CAATACAGTG AAGATGGAGC GTGCTGCCAA ACAAAAACCT GCGGCAGCGG ACAAACCCTT GTTGGGGGAG TATGTAAAAC AGCGTGTCCT GCTACATGTC CTACAGGTCA AAAGAGAACA AGTTCCCAGT ATTCGGAAGA TGGAGCGTGC TGCGTGGCTA AAACAGCGTG CCCCGCTACA TGTCCTACGG GCCAGGAAAG AACAAGCGTG CAATACAGTG AAGACGGCGA CTGTTGTAAA ACTTCAAGCG GATGTCCTGT CGGTACTTCA ATAGGAGCTA ATGGAAAATG TTGTACTCCT GAATTAATGG GTATTGACGA ACGGGACGGA GGCTGTTGCG TAGAATTTTC AAACTGCGGT CTTACCGGAC CCGGAGGACT TGCCTTTCCT TGCAGATGTG CAAGAACAGG CACAGGCACT ATATCTTGTT CTGGTAACAG TACGCAATCC TGCGGTCAGT GCGGCACCCA AACAAGAACA TGTAATACTT CAACAGGAGT ATGGAGCTCA TGGAGTTCGT GCAATGAATC TCCAAATCCT TTAAGCCCTT CGGACCAGGA TATGTGCTTA AGCTGCGGCG GTACATTAAC ATGTAAAGGC TGCGATTGCG GAACTTATTA CGACTTTAGC AGCGGCTCGG GTATTTTATC ATATTGCTCT TTTAACAGCC GTTACGGATG CGTATGCGGA CAGACGTACA GATCATGCAG ATAA
|
Protein sequence | MSQKKGFTLI EIAVVVLVIA ILAAIALPQY RKSLERSRAA EAFDILTEIR NKQESRDLLG TGTAKGYTVK FSDLGEVIAG KTSTTNTLDT KLFSYVLSDN PYPQAYAKRK DLDYSIVQTK GYQDSALCCI GKDCDMVDNV LKGCEKTACP TTCAAGYKRT GYFFSEDGPC CEAKTSCPAT CPAGQKRSSV QYTEDGACCV AKTSCPTTCP TGQQRTSVQY SEDGACCVAK TSCPTTCPTG QQRTSVQYSE DGACCVSKTS CPATCPTGQK RTSVQYSEDG ACCTAKTACP ASCPAGEERT SVQYSEDGAC CTAKTACPAT CPTGQERTSV QYSEDGACCQ TKTCGSGQTL VGGVCKTACP ATCPTGQKRT SSQYSEDGAC CVAKTACPAT CPTGQERTSV QYSEDGDCCK TSSGCPVGTS IGANGKCCTP ELMGIDERDG GCCVEFSNCG LTGPGGLAFP CRCARTGTGT ISCSGNSTQS CGQCGTQTRT CNTSTGVWSS WSSCNESPNP LSPSDQDMCL SCGGTLTCKG CDCGTYYDFS SGSGILSYCS FNSRYGCVCG QTYRSCR
|
| |