Gene Information |
Locus tag | Emin_0933 |
Symbol | |
ID | 6262599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 1036165 |
End bp | 1037748 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642611412 |
Product | hypothetical protein |
Protein accession | YP_001875823 |
Protein GI | 187251341 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4968] Tfp pilus assembly protein PilE |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
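The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the record does not state either assumption explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1036165, End bp 1037748
length = gene_length(1036165, 1037748)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the listed Gene Length (1584 bp) and Protein Length (527 aa).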
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000356044 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000000000476431 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAG GGTTCACCTT AATAGAAATA GCTGTGGTAG TTTTAGTAAT AGCTATTTTA GCGGCTATAG CTTTGCCGCA GTACAGAAAG TCGTTAGAAC GCTCAAGAGC TGCCGAAGCT TTTGACATCC TTACAGAGAT AAGGAACAAA CAAGAATCCA GGGATTTACT TGGCACGGGC ACCGCCAAAG GCTATACTGT AAAGTTTAGC GATTTGGGGG AAGTAATAGC GGGCAAAACC TCCACAACAA ACACGTTAGA CACAAAACTT TTTTCATATG TTCTTTCAGA TAATCCTTAC CCTCAGGCGT ACGCCAAAAG AAAAGATTTA GATTATTCAA TAGTTCAAAC CAAAGGCTAC CAAGACAGCG CATTGTGTTG TATAGGCAAA GACTGCGATA TGGTAGATAA TGTTTTAAAA GGGTGTGAGA AGACAGCGTG TCCTACAACA TGCGCGGCTG GATATAAAAG AACAGGGTAC TTTTTTTCGG AAGACGGGCC TTGCTGTGAA GCTAAAACGT CGTGCCCGAC AACATGTCCT ACGGGACAAA AAAGAAGCAG CGTGCAGTAT ACGGAAGATG GAGCGTGCTG CGTAGCCAAA ACATCATGCC CGACAACATG TCCTACAGGC CAGCAGAGGA CAAGCGTACA ATATAGTGAA GACGGAGCAT GCTGCGTATC AAAAACGTCG TGTCCCGCAA CATGTCCTAC AGGTCAAAAG AGAACAAGTG CCCAATATTA TGAAGACGGA GCATGCTGCG TATCAAAAAC AGCGTGTCCT ACAACGTGTC CTACAGGACA GCAGAGGACA AGCGTACAAT ATTATGAAGA CGGAGCCTGC TGCGTAACAA AAACAGCGTG TCCCGCAACA TGTCCTACGG GTCAAAAAAG AACAAGTGCC CAATATTATG AAGACGGAGC ATGCTGCACG GCTAAAACAG CGTGTCCTGC GACATGTCCT ACGGGCCAGG AAAGAACAAG CGTGCAATAC AGTGAAGACG GAGCGTGTTG CCAAACCAAA ACCTGCGGCA GCGGACAAAC CCTTGTTGGT GGAGTATGTA AAACAGCGTG TCCCGCCACA TGCCCCACGG GCCAGGAAAG GACATCCGCG CAATACAGTG AAGACGGAGC GTGCTGCAAA GCAAAAACAT GTCCCACAGG GCAAACCCTT GTTGGTGGAG TATGTAAAAC AGCGTGTCCC GCCACATGTC CCACGGGCCA GGAAAGGACA TCCGCGCAAT ACAGTGAAGA CGGAGCGTGC TGCAAAGCAA AAACATGTCC CACAGGGCAA ACCCTTGTTG GTGGAGTATG CAAAACATCA TGTCCCATTA CATGTCCTAT AGGCCAGCAG AGGACAACCG TCCAATATAC CGAAGACGGA GCATGCTGTA AAAGTAAACC ATGCCCGTTA GTCCCGCAAT CGATAATTGA TGATTGTAAC AATGCTTCTT ATGCCGTGTG GAACGAAAGT GAATGCGGTT GCAGATGTTG TAATAAGCAA CATGGGAATC CATATTTAGA ACCTTCTACA GGTATTGTAA GATGTCTCCC CACAGGTGTA GCAGCATATT TATGTGCGGT GTAA
|
Protein sequence | MKKGFTLIEI AVVVLVIAIL AAIALPQYRK SLERSRAAEA FDILTEIRNK QESRDLLGTG TAKGYTVKFS DLGEVIAGKT STTNTLDTKL FSYVLSDNPY PQAYAKRKDL DYSIVQTKGY QDSALCCIGK DCDMVDNVLK GCEKTACPTT CAAGYKRTGY FFSEDGPCCE AKTSCPTTCP TGQKRSSVQY TEDGACCVAK TSCPTTCPTG QQRTSVQYSE DGACCVSKTS CPATCPTGQK RTSAQYYEDG ACCVSKTACP TTCPTGQQRT SVQYYEDGAC CVTKTACPAT CPTGQKRTSA QYYEDGACCT AKTACPATCP TGQERTSVQY SEDGACCQTK TCGSGQTLVG GVCKTACPAT CPTGQERTSA QYSEDGACCK AKTCPTGQTL VGGVCKTACP ATCPTGQERT SAQYSEDGAC CKAKTCPTGQ TLVGGVCKTS CPITCPIGQQ RTTVQYTEDG ACCKSKPCPL VPQSIIDDCN NASYAVWNES ECGCRCCNKQ HGNPYLEPST GIVRCLPTGV AAYLCAV
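The relationship between the two sequences can be sketched with a small translation helper. This is a minimal illustration, not the database's own method: it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and it uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code only in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above
print(translate("ATGAAAAAAG GGTTCACCTT AATAGAAATA"))
```

Passing the full gene sequence instead of the first three blocks would give a string to compare against the protein sequence listed above.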