Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1469 |
Symbol | |
ID | 4069619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1777433 |
End bp | 1778485 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983478 |
Product | type IV pilus assembly protein PilM |
Protein accession | YP_590545 |
Protein GI | 94968497 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4972] Tfp pilus assembly protein, ATPase PilM |
TIGRFAM ID | [TIGR01175] type IV pilus assembly protein PilM |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.11937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.330674 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGGAC TGGGAACAGC AAAGAGCATA GTTGGCCTGG ATATCGGATC CAGCAGCATC AAGGCCGTGG AGTTAAAGAA GTCGCGCAAT GGCGTAGAAG TGGCGCACAT GGCCATGGAG CCCCTGTCGT CTGACATCGT CGTGGACTCG ATGATTGTGG ACAGCGGCAG CGTCGCCAGC GCAATTACCA AGATCTTTAC GGAGTCGGGC ATCAAGACTC GTGCGGTAGC GACCTCGGTC AGCGGACACT CCGTGATCGT GAAGCGCATC CCGATGTCGA CGATGAGCGA CTCTGAACTT TCCGGCATCA TCCAGACCGA AGCCGCGCAA CACATCCCGT TCGATATCTC GGACGTCAGC ATTGACTACC AGATCCTTTC CGACACCGGT GGCTCGACGA TGGACGTCCT GCTGGTCGCG GTGAAGAAAG ACAAAATTCT TAACTACACG AACGTTCTGT CGCTCGCCGG CAAGTCTCCG GCGGTGGTGG ACATCGACGC GTTCGCCCTC CAGAACTGCT ACGAATACAA CTATCAACCC GGTCCGGGCG CGACAGTTGC GTTGTTGAAT CTCGGCGCCA GCGTAATGAA CATCAACATC GTGAAGGGCA CCACACCCCT GTTCACGCGC GATGTGAGCG TCGGCGGCCA CCAATACACC GATTCGTTGC AGAAGGAACT GGATCTCAGC TTTGAAGACG CGGAAGCGCT GAAGCTCGGT AAGAAAGTGG GCACAGTCAG CGAAGACGCG AAGATGCCGA TCCTCCAGCA AGTGACCGAA ATCATCGTGC TGGAAATTCA GAAGACTTTC GACTTCTTCC GCGCTACCGC GACGGGAGAG CACATTGAGC GCATTTACCT CGCGGGCGGT TCGTCGCAGG TGCCGGGCCT GATTGAAGGC CTGCGCCAGG AGTTCTCGCT CCCAGTCGAG ATCCTCAATC CATTCCAGCG CATTGAACCG CCTCTTGGCA CGGGCGCGGA TCTCGCCGAC AAGAACGCCG GCCAGATGGC AGTTGCCGTG GGACTCGCCC TTAGGAGTTT TGACGAATTA TGA
|
Protein sequence | MFGLGTAKSI VGLDIGSSSI KAVELKKSRN GVEVAHMAME PLSSDIVVDS MIVDSGSVAS AITKIFTESG IKTRAVATSV SGHSVIVKRI PMSTMSDSEL SGIIQTEAAQ HIPFDISDVS IDYQILSDTG GSTMDVLLVA VKKDKILNYT NVLSLAGKSP AVVDIDAFAL QNCYEYNYQP GPGATVALLN LGASVMNINI VKGTTPLFTR DVSVGGHQYT DSLQKELDLS FEDAEALKLG KKVGTVSEDA KMPILQQVTE IIVLEIQKTF DFFRATATGE HIERIYLAGG SSQVPGLIEG LRQEFSLPVE ILNPFQRIEP PLGTGADLAD KNAGQMAVAV GLALRSFDEL
|
| |