Gene Information |
Locus tag | Cthe_2686 |
Symbol | |
ID | 4808858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3170039 |
End bp | 3171142 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108105 |
Product | type IV pilus assembly protein PilM |
Protein accession | YP_001039078 |
Protein GI | 125975168 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4972] Tfp pilus assembly protein, ATPase PilM |
TIGRFAM ID | [TIGR01174] cell division protein FtsA [TIGR01175] type IV pilus assembly protein PilM |
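
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 3170039
end_bp = 3171142
gene_length_bp = 1104
protein_length_aa = 367


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for the values in this record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions, the span 3170039 to 3171142 gives 1104 bp, and 1104 / 3 - 1 gives 367 aa, which agrees with the Gene Length and Protein Length fields.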
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000584895 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAACAA TTCCTTTCTT GAAAACCAAC CTTCTGAGTA TTGACATCGG TTTTAGGAAC ATAAAAATTG TTGAAGTGGA GCTAGGCAGG AATAATGAGA TTTTTATTAA AAATTTCGGT ATAGCTTCTA CTCCAAAAGG GTCTATCAAA AACGGGGCTA TTAAAGATGT CAAGAGTGTT ACCAACGAGA TAAGGAAGGT AATGGAGAAC ATAAACACAA AGGCAAAGAA TGCAAAGATT GTTATGTCAG GTACGAATAT TATTTCCCGT GTTTTTGTTG TTGAAAAGAT CCCCGGGGAA GATATGAATC ATCTTGTCAG AACGACTATT TCCCAAAGTA TGCCGATAGA CCTTGATGCC CATCAGATAG ATTACAAGGT GTTGCAGGAA TTCAGAGAGG ATGGAATTGA TAAAATAAAG GTATTTGTTA CCGCTGTTTT AAAAAGTATT ATACAAAGTT ATATTGACAT TTTGATAGAA TTGGGACTCA AACCCATATC AGTGGATATT CCCGCCAACA GCGCGGCAAA GTTTTTCAAC AGGGAGATTA TGGTTTCCGA GAGTGAAACG TGGTTTAAAA GGCAAAGATC CAGCAAGCTT AGCCAGAATA CTTTTGCCGT TATTGATTTT GGTTCTGAGA CTACAATAGT AAACATATTA AGGAACAGGG TTCTGGAGTT TAACAAAGTT ATTTTAAGGG GCAGCAGTAA TATTGACGAG GCCATCGCGG CAAGTACAGG CAAAAAGCTC GAAGAGGCTG AAAGAATTAA AAAGATTCAC GGACTTGCTC TTACTGATAT CAATGCCGAT GAAGAACAGG AGAAAATTTA CAACAGTATC AAATCCGTTA TTGACGATAT AATACGGCAG ATGTTTCAAT GTTTTGAGTT TTATGAAAAA AGATGTTACG GCGAGAAAAT AGGAAAGATT TACATGATAG GCGGAGGATC GCAGTTAAAA GGACTTAGGG AATATTTGGA AGAGGTGTTC CAGGTTCCTG TATATCCCGT AGAGCTTCTT AGCATAGAAG GAATACAGAT AAACAAAGGA CTTGACGGGG AAAGACTCAA CTATCTTATC AACTCTGTGG GAATAACCTT GTAA
|
Protein sequence | MVTIPFLKTN LLSIDIGFRN IKIVEVELGR NNEIFIKNFG IASTPKGSIK NGAIKDVKSV TNEIRKVMEN INTKAKNAKI VMSGTNIISR VFVVEKIPGE DMNHLVRTTI SQSMPIDLDA HQIDYKVLQE FREDGIDKIK VFVTAVLKSI IQSYIDILIE LGLKPISVDI PANSAAKFFN REIMVSESET WFKRQRSSKL SQNTFAVIDF GSETTIVNIL RNRVLEFNKV ILRGSSNIDE AIAASTGKKL EEAERIKKIH GLALTDINAD EEQEKIYNSI KSVIDDIIRQ MFQCFEFYEK RCYGEKIGKI YMIGGGSQLK GLREYLEEVF QVPVYPVELL SIEGIQINKG LDGERLNYLI NSVGITL
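
The sequences above are printed in blocks of 10 characters separated by spaces, so they need cleaning before any computation. The following sketch, using only the Python standard library, shows one way to strip the spacing, compute GC content, and translate the CDS. It is a minimal illustration, not the method the database used: the helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard one, which translation table 11 shares for all internal codons (table 11 differs in which start codons it allows; this gene starts with ATG, so that difference does not matter here).

```python
BASES = "TCAG"
# Amino acids for the standard codon table, in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Any trailing bases that do not form a full codon are ignored.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
first_blocks = clean_sequence("ATGGTAACAA TTCCTTTCTT GAAAACCAAC")
print(translate(first_blocks))  # MVTIPFLKTN, the first block of the protein
print(round(gc_percent(first_blocks), 1))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `translate` and `gc_percent` should, if the record is internally consistent, reproduce the Protein sequence field and a GC content close to the reported 38%.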