Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0184 |
Symbol | |
ID | 4808672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 219986 |
End bp | 221089 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640105595 |
Product | type IV pilus assembly protein PilM |
Protein accession | YP_001036618 |
Protein GI | 125972708 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4972] Tfp pilus assembly protein, ATPase PilM |
TIGRFAM ID | [TIGR01174] cell division protein FtsA [TIGR01175] type IV pilus assembly protein PilM |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000394027 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGTTAC CATTTTTGAG AAATAATTAT CTAAGCATTG ATATTGGTTT TAGGAATATC AAAGTTGTTG AAGTGGCAGT TAACAGGAAC AACGAAGTCT ACATCAACAA TTTTGGAATA GCTCCTACTC CGTTAAATTG CATAAAAAAC GGCGTTATCA AAAATGTGGA TGCGGTATCC AGCGAGATTA GCAGGATAAT CCGGGAGAAC AAGATGAAGG TAAAGAAGTC CAAGATTGTT ATGTCCGGTA CCAGCATAAT TTCGAGAGTG TTTATGATTG AAAAAGTACC GGGCAAGAGT ATTGATGAGC TTGTTCAGGA AGCGATAATG TCCAACTTGC CTATCAACCC GGATGAACAC AAAATTGACT ACAAGATTTT GCAGGAGTTG AAAGAAGATA ACGTCGAAAA GGTTAAAGTT TTTGTAACTG CAGTAAGAAA AAGCATAATA AACAGTTACA TTGATGTTCT GTTTGAACTT GGTTTAAAAC CTGTTTCAGT TGATATACCG GCAAACAGTA CGGCAAAATT TTTCAACAGA GATATAAAGG TCAGCATTGA CAATGTTAAG TTTGTAAACC CCAATAGAAG TCAGTTGAAC AAAGATGCTT TTGCAGTTAT AGATTTTGGT TCGGAAACCA CGATTATAAA TATTCTGAGA AACAAAATAC TGGAGTTTAA CAAAGTAATA TTAAGCGGAA GCAGCAATCT GGACGAGGCA ATAGCTAAAA ATATGCGCAA GTCCATAATT GAAGCTGAAA GGCTTAAGAA ATTGTATGGC ATTTCGGTTC CGCCGCACCA TTCCACCAAC GAGGAGCATA TGAAGGTTTA CAGGATTTTA AAGGAAGTGG TTGATAAGCT TTTGAACCAG ATTTACATGT GCCTTACTTT TTATGAAGCT CGTTGCTATG GTGTAAAAGT CGGAAAGATA TTTATAATAG GCGGGGGTTC ACTCTTAAAA GGTTTGGCCG ACTATATGGA ACAAGTGCTG CAGGTACCTG TTTATCCTGT GGGACTTTTG GATATAGACG GTATTGATAT AAACAAAAAT CTAAACAGCG ACAAGCTGAA TTTCCTTGTG AATGCCGTAG GAATAACGCT GTAA
|
Protein sequence | MLLPFLRNNY LSIDIGFRNI KVVEVAVNRN NEVYINNFGI APTPLNCIKN GVIKNVDAVS SEISRIIREN KMKVKKSKIV MSGTSIISRV FMIEKVPGKS IDELVQEAIM SNLPINPDEH KIDYKILQEL KEDNVEKVKV FVTAVRKSII NSYIDVLFEL GLKPVSVDIP ANSTAKFFNR DIKVSIDNVK FVNPNRSQLN KDAFAVIDFG SETTIINILR NKILEFNKVI LSGSSNLDEA IAKNMRKSII EAERLKKLYG ISVPPHHSTN EEHMKVYRIL KEVVDKLLNQ IYMCLTFYEA RCYGVKVGKI FIIGGGSLLK GLADYMEQVL QVPVYPVGLL DIDGIDINKN LNSDKLNFLV NAVGITL
|
| |