Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0656 |
Symbol | |
ID | 4808186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 809447 |
End bp | 810553 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640106071 |
Product | type IV pilus assembly protein PilM |
Protein accession | YP_001037084 |
Protein GI | 125973174 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4972] Tfp pilus assembly protein, ATPase PilM |
TIGRFAM ID | [TIGR01175] type IV pilus assembly protein PilM |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000003661 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCTGG ATTTGTTTAT GAAAAAGACT TCAATGGTAT GTATAGATGT CGGGTATCGA AATATAAAAG TTGTTGAAGT CGCCGTAAAG AAGAATAATA ATATTTTTAT TGAAAATTAC GGTATTGTTC CGACCCCGCC TGATTGCATT AAGAACGGTG CAATATATGA CGTTGACAGG GTATTGAGCG TGATTAAAAG CGTTATAAGA GAGCAAAACA TGAAAGCTAA AAATGCAAAA ATTATAATGT CCGGAACAAA CATAATAACA AGAATTTACT TAATTGACAA AGTACAGGGA GAAAGCGAAG ATTTCACCGT AAAGAACAGT ATGCCTCAAT TTCTCCCCAT TGATATAGAT AATTACAGAG TTGACTATAA AATTCTTCAG ACAATAAAGG AAAAAGGCAG TGAAAAATAC AAAGTTTTTG TGACGGCAGT ACCTAAAAAC ATCCTTCAAA GTTATGTTGA CGTGTTGCAA GGTTTGGATT TAAAGCCTCT GGCTGTTGAC ATACCGGCAA ATAGTACTGC AAAATTTTTT AACAGGGAAA TTCTCACAAG AGATATGGAT GAATATTATT CCAAGAGGAA GTATAAAAAA GTGGAAAGTG ACACTTTTGC AGTATTGGAC TTTGGATCTG AGACGACAAT TGTTAATTTT CTTAAAGACA GAGTGCTTGA ATTTAACAAA GTTATTCTTT CCGGAAGTTC CAATATTGAC GAGCATATTG CAAGGGAACT CAATATAAGT CTTCAGGAAG CTGAAAGACT TAAGAAAACA TATGGAATGA CTCCCCCCAA CAATCTTTCA AAAAGAGAAC ATGTAATAAC TTACGGAAAA GTCAGCAATT TTATTGAAAG GCTTACCCGG CAGATAGCAA AGTGTTTTGA ATTTTATCTT GAAAGGTGTT ATGGTACTCC GATTTCAAAG ATTTTTATTA TAGGTGGAGG TTCACAGCTT AGCGGACTTA ATCAATATTT GTTTTCAACG TTCAATGTCC CGGTTTATCC CGTAGGACTT TTGAATCTCA AAGGAGTTGA GCTTAAGAAA AATCTTGACA AAGATAAACT CAATTACCTG ATAAATGCTG TGGGAATATC CCTTTAA
|
Protein sequence | MLLDLFMKKT SMVCIDVGYR NIKVVEVAVK KNNNIFIENY GIVPTPPDCI KNGAIYDVDR VLSVIKSVIR EQNMKAKNAK IIMSGTNIIT RIYLIDKVQG ESEDFTVKNS MPQFLPIDID NYRVDYKILQ TIKEKGSEKY KVFVTAVPKN ILQSYVDVLQ GLDLKPLAVD IPANSTAKFF NREILTRDMD EYYSKRKYKK VESDTFAVLD FGSETTIVNF LKDRVLEFNK VILSGSSNID EHIARELNIS LQEAERLKKT YGMTPPNNLS KREHVITYGK VSNFIERLTR QIAKCFEFYL ERCYGTPISK IFIIGGGSQL SGLNQYLFST FNVPVYPVGL LNLKGVELKK NLDKDKLNYL INAVGISL
|
| |