Gene Cthe_0656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0656 
Symbol 
ID4808186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp809447 
End bp810553 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content35% 
IMG OID640106071 
Producttype IV pilus assembly protein PilM 
Protein accessionYP_001037084 
Protein GI125973174 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4972] Tfp pilus assembly protein, ATPase PilM 
TIGRFAM ID[TIGR01175] type IV pilus assembly protein PilM 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000003661 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCTGG ATTTGTTTAT GAAAAAGACT TCAATGGTAT GTATAGATGT CGGGTATCGA 
AATATAAAAG TTGTTGAAGT CGCCGTAAAG AAGAATAATA ATATTTTTAT TGAAAATTAC
GGTATTGTTC CGACCCCGCC TGATTGCATT AAGAACGGTG CAATATATGA CGTTGACAGG
GTATTGAGCG TGATTAAAAG CGTTATAAGA GAGCAAAACA TGAAAGCTAA AAATGCAAAA
ATTATAATGT CCGGAACAAA CATAATAACA AGAATTTACT TAATTGACAA AGTACAGGGA
GAAAGCGAAG ATTTCACCGT AAAGAACAGT ATGCCTCAAT TTCTCCCCAT TGATATAGAT
AATTACAGAG TTGACTATAA AATTCTTCAG ACAATAAAGG AAAAAGGCAG TGAAAAATAC
AAAGTTTTTG TGACGGCAGT ACCTAAAAAC ATCCTTCAAA GTTATGTTGA CGTGTTGCAA
GGTTTGGATT TAAAGCCTCT GGCTGTTGAC ATACCGGCAA ATAGTACTGC AAAATTTTTT
AACAGGGAAA TTCTCACAAG AGATATGGAT GAATATTATT CCAAGAGGAA GTATAAAAAA
GTGGAAAGTG ACACTTTTGC AGTATTGGAC TTTGGATCTG AGACGACAAT TGTTAATTTT
CTTAAAGACA GAGTGCTTGA ATTTAACAAA GTTATTCTTT CCGGAAGTTC CAATATTGAC
GAGCATATTG CAAGGGAACT CAATATAAGT CTTCAGGAAG CTGAAAGACT TAAGAAAACA
TATGGAATGA CTCCCCCCAA CAATCTTTCA AAAAGAGAAC ATGTAATAAC TTACGGAAAA
GTCAGCAATT TTATTGAAAG GCTTACCCGG CAGATAGCAA AGTGTTTTGA ATTTTATCTT
GAAAGGTGTT ATGGTACTCC GATTTCAAAG ATTTTTATTA TAGGTGGAGG TTCACAGCTT
AGCGGACTTA ATCAATATTT GTTTTCAACG TTCAATGTCC CGGTTTATCC CGTAGGACTT
TTGAATCTCA AAGGAGTTGA GCTTAAGAAA AATCTTGACA AAGATAAACT CAATTACCTG
ATAAATGCTG TGGGAATATC CCTTTAA
 
Protein sequence
MLLDLFMKKT SMVCIDVGYR NIKVVEVAVK KNNNIFIENY GIVPTPPDCI KNGAIYDVDR 
VLSVIKSVIR EQNMKAKNAK IIMSGTNIIT RIYLIDKVQG ESEDFTVKNS MPQFLPIDID
NYRVDYKILQ TIKEKGSEKY KVFVTAVPKN ILQSYVDVLQ GLDLKPLAVD IPANSTAKFF
NREILTRDMD EYYSKRKYKK VESDTFAVLD FGSETTIVNF LKDRVLEFNK VILSGSSNID
EHIARELNIS LQEAERLKKT YGMTPPNNLS KREHVITYGK VSNFIERLTR QIAKCFEFYL
ERCYGTPISK IFIIGGGSQL SGLNQYLFST FNVPVYPVGL LNLKGVELKK NLDKDKLNYL
INAVGISL