Gene Cthe_0184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0184 
Symbol 
ID4808672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp219986 
End bp221089 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content37% 
IMG OID640105595 
Producttype IV pilus assembly protein PilM 
Protein accessionYP_001036618 
Protein GI125972708 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4972] Tfp pilus assembly protein, ATPase PilM 
TIGRFAM ID[TIGR01174] cell division protein FtsA
[TIGR01175] type IV pilus assembly protein PilM 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000394027 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGTTAC CATTTTTGAG AAATAATTAT CTAAGCATTG ATATTGGTTT TAGGAATATC 
AAAGTTGTTG AAGTGGCAGT TAACAGGAAC AACGAAGTCT ACATCAACAA TTTTGGAATA
GCTCCTACTC CGTTAAATTG CATAAAAAAC GGCGTTATCA AAAATGTGGA TGCGGTATCC
AGCGAGATTA GCAGGATAAT CCGGGAGAAC AAGATGAAGG TAAAGAAGTC CAAGATTGTT
ATGTCCGGTA CCAGCATAAT TTCGAGAGTG TTTATGATTG AAAAAGTACC GGGCAAGAGT
ATTGATGAGC TTGTTCAGGA AGCGATAATG TCCAACTTGC CTATCAACCC GGATGAACAC
AAAATTGACT ACAAGATTTT GCAGGAGTTG AAAGAAGATA ACGTCGAAAA GGTTAAAGTT
TTTGTAACTG CAGTAAGAAA AAGCATAATA AACAGTTACA TTGATGTTCT GTTTGAACTT
GGTTTAAAAC CTGTTTCAGT TGATATACCG GCAAACAGTA CGGCAAAATT TTTCAACAGA
GATATAAAGG TCAGCATTGA CAATGTTAAG TTTGTAAACC CCAATAGAAG TCAGTTGAAC
AAAGATGCTT TTGCAGTTAT AGATTTTGGT TCGGAAACCA CGATTATAAA TATTCTGAGA
AACAAAATAC TGGAGTTTAA CAAAGTAATA TTAAGCGGAA GCAGCAATCT GGACGAGGCA
ATAGCTAAAA ATATGCGCAA GTCCATAATT GAAGCTGAAA GGCTTAAGAA ATTGTATGGC
ATTTCGGTTC CGCCGCACCA TTCCACCAAC GAGGAGCATA TGAAGGTTTA CAGGATTTTA
AAGGAAGTGG TTGATAAGCT TTTGAACCAG ATTTACATGT GCCTTACTTT TTATGAAGCT
CGTTGCTATG GTGTAAAAGT CGGAAAGATA TTTATAATAG GCGGGGGTTC ACTCTTAAAA
GGTTTGGCCG ACTATATGGA ACAAGTGCTG CAGGTACCTG TTTATCCTGT GGGACTTTTG
GATATAGACG GTATTGATAT AAACAAAAAT CTAAACAGCG ACAAGCTGAA TTTCCTTGTG
AATGCCGTAG GAATAACGCT GTAA
 
Protein sequence
MLLPFLRNNY LSIDIGFRNI KVVEVAVNRN NEVYINNFGI APTPLNCIKN GVIKNVDAVS 
SEISRIIREN KMKVKKSKIV MSGTSIISRV FMIEKVPGKS IDELVQEAIM SNLPINPDEH
KIDYKILQEL KEDNVEKVKV FVTAVRKSII NSYIDVLFEL GLKPVSVDIP ANSTAKFFNR
DIKVSIDNVK FVNPNRSQLN KDAFAVIDFG SETTIINILR NKILEFNKVI LSGSSNLDEA
IAKNMRKSII EAERLKKLYG ISVPPHHSTN EEHMKVYRIL KEVVDKLLNQ IYMCLTFYEA
RCYGVKVGKI FIIGGGSLLK GLADYMEQVL QVPVYPVGLL DIDGIDINKN LNSDKLNFLV
NAVGITL