Gene Cthe_2686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2686 
Symbol 
ID4808858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3170039 
End bp3171142 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content38% 
IMG OID640108105 
Producttype IV pilus assembly protein PilM 
Protein accessionYP_001039078 
Protein GI125975168 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4972] Tfp pilus assembly protein, ATPase PilM 
TIGRFAM ID[TIGR01174] cell division protein FtsA
[TIGR01175] type IV pilus assembly protein PilM 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000584895 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAACAA TTCCTTTCTT GAAAACCAAC CTTCTGAGTA TTGACATCGG TTTTAGGAAC 
ATAAAAATTG TTGAAGTGGA GCTAGGCAGG AATAATGAGA TTTTTATTAA AAATTTCGGT
ATAGCTTCTA CTCCAAAAGG GTCTATCAAA AACGGGGCTA TTAAAGATGT CAAGAGTGTT
ACCAACGAGA TAAGGAAGGT AATGGAGAAC ATAAACACAA AGGCAAAGAA TGCAAAGATT
GTTATGTCAG GTACGAATAT TATTTCCCGT GTTTTTGTTG TTGAAAAGAT CCCCGGGGAA
GATATGAATC ATCTTGTCAG AACGACTATT TCCCAAAGTA TGCCGATAGA CCTTGATGCC
CATCAGATAG ATTACAAGGT GTTGCAGGAA TTCAGAGAGG ATGGAATTGA TAAAATAAAG
GTATTTGTTA CCGCTGTTTT AAAAAGTATT ATACAAAGTT ATATTGACAT TTTGATAGAA
TTGGGACTCA AACCCATATC AGTGGATATT CCCGCCAACA GCGCGGCAAA GTTTTTCAAC
AGGGAGATTA TGGTTTCCGA GAGTGAAACG TGGTTTAAAA GGCAAAGATC CAGCAAGCTT
AGCCAGAATA CTTTTGCCGT TATTGATTTT GGTTCTGAGA CTACAATAGT AAACATATTA
AGGAACAGGG TTCTGGAGTT TAACAAAGTT ATTTTAAGGG GCAGCAGTAA TATTGACGAG
GCCATCGCGG CAAGTACAGG CAAAAAGCTC GAAGAGGCTG AAAGAATTAA AAAGATTCAC
GGACTTGCTC TTACTGATAT CAATGCCGAT GAAGAACAGG AGAAAATTTA CAACAGTATC
AAATCCGTTA TTGACGATAT AATACGGCAG ATGTTTCAAT GTTTTGAGTT TTATGAAAAA
AGATGTTACG GCGAGAAAAT AGGAAAGATT TACATGATAG GCGGAGGATC GCAGTTAAAA
GGACTTAGGG AATATTTGGA AGAGGTGTTC CAGGTTCCTG TATATCCCGT AGAGCTTCTT
AGCATAGAAG GAATACAGAT AAACAAAGGA CTTGACGGGG AAAGACTCAA CTATCTTATC
AACTCTGTGG GAATAACCTT GTAA
 
Protein sequence
MVTIPFLKTN LLSIDIGFRN IKIVEVELGR NNEIFIKNFG IASTPKGSIK NGAIKDVKSV 
TNEIRKVMEN INTKAKNAKI VMSGTNIISR VFVVEKIPGE DMNHLVRTTI SQSMPIDLDA
HQIDYKVLQE FREDGIDKIK VFVTAVLKSI IQSYIDILIE LGLKPISVDI PANSAAKFFN
REIMVSESET WFKRQRSSKL SQNTFAVIDF GSETTIVNIL RNRVLEFNKV ILRGSSNIDE
AIAASTGKKL EEAERIKKIH GLALTDINAD EEQEKIYNSI KSVIDDIIRQ MFQCFEFYEK
RCYGEKIGKI YMIGGGSQLK GLREYLEEVF QVPVYPVELL SIEGIQINKG LDGERLNYLI
NSVGITL