Gene Cthe_0477 details

Gene Information

Locus tag: Cthe_0477
Symbol:
ID: 4808330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 591028
End bp: 592014
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 40%
IMG OID: 640105891
Product: flagellar motor switch protein FliM
Protein accession: YP_001036908
Protein GI: 125972998
COG category: [N] Cell motility
COG ID: [COG1868] Flagellar motor switch protein
TIGRFAM ID: [TIGR01397] flagellar motor switch protein FliM
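
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 591028
END_BP = 592014


def gene_length(start: int, end: int) -> int:
    """Length in bp of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 987 328
```

Both results match the Gene Length (987 bp) and Protein Length (328 aa) fields.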


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00127431
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGAGATA TTCTGTCTCA AAATGAAATA GACGATTTAC TAAAGGCCCT GAATACAGGT 
GAACTGGATG TCCAGCAAAT CAGTTCCAAA ATCGAGGAAA GGAAGATAAA AACCCACGAT
TTCAGAAGGC CCAGTAAATT TGCAAAAGAT CATGTAAGGA CTTTAAATGT TATCCACGAC
AACTATGCAA GAATTATCAC AAACTTCCTT TCAGGATATT TGAGGACTCT GGTTCAGGTA
GAAGTTATTT CCGTTGAGCC GATAGCTTAT TATGAATTCA ACAATTCAAT ATCAAATCCT
GCGGTGCTTG CAGTTATTGA TTTTGCTCCG CTGACAGGAT CAATTATCCT TGAAATGGCA
CCACCTGTTG CTTATGCACT GATTGACAGG ATTCTCGGCG GAAAAGGCTT GCCAATTGAA
AGAGTAAGGG AGTTTACGGA AGTTGAGATT GCCATTATCG AAAGAATTAT CATACAGCTT
GTAAACCTTA TGAGAGAGCC GTGGGAAAAC GTGGTGGAAC TTAAACCCAG GCTTGAGAAA
ATTGAAACCA ACGCCCAGTT TGCACAGATA GTCTCGGCCA ATGAGACGGT GGCTTTGATT
ACTTTGGGAG CAAAAATCGG CGAAGTTGAA GGAATGATTA ATCTTTGCAT TCCTCATATA
GTAGTAGAAC CAATAGTTTC AAAACTGAAT ACGAAGTTTT GGTTTTCCAG TGTTGAAAAA
GAAGCAACAA AAGAAGACAA AGAAACCATT CAGAAAAAAA TTGAGTATAC AAAAGTACCT
GTCAGAGCAA TACTGGGAAG AGCAACCATT CAAGTGGCTG ATTTTCTTGA ATTACAACCG
GGGGATGTAA TAACTCTGGA TTCAAACGTC AACGGAAATC TTGATGTATT GGTGGGTGAC
TTGCTGAAAT TCCGTGGTTC TCCCGGGGTT AAGAAAAACA GGAATGCGAT CAAAATAACC
GAAGTAATAA GAAGGGAGGA TGAGTAG
 
Protein sequence
MGDILSQNEI DDLLKALNTG ELDVQQISSK IEERKIKTHD FRRPSKFAKD HVRTLNVIHD 
NYARIITNFL SGYLRTLVQV EVISVEPIAY YEFNNSISNP AVLAVIDFAP LTGSIILEMA
PPVAYALIDR ILGGKGLPIE RVREFTEVEI AIIERIIIQL VNLMREPWEN VVELKPRLEK
IETNAQFAQI VSANETVALI TLGAKIGEVE GMINLCIPHI VVEPIVSKLN TKFWFSSVEK
EATKEDKETI QKKIEYTKVP VRAILGRATI QVADFLELQP GDVITLDSNV NGNLDVLVGD
LLKFRGSPGV KKNRNAIKIT EVIRREDE
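
The relationship between the two sequences can be illustrated with a minimal translation sketch. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code, and it assumes that the first codon of the CDS (here TTG) is an initiation codon read as methionine. The `translate` function and its `is_start` parameter are illustrative names, not part of the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, ignoring whitespace in the input.

    If is_start is True, the first codon is assumed to be the initiation
    codon and is reported as M regardless of its usual meaning.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if is_start and residues:
        residues[0] = "M"
    return "".join(residues).rstrip("*")  # drop a trailing stop codon


# First and last lines of the gene sequence above.
FIRST_LINE = "TTGGGAGATA TTCTGTCTCA AAATGAAATA GACGATTTAC TAAAGGCCCT GAATACAGGT"
LAST_LINE = "GAAGTAATAA GAAGGGAGGA TGAGTAG"

print(translate(FIRST_LINE))                 # MGDILSQNEIDDLLKALNTG
print(translate(LAST_LINE, is_start=False))  # EVIRREDE
```

The two outputs agree with the first 20 and last 8 residues of the protein sequence. The last line is in frame because the 960 bases preceding it are a multiple of three.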