Gene Cthe_0478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0478 
Symbol 
ID4808331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp592016 
End bp593236 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content42% 
IMG OID640105892 
Productflagellar motor switch protein 
Protein accessionYP_001036909 
Protein GI125972999 
COG category[N] Cell motility
[T] Signal transduction mechanisms
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1776] Chemotaxis protein CheC, inhibitor of MCP methylation
[COG1886] Flagellar motor switch/type III secretory pathway protein 
TIGRFAM ID[TIGR02480] flagellar motor switch protein FliN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000894996 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGATA TGTTGTCTCA GGCTGAAATA GATGCTTTGC TAAATGGTAG CTCATCGGAA 
GGGAATGAAG AAAACAGCGC AAATGAGGAG CTTACGTCAC AGGAAATTGA TGCATTGGGT
GAAATAGGCA ACATAAGTAT GGGAACCTCT GCTACTACTC TTAACTCGTT GTTGGGACAA
AGGGTAGTGA TAACAACTCC CAAAGTATCA GTATGTACAT GGGAAGAATT AGCGAAAGAA
TATCACAGTT CTTATGTGGC TGTAAAAGTC GAATATACTG AAGGACTTGA CGGTATGAAT
CTTCTTCTTT TAAAAGAGGA CGATGCCAAG GTGATTACCG ATTTGATGAT GGGCGGAGAC
GGCAGTAATA AAGAGGGAGA ATTGACGGAT TTGCACTTAA GTGCAATCAG TGAGGCAATG
AATCAAATGA TTGGTTCTGC AGCCACTTCA ATGTCTTCGA TGTTTAACAA GAGAATAGAT
ATTTCGCCTC CGAAAGTGTT TCCCATGACA TTGGACGCTG CCGTTCCTGA AGCGGAATTT
CCAAGGGGCG ACAAGATAGT AAAAGTTGCA TTTAAAATGG TTATCGGAGA ATTGATTGAC
AGTCAAATTA TGCAGCTTTT GCCTATCGAT TTTGCCAAAA AAATGGTTAG TAATATTATG
AATTCCAAAC CCGAGGACAA TCAGGTAATA GAAGAGGTTG CTGCATCAGC TGTTCAGGAA
CCTGTTGCCG GACAGCATAA TGAGCAGTAT TATCATGCTC AACCGTCTCA GCAGCCGTAT
CAACAGCCGC CTCAGCAGCC GTATCAACAA CCGTATCAAC AACCATACCA TCAACCGTAT
CAGCAACAAT ATCAGCAGTA TTACCAGCCG TATGAGCAGC CACCCAGACA AAATGATGGA
TACCGCAATC CTATAAATGT GCAACCTGCA CAGTTTGAAG CTTTTGATGA CGGTTCGAAA
ATCTCAATTG ACAAGAAAAA CATCGGGCTT ATCATGGATG TGCCGCTTCA GGTTACAGTT
GAGCTTGGAC GAACCAATAA ACTTATTAAA GATATTTTGG AATTCGGACC TGGTTCAATA
ATTGAGCTTG ACAAGCTTGC AGGTGAACCC GTGGATATTC TGGTAAACGG GAAAGTAATT
GCCATAGGTG AAGTTGTGGT TATAGATGAA AGTTTTGGTG TCAGGATTAC AGACATACTT
CATCCGTCGA AAAGGCTTTA A
 
Protein sequence
MEDMLSQAEI DALLNGSSSE GNEENSANEE LTSQEIDALG EIGNISMGTS ATTLNSLLGQ 
RVVITTPKVS VCTWEELAKE YHSSYVAVKV EYTEGLDGMN LLLLKEDDAK VITDLMMGGD
GSNKEGELTD LHLSAISEAM NQMIGSAATS MSSMFNKRID ISPPKVFPMT LDAAVPEAEF
PRGDKIVKVA FKMVIGELID SQIMQLLPID FAKKMVSNIM NSKPEDNQVI EEVAASAVQE
PVAGQHNEQY YHAQPSQQPY QQPPQQPYQQ PYQQPYHQPY QQQYQQYYQP YEQPPRQNDG
YRNPINVQPA QFEAFDDGSK ISIDKKNIGL IMDVPLQVTV ELGRTNKLIK DILEFGPGSI
IELDKLAGEP VDILVNGKVI AIGEVVVIDE SFGVRITDIL HPSKRL