Gene Cthe_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0471 
Symbol 
ID4808324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp585504 
End bp587090 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content43% 
IMG OID640105885 
Productflagellar hook-length control protein 
Protein accessionYP_001036902 
Protein GI125972992 
COG category[N] Cell motility 
COG ID[COG3144] Flagellar hook-length control protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAACTC AACATTTAAT TCCCGATTTT ATTGCAAAAA TGACCGGTTC TACCGAAGTG 
CAAAAGTCGG CGGGCATGAA AAAAAGCTCT TCATCGCAGT TTAAAGATAC CCTTGACCTG
GCTGTCGAAA AGTCGTATGC ATGGCAGAGC GGCACTAAAT CCTATGAATA TGCAATGGAT
ATCAAAGACA GGCCTTATCA GCGTTTGCAA AACAAAAACA TACTGGACTC AGCCGAAACA
AAGGAAAGAA GACCGGACAG AACCAAGCCT TTTAACAAAG CATCAAACCA GATAACATCC
AGGGCATCAG ATAAGGCAGA ACGCACAGCT TCTCCGGAAG AAGAAAACAT CGAAGGCGAA
AATGACAGAA AGCTGAAAGG GAAAGCTATG GAGAAAGCTC TGGCAGAAGT TCTTGGAATT
AGTGTGGAAG AGCTGGAAAA GCTCATGGCT CAGCTGGGCA TAAATTTTGA GAATGCTGAC
GGTGAAACGG GTATTCAGGA AGCTGCCGAT AAAATTTCAG CATATCTGGG TTTAAACCAG
GATGAGAAAA TGGCTCTGGC AGAGATGATG TCCCTTGTTC AAAAGCAAGT TGAACAGGCC
TTCAAGGATG TTCAGTCTGC GTATGATGCT GCAAGATACG ACATGAAGGA TGAGAATGAA
GCTTATGGGG TCGAGCTTTT CCATGCTGAA ACTGAGACGG CGGTGGAAGA TACTTCGGCG
CTGAGAAATG ATTTGGATGT TTCCAAAGTT TCTCAGGAAG TAAAAAATAA GCTCGATGAA
GAAACGGACG GTTTCTTAAA GCAGATTGCC GCTAAAGTTG TGGAAGTGGT ACAAAAGGAT
GAAAGTACAG CCGGATTGAA AACAGTGAGT GTGAATGGTG AAAACATTGA AGAGATTGGG
TTGAAAACTG ATGTGGAAGA CATCGGCAAT GTCAGGGAGG CAAAATATTC TTCGGATGAA
AAAGACAGCG CCGACAACAG CGGAAACGCC GGCAGCGAGA CATCATCCAT GGCATCAAGA
GGCGTTGAAG CGGAATCTGC AGCAAAAAAC AGTAACAACA TACAATTTGA AGCAGTTTCA
AACTTAACTG TTCAAAGAGC AACAGGCCAG GCAGAACCGG ACAAAACCCG AAATATTATT
CCGGTTACAA ATAAGGAAAT AGTCGAGCAG GTTGTTGAAA AAGCAAAGGT GGTATTAAGC
GGTGACAAAA GTGAAATGGT CATTGACTTA AAGCCGGAGC ACCTTGGGAA GTTGGAACTC
AAAATTGTTA CCGAAAGAGG AATGGTTGTT GCCAAATTTG TTGCAGAAAA TGAGCAGGTA
AAGGCAGCTT TGGAATCAAA CATGAACATG CTCAAGGAAT CTTTGGAAAA GCAGGGCTTT
TTGGTGGAAG GGTTTAGTGT CACGGTCGGA GACAACAAAA GACGCGAGAA CAGCAGAGAC
AAAACGAATC AGGGCACTGC GAACCAAAGA ATATCCGGTG AAAAACTGCA GGTGTCGGAT
ATGACCGGAG TTGAAAGAAT GCAAAGAATT CATGAAAACA TCGATCCTTA CAGCTATGGG
AGCAGCAGTA TTGATTTAAC TGCATAA
 
Protein sequence
MITQHLIPDF IAKMTGSTEV QKSAGMKKSS SSQFKDTLDL AVEKSYAWQS GTKSYEYAMD 
IKDRPYQRLQ NKNILDSAET KERRPDRTKP FNKASNQITS RASDKAERTA SPEEENIEGE
NDRKLKGKAM EKALAEVLGI SVEELEKLMA QLGINFENAD GETGIQEAAD KISAYLGLNQ
DEKMALAEMM SLVQKQVEQA FKDVQSAYDA ARYDMKDENE AYGVELFHAE TETAVEDTSA
LRNDLDVSKV SQEVKNKLDE ETDGFLKQIA AKVVEVVQKD ESTAGLKTVS VNGENIEEIG
LKTDVEDIGN VREAKYSSDE KDSADNSGNA GSETSSMASR GVEAESAAKN SNNIQFEAVS
NLTVQRATGQ AEPDKTRNII PVTNKEIVEQ VVEKAKVVLS GDKSEMVIDL KPEHLGKLEL
KIVTERGMVV AKFVAENEQV KAALESNMNM LKESLEKQGF LVEGFSVTVG DNKRRENSRD
KTNQGTANQR ISGEKLQVSD MTGVERMQRI HENIDPYSYG SSSIDLTA