Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0471 |
Symbol | |
ID | 4808324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 585504 |
End bp | 587090 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105885 |
Product | flagellar hook-length control protein |
Protein accession | YP_001036902 |
Protein GI | 125972992 |
COG category | [N] Cell motility |
COG ID | [COG3144] Flagellar hook-length control protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAACTC AACATTTAAT TCCCGATTTT ATTGCAAAAA TGACCGGTTC TACCGAAGTG CAAAAGTCGG CGGGCATGAA AAAAAGCTCT TCATCGCAGT TTAAAGATAC CCTTGACCTG GCTGTCGAAA AGTCGTATGC ATGGCAGAGC GGCACTAAAT CCTATGAATA TGCAATGGAT ATCAAAGACA GGCCTTATCA GCGTTTGCAA AACAAAAACA TACTGGACTC AGCCGAAACA AAGGAAAGAA GACCGGACAG AACCAAGCCT TTTAACAAAG CATCAAACCA GATAACATCC AGGGCATCAG ATAAGGCAGA ACGCACAGCT TCTCCGGAAG AAGAAAACAT CGAAGGCGAA AATGACAGAA AGCTGAAAGG GAAAGCTATG GAGAAAGCTC TGGCAGAAGT TCTTGGAATT AGTGTGGAAG AGCTGGAAAA GCTCATGGCT CAGCTGGGCA TAAATTTTGA GAATGCTGAC GGTGAAACGG GTATTCAGGA AGCTGCCGAT AAAATTTCAG CATATCTGGG TTTAAACCAG GATGAGAAAA TGGCTCTGGC AGAGATGATG TCCCTTGTTC AAAAGCAAGT TGAACAGGCC TTCAAGGATG TTCAGTCTGC GTATGATGCT GCAAGATACG ACATGAAGGA TGAGAATGAA GCTTATGGGG TCGAGCTTTT CCATGCTGAA ACTGAGACGG CGGTGGAAGA TACTTCGGCG CTGAGAAATG ATTTGGATGT TTCCAAAGTT TCTCAGGAAG TAAAAAATAA GCTCGATGAA GAAACGGACG GTTTCTTAAA GCAGATTGCC GCTAAAGTTG TGGAAGTGGT ACAAAAGGAT GAAAGTACAG CCGGATTGAA AACAGTGAGT GTGAATGGTG AAAACATTGA AGAGATTGGG TTGAAAACTG ATGTGGAAGA CATCGGCAAT GTCAGGGAGG CAAAATATTC TTCGGATGAA AAAGACAGCG CCGACAACAG CGGAAACGCC GGCAGCGAGA CATCATCCAT GGCATCAAGA GGCGTTGAAG CGGAATCTGC AGCAAAAAAC AGTAACAACA TACAATTTGA AGCAGTTTCA AACTTAACTG TTCAAAGAGC AACAGGCCAG GCAGAACCGG ACAAAACCCG AAATATTATT CCGGTTACAA ATAAGGAAAT AGTCGAGCAG GTTGTTGAAA AAGCAAAGGT GGTATTAAGC GGTGACAAAA GTGAAATGGT CATTGACTTA AAGCCGGAGC ACCTTGGGAA GTTGGAACTC AAAATTGTTA CCGAAAGAGG AATGGTTGTT GCCAAATTTG TTGCAGAAAA TGAGCAGGTA AAGGCAGCTT TGGAATCAAA CATGAACATG CTCAAGGAAT CTTTGGAAAA GCAGGGCTTT TTGGTGGAAG GGTTTAGTGT CACGGTCGGA GACAACAAAA GACGCGAGAA CAGCAGAGAC AAAACGAATC AGGGCACTGC GAACCAAAGA ATATCCGGTG AAAAACTGCA GGTGTCGGAT ATGACCGGAG TTGAAAGAAT GCAAAGAATT CATGAAAACA TCGATCCTTA CAGCTATGGG AGCAGCAGTA TTGATTTAAC TGCATAA
|
Protein sequence | MITQHLIPDF IAKMTGSTEV QKSAGMKKSS SSQFKDTLDL AVEKSYAWQS GTKSYEYAMD IKDRPYQRLQ NKNILDSAET KERRPDRTKP FNKASNQITS RASDKAERTA SPEEENIEGE NDRKLKGKAM EKALAEVLGI SVEELEKLMA QLGINFENAD GETGIQEAAD KISAYLGLNQ DEKMALAEMM SLVQKQVEQA FKDVQSAYDA ARYDMKDENE AYGVELFHAE TETAVEDTSA LRNDLDVSKV SQEVKNKLDE ETDGFLKQIA AKVVEVVQKD ESTAGLKTVS VNGENIEEIG LKTDVEDIGN VREAKYSSDE KDSADNSGNA GSETSSMASR GVEAESAAKN SNNIQFEAVS NLTVQRATGQ AEPDKTRNII PVTNKEIVEQ VVEKAKVVLS GDKSEMVIDL KPEHLGKLEL KIVTERGMVV AKFVAENEQV KAALESNMNM LKESLEKQGF LVEGFSVTVG DNKRRENSRD KTNQGTANQR ISGEKLQVSD MTGVERMQRI HENIDPYSYG SSSIDLTA
|
| |