Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2243 |
Symbol | |
ID | 4809981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2670009 |
End bp | 2671568 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107649 |
Product | flagellar hook-associated protein FlgK |
Protein accession | YP_001038638 |
Protein GI | 125974728 |
COG category | [N] Cell motility |
COG ID | [COG1256] Flagellar hook-associated protein |
TIGRFAM ID | [TIGR02492] flagellar hook-associated protein FlgK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGTT TCTTCGGATT TAACGTGGCA GTAAAGGGAT TGTTCACGGC TCAAAGAAAT ATGGATATAA TAAACCACAA TATTAACAAC GTAAACACAC CGGGATATTC CAGGCAGGTA GCGATTCAGT CGGCATCAAA CCCCATATCG CTCCTTAACG GAACGGGAAT GCTGGGTACA GGCTCGGAAG TGCTTGCCAT TGAAAGAATA AGGGATGAAT ACCTTGATTA CAAGTACTGG AGTGAAAACA TTTCTTACGG GGAATGGAAT GCCAAAAGGA CTCTTCTTGC CGATATGGAA GTTACATTCA ATGAACCGTC GGACAGCGGA TTTAATGCTG TGATAAACAG TTTTTACAAT TCTTTGCAGG AACTTGCAAA AGACCCCAGC AGCGATGCGA TCAGGGCTTT GGTAAAAGAA CAGGGCGTAA CCTTTGCGCG CTACTTTAAC AACATTGCAT CCCATTTTGA AGAGCTTCAG TTTGATATAA ACAATCAGGT AAAGACTGTT GTTACCGAGA TAAACTCATT AGGCACTCAA ATAGCCCAGT TGAACAAGCA AATATACACT GCCGAGCTGG ACGGAAATAC TGCAAATGAC TTAAGAGACA AAAGGACACT TTTGATTGAT GAGCTTTCAA AGCTTGTAAA CATAGATGTC AACGAAGTAG TGGTGGGCAA ACTCCCCAAC GGCAAAGACG ACAAACGAAT GATAATTACG ATCAGCGGAA AGGCTTTTGT AGACAACTTT GATGTCAACA AGCTCACCAT TAAGCAAAGA GAGAATAAAC TGAATGAGGA AGAGGATATT CCAAATCTGT ACGAAGTTTT GTGGGAAGAC GGCAACAGCC TTAGTGTCAG GGGCGGAGAG CTGAAAGGAC TGCTTGATGT TAGGGATGGA AATGACGGAG AAAACGGAAG TCCCAACTAT AAAGGCATAC CCTATTACAT AAGGAAACTT AATGAATTTG TCAGGACTTT TGCAATAGCA TTTAACGAAG GAATAGTGGG AAACGACAAG GTGGCGGGTC ATGTGGACGG ATATGGCACC AACGGAAATA CAGGAATAAG ATTTTTCACC ATATTGGGAG AGGAAAACAA ACCTATATCC AGTGCTGATT TCATGTCCGC CGGAGATATT GACGCCTGCT ACGGGAAAAT GACCGCCAAA AACTTTACCG TAAGTAGCGA CATTCTGGAT GATCCAAGAA ACATTGCAAC AGCTGATACA AAGGATCAGG TCGGGAACAT AGGAAATATA AACAGTATCC TGGCTATGAG AAATAACGTC CATATGTTCA AGGAAGGTGC TCCGGAGGAC TTTGTAAAAT CTGTTATTAC AACTCTGGCG ATAGATTCTC AGCAAACTAT AAGGCTTTCG TCAATCCATA AAAATATGAT TGAACAGGTT GAAAATCAGA GGATGTCTGT GTCAGGGGTG TCTTTGGACG AAGAAGTGGC AAATCTGGTA AAGCACCATC AGGCTTATGC GGCGGCCGCA CAAATGATTA ATACCATGGC GGAAGTTTAC GACATACTGA TTAACAGAGT AGGTCTTTGA
|
Protein sequence | MSSFFGFNVA VKGLFTAQRN MDIINHNINN VNTPGYSRQV AIQSASNPIS LLNGTGMLGT GSEVLAIERI RDEYLDYKYW SENISYGEWN AKRTLLADME VTFNEPSDSG FNAVINSFYN SLQELAKDPS SDAIRALVKE QGVTFARYFN NIASHFEELQ FDINNQVKTV VTEINSLGTQ IAQLNKQIYT AELDGNTAND LRDKRTLLID ELSKLVNIDV NEVVVGKLPN GKDDKRMIIT ISGKAFVDNF DVNKLTIKQR ENKLNEEEDI PNLYEVLWED GNSLSVRGGE LKGLLDVRDG NDGENGSPNY KGIPYYIRKL NEFVRTFAIA FNEGIVGNDK VAGHVDGYGT NGNTGIRFFT ILGEENKPIS SADFMSAGDI DACYGKMTAK NFTVSSDILD DPRNIATADT KDQVGNIGNI NSILAMRNNV HMFKEGAPED FVKSVITTLA IDSQQTIRLS SIHKNMIEQV ENQRMSVSGV SLDEEVANLV KHHQAYAAAA QMINTMAEVY DILINRVGL
|
| |