Gene Information |
Locus tag | Cthe_0472 |
Symbol | |
ID | 4808325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 587106 |
End bp | 588077 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105886 |
Product | flagellar hook capping protein |
Protein accession | YP_001036903 |
Protein GI | 125972993 |
COG category | [N] Cell motility |
COG ID | [COG1843] Flagellar hook capping protein |
TIGRFAM ID | |
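The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (this reproduces the listed gene length) and that the final codon of the CDS is a stop codon that adds no residue. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 587106
END_BP = 588077


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    codons, remainder = divmod(n_bases, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no amino acid


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 972
print(protein_length(length_bp))  # 323
```

Both results agree with the Gene Length (972 bp) and Protein Length (323 aa) rows.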
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTAG ATGCTGTAGG CCGTCAAAAG ACCATTCAGG AAATCATTGA CAGTACCACA AACAAGTCGA CTCAAAGAAA TACCGGGGAA CTGGGAAAAG ATGAGTTTTT AAACCTTCTC ATTACACAGC TTCAATATCA GGACCCTCTA AATCCTGTGG ATGACAAGGA ATTTATCAGC CAGATGGCAC AGTTCAGCGC CCTGGAACAA ATGCAGAACC TCAATAAAAG CTTTTCAGCC ACAAAGGCCT TTGGCATGAT AGGAAAATAT ATTACCGGTA CTACCAGTAA CGGCAGTGAT TCAGGAGCCG GTTTTGTAGA GGGTATTGTA CGCAGTGTGA AAATGGAAAA CGGAAAAATT CTTTTGGAAG TCAATGGCTT GGATGTTCCG GTGGATAATG TGCTCAGTGT TTCAGAAGAA AGCAATTATT ATTACAAGTA CAATAGTTCA AATATATCCC AATACACCGG AATTATCGGC TATGAAGTAA GCGGAAGTGT ATATGATCCT TCCACCGGGG ACATTGTCGG CGTGAGTGGA ATTGTAAGGG AAATCCAGAA AGGTATTTAT GAAGATTATG CTGTAATGGA CGGTGTAGAG GTAATAATTT CCGGAATAGA CACTGAATTT AATTCGGCAG ATCCGAACTT TAGAAAGGAT TACCTTACAG AAAACGTAGG CCAGGGAGTT TCTCTTATAA TCACCGACGC ATACGGTTAT GGATTTAAGG TTCCTGTTAC AGGGGTTCTT AAAGATTTCA AGATAGCACC GAACGGTAAG ATTATCGGAA TTTTGGATGG GGTATATGTG CCTGTGGACA GCATCAGCAA TATAAAGAAA CCATCTGCGA ATACGGGCGC GGCTGACGGC ACAAACGAAA ACCCGCAGGT TGAACAGTCC CAGACTGAGG AGTCTCAAGT GGAGGAACCT CAGGTCGGAG AACCCCGGGA CGAAGAATCT GACAGTGTAT AG
|
Protein sequence | MAVDAVGRQK TIQEIIDSTT NKSTQRNTGE LGKDEFLNLL ITQLQYQDPL NPVDDKEFIS QMAQFSALEQ MQNLNKSFSA TKAFGMIGKY ITGTTSNGSD SGAGFVEGIV RSVKMENGKI LLEVNGLDVP VDNVLSVSEE SNYYYKYNSS NISQYTGIIG YEVSGSVYDP STGDIVGVSG IVREIQKGIY EDYAVMDGVE VIISGIDTEF NSADPNFRKD YLTENVGQGV SLIITDAYGY GFKVPVTGVL KDFKIAPNGK IIGILDGVYV PVDSISNIKK PSANTGAADG TNENPQVEQS QTEESQVEEP QVGEPRDEES DSV
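The protein sequence follows from the gene sequence under translation table 11, whose codon-to-amino-acid assignments are the same as the standard code. The sketch below is a minimal pure-Python illustration rather than the pipeline that produced this record: the helper names are hypothetical, only short excerpts of the gene sequence are embedded, and alternative start codons (which table 11 allows) are not handled because this gene begins with ATG. A small GC-fraction helper is included as well; run on the full gene sequence, its result can be compared with the GC content row above.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and the last 12 bases of the gene sequence above.
FIRST_30 = "ATGGCAGTAG ATGCTGTAGG CCGTCAAAAG"
LAST_12 = "GACAGTGTAT AG"

print(translate(FIRST_30))  # MAVDAVGRQK
print(translate(LAST_12))   # DSV
```

The two outputs match the first block and the final three residues of the listed protein sequence.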
|
| |