Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2621 |
Symbol | |
ID | 4809043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3099480 |
End bp | 3100280 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108035 |
Product | protein of unknown function DUF1078-like protein |
Protein accession | YP_001039014 |
Protein GI | 125975104 |
COG category | [N] Cell motility |
COG ID | [COG4786] Flagellar basal body rod protein |
TIGRFAM ID | [TIGR02488] flagellar basal-body rod protein FlgG, Gram-negative bacteria [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000161045 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGAG CACTGTGGAC CGCGGGGTCG GGAATGACGG CTCAGCAGCT GAATGTTGAC GTTATAGCAA ACAATCTTTC GAATGTTAAC ACCACCGCAT ACAAAAAAGA AAGACTGGAA TTTAAGGATA TGCTCTATGA CACCTTGAAC AGGGCATATA TCCTTGACGG AGAAGGAAGA CCGGTAAATT TGCAGGTGGG ATATGGAACG GTTCCCATGG CCACACTGAA AAACTTCCAG AGCGGAAACT TTGAGAAGAC CGACAATCCT CTTGATCTTG CCATAGACGG CGATGGCTTT TTCATGGTGC TCGGTCCAAG GGGAGACATA GTTTATACCC GGGACGGAAG CTTTAAAATC AGTGTGACGG AAAACGGAAA CATGCTCACA ACATCGGACG GATACCCGGT GCTGGATGAG TCCGGAGTTG AAATAATACT TGATATTGAT ATATCCAAAC TGAATGTTTC ATCCGACGGG GAACTGAGCT ACACCGACGA AAACGGGGTA GTAGTACCTC TGGGACAGAG AATTGGTCTT GTGAAATTCC CCAACAGAAA CGGTCTTGAA AGTATCGGAA GCAACTTTTA TGCAAGCACT TCCGCATCCG GAGAAGCGGT ACCTGATGAG GAACTGGGCA AAAAGAGCAA TATAAGGCAG TATTATCTTG AGTCGTCAAA TGTGCAGGTA GTTGAGGAAA TGGTTAAGCT CATTGTTGCC CAAAGGGCAT ATGAAATCAA TTCCAAAGCA ATTCAGTCAG CGGATGAAAT GTTGGGCATA GCAAATAATT TAAAGAGATA A
|
Protein sequence | MMRALWTAGS GMTAQQLNVD VIANNLSNVN TTAYKKERLE FKDMLYDTLN RAYILDGEGR PVNLQVGYGT VPMATLKNFQ SGNFEKTDNP LDLAIDGDGF FMVLGPRGDI VYTRDGSFKI SVTENGNMLT TSDGYPVLDE SGVEIILDID ISKLNVSSDG ELSYTDENGV VVPLGQRIGL VKFPNRNGLE SIGSNFYAST SASGEAVPDE ELGKKSNIRQ YYLESSNVQV VEEMVKLIVA QRAYEINSKA IQSADEMLGI ANNLKR
|
| |