Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0474 |
Symbol | |
ID | 4808327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 588682 |
End bp | 590004 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105888 |
Product | protein of unknown function DUF1078-like protein |
Protein accession | YP_001036905 |
Protein GI | 125972995 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000090236 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGAT CCATGTTTTC CGGAGTTTCA GGACTTCAGG CCCATCAGAC AAAAATGGAC GTTATAGGAA ACAATGTGGC AAACGTTAAC ACCGTAGGTT TTAAATCAAG CAGGGTGACG TTCCAGGAAG TTTTCAGCCA GACTTTAAAA GGAGCCAGCT CTCCTGATCC AACTACCGGG AGAGGAGGAA CTAATCCCAT GCAGGTGGGA TTGGGTCTGG GAGTTGCAAC CATTGACACA CTCATGACTC GCGGAAGTGT TCAAAGAACC GATAATCCTA CCGACCTTGC AATTGAAGGA GACGGATTCT TTATTGTAAA AGGCGGAAGC AGTGACACAT TCAAATTCAC AAGAGCCGGA AACTTCGGAA TAGACAGACT GGGAAACCTT GTAACCGGAA GCGGACTGAA TGTTTATGGC TGGCAGTCCT ATACGAAACT GCCTGACGGA ACCTATAAAT TTGACACTGA AAGTCAAATT GAACCTATAA ATCTTTATTC CGACGATGTG AATAAAAATA AAAGAATGAT AGCAGCAAAA GCAACTACTT ATGCAATCTT TGAAGGAAAT CTGGATGCAT CCTATTCAAT TTACAGCAGC GCAGCATCCG GCACATCCAG CAACAATAGA TTCACTATGC CTGTTACGGT ATATGATTCT TTGGGAAACA GCTACAAAAT AAATATAAGT TTCTGGAAAA CCGACGTTAC AGGCGGTGTT ACCACATGGA CATGGCAGGT TGATTCGGGA AATGGGGTAA CAGCATCGGG AGCTACCGGA ACTATACTGT TTGACGATCA GGGACAGGTA ATTGAAAGTT CCGCGGTTAC ACCCAGTATA ACAATTATAC CTGACAGCAG CGTTGGAAGC CAGAATATCA ATGTGAAGCT GGACTTTTCC AGACTTACAA TGTATGCGGC AGACAGTTCC GCAAAAGCTA CAAATGTGGA CGGATATCCG GCAGGTTCGC TTGTGACCTT CAGTATTGGT TCCGACGGAA TGATTATGGG TATATACAGT AATGGTCAGC AGCAGCCGTT GGGACTTATA GCCTTGGCAA GTTTTGACAA CCCTGCGGGT CTTGAAAAGG TAGGAGAGAA TATGTATATT CCGACTTCCA ACTCCGGAGA GTTTAAAAAA GGTGTCAAGG CAGGCACTGA AGGAGTGGGA ACTTTAAGCC CCGGTACGTT GGAAATGTCC AACGTGGATC TGTCAAAAGA GTTTACTGAA ATGATTATTA CCCAAAGAGG ATTTCAGGCA AACAGCAGGA TAATAACAAC TTCCGACGAA ATGCTCCAGG AGCTTGTCAA CCTTAAGAGA TAG
|
Protein sequence | MMRSMFSGVS GLQAHQTKMD VIGNNVANVN TVGFKSSRVT FQEVFSQTLK GASSPDPTTG RGGTNPMQVG LGLGVATIDT LMTRGSVQRT DNPTDLAIEG DGFFIVKGGS SDTFKFTRAG NFGIDRLGNL VTGSGLNVYG WQSYTKLPDG TYKFDTESQI EPINLYSDDV NKNKRMIAAK ATTYAIFEGN LDASYSIYSS AASGTSSNNR FTMPVTVYDS LGNSYKINIS FWKTDVTGGV TTWTWQVDSG NGVTASGATG TILFDDQGQV IESSAVTPSI TIIPDSSVGS QNINVKLDFS RLTMYAADSS AKATNVDGYP AGSLVTFSIG SDGMIMGIYS NGQQQPLGLI ALASFDNPAG LEKVGENMYI PTSNSGEFKK GVKAGTEGVG TLSPGTLEMS NVDLSKEFTE MIITQRGFQA NSRIITTSDE MLQELVNLKR
|
| |