Gene Information |
Locus tag | Ccel_2042 |
Symbol | |
ID | 7310748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2399980 |
End bp | 2401344 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643608976 |
Product | protein of unknown function DUF1078 domain protein |
Protein accession | YP_002506368 |
Protein GI | 220929459 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] flagellar hook-basal body proteins |
|
|
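The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, but both agree with the reported values. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2399980
end_bp = 2401344

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS with one codon per residue plus one stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1365 454
```

Both results match the Gene Length (1365 bp) and Protein Length (454 aa) fields.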
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.593026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGAT CTATGTTTTC TGGTGTATCA GGTTTAAAGG CACACCAGGC AAAGATGGAC GTTATAGGTA ATAACGTTGC AAATGTAAAC ACATTAGGCT TTAAAGCAGG AAGAGTAACC TTCCAGGAAA TATTCAACCA GACATTGAGA GGTGCGGGGG CACCTGATGC TGCAACTGCT AGAGGAGGAA CAAATCCTAT GCAGATTGGT TTAGGTATTG CAGTTGGTTC AATCGACAAT CAAATGACCG GCGGAAGTCC GCAAAGAACC GATAACCCTA CTGATTTGTC TATTTCAGGC GACGGCTTCT TTATAGTGAA GGGTTCAATT GCTGATACCT TCAAATTCAC AAGAGCAGGA AACTTTGGTC TGGACAAGTT GGGAAATCTG GTATCAGGCG ACGGTATGAA TGTATACGGT TGGACTAAAT ATGATACATT AGGTGATGGT ACAGTAAAGT TTGATACTGA AGCAGAAATA ACTCCTATTA ATCTTTACTC TGACGTAACC AACGGGAACA AGAAGATCAT AGCAGCCAAG GCTACAAGTT ATGCAGAGTT TTCGGGAAAC CTGAATTCTG CGTTACCTAT ATTGGCAGAT CCTGACGATT CAGATCCCCA GTTTACAGTG CCGTTTACAA TTTATGATTC ATTGGGTAAT GCACATGAAC TGATGGTTAA TTTCAAAAAA ACTGACGATT TAACTGATAT ACCTGTCAAA TTACCGGATG GAACAGATGG TACAGAACCC GGTACAGAAT GGACATATAC TATTAGTGAC AAGGCTGGCA ATTCTGTATT AACAGATGCA GGTAAAGTAA ACTTTAACTC AAAAGGAAAA CTTGTTTTAG GTGACAATGA TGCTCCGGAA CAAAGAATCA TGGAATTTGA TCCGGGACAA AATAGTGGTA CTGGTAAAAT TAATATTACT CTTGACTTCA AAAAACTAAC CCAGTACGCA GGAGACAATT CCGTAAAACC GTCCAACATT GACGGATACA CAACAGGAAA CCTTGTAACA TTTAATATCG GTTCAGATGG TATGCTTACA GGTGTTTACA GCAACGGTCA GCAGCAGCCG CTGGGACTTA TAGCACTGGC GGGCTTCGAT AATCCTGCCG GTTTGCAAAA GGTAGGAGGC AACCTGTTTA TACCGACAAC CAACTCCGGT GACTTTACTA AAGGTGTTCC GGCAGGTTCG CAGGGAGTAG GTACATTGAG TCCCGGAACG CTTGAAATGT CAAATGTAGA TCTTTCAAGA GAGTTTACGG ATATGATTGT TACACAGAGA GGTTTCCAGG CAAACAGCAG AATAATAACA ACATCAGATG AAATGCTTCA GGAGCTTGTA AACCTAAAGA GGTAA
|
Protein sequence | MMRSMFSGVS GLKAHQAKMD VIGNNVANVN TLGFKAGRVT FQEIFNQTLR GAGAPDAATA RGGTNPMQIG LGIAVGSIDN QMTGGSPQRT DNPTDLSISG DGFFIVKGSI ADTFKFTRAG NFGLDKLGNL VSGDGMNVYG WTKYDTLGDG TVKFDTEAEI TPINLYSDVT NGNKKIIAAK ATSYAEFSGN LNSALPILAD PDDSDPQFTV PFTIYDSLGN AHELMVNFKK TDDLTDIPVK LPDGTDGTEP GTEWTYTISD KAGNSVLTDA GKVNFNSKGK LVLGDNDAPE QRIMEFDPGQ NSGTGKINIT LDFKKLTQYA GDNSVKPSNI DGYTTGNLVT FNIGSDGMLT GVYSNGQQQP LGLIALAGFD NPAGLQKVGG NLFIPTTNSG DFTKGVPAGS QGVGTLSPGT LEMSNVDLSR EFTDMIVTQR GFQANSRIIT TSDEMLQELV NLKR
|
| |
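The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It is not the database's own method. It assumes the gene sequence is listed in the coding (sense) orientation, which is consistent with it starting with ATG and ending with TAA even though the gene lies on the minus strand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, so a standard table is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
# Standard codon assignments, ordered with T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a sense-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGATGAGAT CTATGTTTTC TGGTGTATCA"))  # MMRSMFSGVS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence in the same way should reproduce the listed protein, ending before the terminal TAA stop codon.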