Gene Ccel_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2042 
Symbol 
ID7310748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2399980 
End bp2401344 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content41% 
IMG OID643608976 
Productprotein of unknown function DUF1078 domain protein 
Protein accessionYP_002506368 
Protein GI220929459 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.593026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGAT CTATGTTTTC TGGTGTATCA GGTTTAAAGG CACACCAGGC AAAGATGGAC 
GTTATAGGTA ATAACGTTGC AAATGTAAAC ACATTAGGCT TTAAAGCAGG AAGAGTAACC
TTCCAGGAAA TATTCAACCA GACATTGAGA GGTGCGGGGG CACCTGATGC TGCAACTGCT
AGAGGAGGAA CAAATCCTAT GCAGATTGGT TTAGGTATTG CAGTTGGTTC AATCGACAAT
CAAATGACCG GCGGAAGTCC GCAAAGAACC GATAACCCTA CTGATTTGTC TATTTCAGGC
GACGGCTTCT TTATAGTGAA GGGTTCAATT GCTGATACCT TCAAATTCAC AAGAGCAGGA
AACTTTGGTC TGGACAAGTT GGGAAATCTG GTATCAGGCG ACGGTATGAA TGTATACGGT
TGGACTAAAT ATGATACATT AGGTGATGGT ACAGTAAAGT TTGATACTGA AGCAGAAATA
ACTCCTATTA ATCTTTACTC TGACGTAACC AACGGGAACA AGAAGATCAT AGCAGCCAAG
GCTACAAGTT ATGCAGAGTT TTCGGGAAAC CTGAATTCTG CGTTACCTAT ATTGGCAGAT
CCTGACGATT CAGATCCCCA GTTTACAGTG CCGTTTACAA TTTATGATTC ATTGGGTAAT
GCACATGAAC TGATGGTTAA TTTCAAAAAA ACTGACGATT TAACTGATAT ACCTGTCAAA
TTACCGGATG GAACAGATGG TACAGAACCC GGTACAGAAT GGACATATAC TATTAGTGAC
AAGGCTGGCA ATTCTGTATT AACAGATGCA GGTAAAGTAA ACTTTAACTC AAAAGGAAAA
CTTGTTTTAG GTGACAATGA TGCTCCGGAA CAAAGAATCA TGGAATTTGA TCCGGGACAA
AATAGTGGTA CTGGTAAAAT TAATATTACT CTTGACTTCA AAAAACTAAC CCAGTACGCA
GGAGACAATT CCGTAAAACC GTCCAACATT GACGGATACA CAACAGGAAA CCTTGTAACA
TTTAATATCG GTTCAGATGG TATGCTTACA GGTGTTTACA GCAACGGTCA GCAGCAGCCG
CTGGGACTTA TAGCACTGGC GGGCTTCGAT AATCCTGCCG GTTTGCAAAA GGTAGGAGGC
AACCTGTTTA TACCGACAAC CAACTCCGGT GACTTTACTA AAGGTGTTCC GGCAGGTTCG
CAGGGAGTAG GTACATTGAG TCCCGGAACG CTTGAAATGT CAAATGTAGA TCTTTCAAGA
GAGTTTACGG ATATGATTGT TACACAGAGA GGTTTCCAGG CAAACAGCAG AATAATAACA
ACATCAGATG AAATGCTTCA GGAGCTTGTA AACCTAAAGA GGTAA
 
Protein sequence
MMRSMFSGVS GLKAHQAKMD VIGNNVANVN TLGFKAGRVT FQEIFNQTLR GAGAPDAATA 
RGGTNPMQIG LGIAVGSIDN QMTGGSPQRT DNPTDLSISG DGFFIVKGSI ADTFKFTRAG
NFGLDKLGNL VSGDGMNVYG WTKYDTLGDG TVKFDTEAEI TPINLYSDVT NGNKKIIAAK
ATSYAEFSGN LNSALPILAD PDDSDPQFTV PFTIYDSLGN AHELMVNFKK TDDLTDIPVK
LPDGTDGTEP GTEWTYTISD KAGNSVLTDA GKVNFNSKGK LVLGDNDAPE QRIMEFDPGQ
NSGTGKINIT LDFKKLTQYA GDNSVKPSNI DGYTTGNLVT FNIGSDGMLT GVYSNGQQQP
LGLIALAGFD NPAGLQKVGG NLFIPTTNSG DFTKGVPAGS QGVGTLSPGT LEMSNVDLSR
EFTDMIVTQR GFQANSRIIT TSDEMLQELV NLKR