Gene Ccel_2031 details

Gene Information

Locus tag: Ccel_2031
Symbol:
ID: 7310738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2390593
End bp: 2392617
Gene Length: 2025 bp
Protein Length: 674 aa
Translation table: 11
GC content: 38%
IMG OID: 643608965
Product: flagellar biosynthesis protein FlhA
Protein accession: YP_002506357
Protein GI: 220929448
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1298] Flagellar biosynthesis pathway, component FlhA
TIGRFAM ID: [TIGR01398] flagellar biosynthesis protein FlhA
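
The coordinates and lengths listed above are mutually consistent, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2390593
end_bp = 2392617

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon and therefore adds no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # gene length in base pairs
print(protein_length, "aa")   # protein length in amino acids
```

Under these assumptions the computed values agree with the reported Gene Length (2025 bp) and Protein Length (674 aa).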


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCAAAGC TTAACGGTTT TTTTGTACCA TTAATCGTTG TACTTGCAAT TATAAATCTA 
ATTTTGCCAA TACCTACAGG TGTGTTGGAC TTTTTGCTTG CAGTTAATAT AATTCTTTCA
GCAATAATTA TGCTTAATAC AATATACCTT AAATCAGCAC TTGATCTTTC CGTGTTTCCG
ACATTGATAG TTGTTACAAC TATATTCCGT CTATCACTTA ATGTTACGGC CACAAAGCTT
ATTTTGGCAA ATGGTGAAGC AGGCCATGTT ATAAGGGGTT TTGGGCAATT TGTAGGAAGA
AATAATCTGG TTGTGGGTTT CGTAATATTT TTTATAATAA TGATTGTTAA CTTTCTGGTA
ATTACGAAAG GTTCTGAAAG AGTTGCGGAA GTTGCAGCCA GATTTACACT TGATGCTATG
CCCGGTAAAC AGATGGCTAT AGACGCCGAT TTAAATACAG GTCTTATAAG TGATGCTGAA
GCAAAAGAAC GCAGAAAAAA GATACAAAGA GAAGCAGACT TTTACGGTTC TATGGATGGT
GCCAGTAAAT TTGTAAAAAA TGATGCCATT GCAGGTATTA TTATTACATT AATAAACATT
ATCGGAGGTA TCATCATAGG AGTTGTTATG CTTGAAAAGG ACATCGGCGA TGCCCTACAG
ACATATACAA TTCTAACTAT AGGTGATGGT CTGGTAAGCC AGCTGCCTGC ACTAATGCTT
TCAACAGCTA CAAGCTTTAT TGTTACACGT GCAGGTGCAG ATTCTGATCT TAATAAGGAT
GTCCTAAGAC AGTTATTCTA TAACCCCAAG GTTCTTATTA TAGCGGCCTG TCTCAGCGTG
GGTCTGGCAC CATTTTTAAC CCCCGCTCCA TTTCTGATTC TTGCCGCAGT CTTATTATTC
GTTGCATATA AAATCAGACA ACTTCAAGAG GAAGCTGATA AGGATGAAGT AGTTCAAATT
CAGGAAAGTG AAGTTGAGGA AATTAGAAAG CCTGAAAATG TTGTCAGCCT TCTTCAGGTT
GACCCTATAG AACTCGAGTT TGGTTACGGT ATTATTCCTC TGGCTGATGT AAATCAGGGA
GGAGACCTAT TGGACAGGGT TGTTCTAATC AGAAGGCAGC TGGCACTTGA ACTTGGTATG
ATTGTACCGG TTATCAGGTT AAGAGATAAC ATTCAAATAA GCCCAAATGA ATATATTATT
AAAATAAAAG GTACTCAGGT GGCAAAAGGA GAATTGCTTT TTGACTACTT CCTTGCCATG
AATTCGGGGG ATGTAGAGGA AGATCTTGAG GGTATAAAAA CTATAGAACC AGCATTCGGT
TTACCCGCAA TATGGATTCA GGAAAGTCAG AGGGACAAAG CTGAAATGCT TGGCTATACT
GTTGTCGATC CACCTTCTAT TATTGCGACG CATCTTACAG AGGTTATTAA AAAGCATTCC
TACGAATTGC TTGGAAGGCA GGAAGTACAG GCATTGGTGG ACAACATCAA GCAATCCTAT
CCGGCCATTG TAGATGAACT GGTTCCTAAG CTTTTGAGTG TCGGTGAAAT ACAAAAGGTT
CTTGCAAACC TGTTAAAGGA AAATGTAACA ATAAGAGATA TGGTTACTAT AATGGAAACA
TTGGCAGACT ATGCACCTGC AACCCATGAT ATTGATATGC TTACAGAATA TGTGCGTCAG
GCTTTGGGAA GATCCATATC ACAAAAATTT CTTAACGGAA ATACCAATGT AATTACACTT
GACCCAAAAG TAGAGCAAAT GATTCTGGAT TCAATACAAA AAACGGAGTT TGGTTCGTAT
CTTGCTCTTG ACCCTTCAGT TTCAAATACG ATAATAAATA GTGTTTCAAA AAATGTACAA
AGACTGATAC AACTTGGAAG TCAACCGATT ATTCTTGCTT CGCCGGTTGT CAGATTATAT
TTTAAACGTT TAACAGAAAA TGTAATACCC GGTTTGGTAG TCCTATCTTA CAATGAAATT
GATTCCGGGG TAGAAATACA ATCAGTAGGA ACAGTCAGTG TTTGA
 
Protein sequence
MAKLNGFFVP LIVVLAIINL ILPIPTGVLD FLLAVNIILS AIIMLNTIYL KSALDLSVFP 
TLIVVTTIFR LSLNVTATKL ILANGEAGHV IRGFGQFVGR NNLVVGFVIF FIIMIVNFLV
ITKGSERVAE VAARFTLDAM PGKQMAIDAD LNTGLISDAE AKERRKKIQR EADFYGSMDG
ASKFVKNDAI AGIIITLINI IGGIIIGVVM LEKDIGDALQ TYTILTIGDG LVSQLPALML
STATSFIVTR AGADSDLNKD VLRQLFYNPK VLIIAACLSV GLAPFLTPAP FLILAAVLLF
VAYKIRQLQE EADKDEVVQI QESEVEEIRK PENVVSLLQV DPIELEFGYG IIPLADVNQG
GDLLDRVVLI RRQLALELGM IVPVIRLRDN IQISPNEYII KIKGTQVAKG ELLFDYFLAM
NSGDVEEDLE GIKTIEPAFG LPAIWIQESQ RDKAEMLGYT VVDPPSIIAT HLTEVIKKHS
YELLGRQEVQ ALVDNIKQSY PAIVDELVPK LLSVGEIQKV LANLLKENVT IRDMVTIMET
LADYAPATHD IDMLTEYVRQ ALGRSISQKF LNGNTNVITL DPKVEQMILD SIQKTEFGSY
LALDPSVSNT IINSVSKNVQ RLIQLGSQPI ILASPVVRLY FKRLTENVIP GLVVLSYNEI
DSGVEIQSVG TVSV
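
The relationship between the gene sequence and the protein sequence can be sketched in Python as follows. This is a rough illustration, not the method used to produce the record. It assumes that translation table 11 shares the standard codon assignments and that an alternative start codon such as GTG is read as methionine (M) in the first position; the start codon set, the function name `translate_cds`, and the other names are assumptions made for this example. Only the first and last few codons of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed set of start codons for the bacterial code (table 11).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine regardless of
            # its usual meaning (GTG would otherwise encode valine).
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as listed in the record.
head = "GTGGCAAAGC TTAACGGTTT TTTTGTACCA"
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
tail = "ACAGTCAGTG TTTGA"

print(translate_cds(head))
print(translate_cds(tail))
```

The first call yields MAKLNGFFVP, matching the first ten residues of the protein sequence, and the second yields TVSV, matching its last four residues before the stop codon.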