Gene Ccel_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0944 
Symbol 
ID7309780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1122427 
End bp1124046 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content35% 
IMG OID643607872 
Producttwo component transcriptional regulator, AraC family 
Protein accessionYP_002505287 
Protein GI220928378 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAAAG CTATTATTGT TGATGATGAA GATTTGGTAC GCCAAGGTTT AAAAAAACAC 
TTTGATTGGA GCGACCACAA TATTGAGATA GTTGCCGATC TTTCCGATGG CCAGAAGGCA
TTTCAGTATG TGAAGGACAA TCACGTTGAT TTGGTCCTTA CTGATGTCCT CATGCCTTAT
ATGGATGGAA TAACATTGGC AAAGAATTTA CGAGAGCTTT ATCCAGAGAT AAAGATAATA
TTTATTAGTG GTCATGATGA TGTAAGCTAC CTAAAAAATG CACTTAAAGT CGAGGCTGTG
GACTATATTC TTAAATCCAT CGATTTAGAC GAGCTTAAAG ATACTGTTAG CCGAGTGGTA
AATACTATGA ATACTGAGAA TCAGAGTAAA AAAACCATGG CTGATATGGA AAATCTATTA
AATCAGAGTT TTCCACTTTT ACAAGAGCGT TTTTTTATCA CAATGATCCG CGATGATTTC
GAAAATCCTG ATATAATGAA AGAACGTATA GCATTTTTGA ATATCCCACT CAATGATGAA
ATGTATTATT GTGTACTAGT AGTGCAAATA CAGCGTATTT ACAGCAAGTT TCATGTCTTA
ACGGAACGTG AGAGACAAAT TCTTTCCCTT CAGATACAAA ACGAATGTAC AGAAGTTGGT
AAACAGTATA GTGATACCAT TTGCTTTGAA AATAAACAAG GTGAATATGT TATGATATTG
TCCTTATTAG AAGATGAATA TGAAGAAACT CTCCTTGAAG TTTCCGAAAA TCTTGACAAG
CGTCTCAACG GTTATATGAA TTTGCCAGTA TCTATTGGTA TAAGTGACAG ATTTAAAGGG
CTTGAAAATA TAAAAGCATC TTATGAGAAT GCATCAAATT CCATAAGTAA ACGGTATTTA
CTTGATGACG AACTGACCAT TTCCGTTGAT AAATACGAAA TGGACGAAAG TCTCAAAGAA
TATAAGGAAA GAGCTAAAAA GAGTCTGCAA GAATGTTTAA GCTCTGGAAA TACCGAACAG
GTGTCTGAGG TACTCCGAGA GCTTTTCCAT ATAATAAGAG AAAAATTTCC AGATGATGAA
GAGCAGAATC TGATGATTTT TTTACTACTA CTCCCAACAC GCATAGTAAC TGATATTAAA
ATAAATAAAA AAAGTGATTA TTCCAACCAG CGAATGATTT TAGAGAAATT TCTGTGCTGT
GCGGATTTTG AAGAACAATG CCTTCTGATT CAAAAGCTCT ATTTTGAGGT GGCAACCCTT
ATGAGCAGCA TGAGCAAAAC ATATTCCCAT ACAATCATCA ATCAGGTGCG AAAGACTATT
GAGGAACGCT TTAAGGAACA GATATCAATA AGTACATTAG CAAGGGATGT TTACTTAACA
CCCACATATT TATGCGTTTT GTTTAAACAA GTTACCGGAA CTACAATAAA TGATTATTTA
ACTCTGACTC GACTTGAGAA AGCAAAGAAG CTTTTATCAG ATCCGTACAT AAAACTGTAT
GATGTATGTT ATGAGGTTGG CTATTTATCA CCAAGCTATT TTTCCCGTTT ATTTAAGAAA
TACACAGGAA TCTCGCCTAG CGAATACAGG AATGTTGCAA TAGCATCTTC CGAGCAATAA
 
Protein sequence
MYKAIIVDDE DLVRQGLKKH FDWSDHNIEI VADLSDGQKA FQYVKDNHVD LVLTDVLMPY 
MDGITLAKNL RELYPEIKII FISGHDDVSY LKNALKVEAV DYILKSIDLD ELKDTVSRVV
NTMNTENQSK KTMADMENLL NQSFPLLQER FFITMIRDDF ENPDIMKERI AFLNIPLNDE
MYYCVLVVQI QRIYSKFHVL TERERQILSL QIQNECTEVG KQYSDTICFE NKQGEYVMIL
SLLEDEYEET LLEVSENLDK RLNGYMNLPV SIGISDRFKG LENIKASYEN ASNSISKRYL
LDDELTISVD KYEMDESLKE YKERAKKSLQ ECLSSGNTEQ VSEVLRELFH IIREKFPDDE
EQNLMIFLLL LPTRIVTDIK INKKSDYSNQ RMILEKFLCC ADFEEQCLLI QKLYFEVATL
MSSMSKTYSH TIINQVRKTI EERFKEQISI STLARDVYLT PTYLCVLFKQ VTGTTINDYL
TLTRLEKAKK LLSDPYIKLY DVCYEVGYLS PSYFSRLFKK YTGISPSEYR NVAIASSEQ