Gene Ccel_1251 details

Gene Information

Locus tag: Ccel_1251
Symbol:
ID: 7310046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1550692
End bp: 1552392
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 36%
IMG OID: 643608172
Product: two component transcriptional regulator, AraC family
Protein accession: YP_002505587
Protein GI: 220928678
COG category: [K] Transcription
              [T] Signal transduction mechanisms
COG ID: [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain
        [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
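
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the CDS ends with a single stop codon. The function names are hypothetical and only illustrate the check.

```python
# Values copied from the Gene Information fields above.
START_BP = 1550692
END_BP = 1552392
GENE_LENGTH_BP = 1701
PROTEIN_LENGTH_AA = 566


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one final stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1701
print(coding_codons(GENE_LENGTH_BP))   # 566
```

Both results agree with the listed gene length (1701 bp) and protein length (566 aa).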


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAGGC TATTGATTGT AGATGATGAG GAAATTATTG TAAACGGATT ATATGAGATA 
TTCCGAAGCT TAAAAAATCT TGATCTGGAT GTATACAGGG CATATTCGGG CGAAGAGGCG
GTTGAGTGGC TAAGCAGAAC AAGAATGGAT ATTGTTTTAA CAGATATTAG AATGCCGGAA
ATGGATGGTT TGCAGTTATT GGATGTCATA TATAGAAGAT GGCCTCAATG CAAAGTAATA
TTCCTAACAG GGTATAACGA ATTTGAATAT GTATATAAAG CAATTCAGCA CCAGGGTGTA
AGCTATATAC TAAAAACAGA AGATAATGAC AAGGTAATAA GTGTTGTTGA AAATGCCATA
GAGGAGATAA AAAAAGAAAT AAAAACAGAA GACTTGATTC ATAACGCCAA GGAGCAAATT
GAACTTGCGT TGGGTCTCTT CCAAAAGGAT TACTTTATTC ATCTGCTCCA CGAAGAGAAT
TCATCAGATG TAAATAAAGC CCAGTTTGAA CAACTGGCAA TTCCAATGTA CCCTGATCAT
CCTGTAATTC TGCTGCTTGG ACATATTGAC AGCATTCCGG GTAATCTGTC CTACTGGGAT
AAGATACAGT ATATAAATTC TATTAAAGTG ATTATTACTA AAAATCTGAA TACCAATATC
AGGTCCATAT GCATTTTGGA TGATAGATAT AGGTTTATTT TTTTCATACA ACCAAATGAG
CTTATGACAA CAGGTTTCAG TAAAACTGAA GAAAATGCTT TTTATGATAA GGCTATTTCT
TTTGTAAAGG GTACCCTTGA AGTAATACAG ACTGCGTGTA TGGAGTCGTT GAATGCCTCT
ATAAGCTTTT CCCTCAGCGG AGAACCCTGT AACTGGGAAG ACGTATCTAA AAAGTATTTT
TCTCTTAATC AACTGCTGAA CTATAAAATA GGCACAGGGA CAGAAATGCT ATTGATTGAT
AATGAATTAA AAAATAATAT TCTAGCGTCA GGTACAGAAG TACCTGAGCT GGACACAAAT
GATGAGCCAT TGGAAATTCT TTTAAGAAGG AAAAATATGG ACCTAATTCA GCAATACATG
GAGTCTGGCC AAAAAGAAAA ATATTTTGAG GTATTGGAGG AATTATTAAC TCCAATAAAA
TCAGTCAAAA GTAAAAACAG CAATTTAGCC ATTGAAGCAT ATTCCATGGT ATCGCTCAGT
ATCCTTTCAT ATATCAATCG TTGGAAAATA ACGGAGAAAC TGGCATTTCA TATAGGTCTT
AATAGCCTTA TGAGGATTGA CAAACATGAA ACATGGTCAG ATGCAGTAAA GTATCTGATA
AATATATCTG ATATAATATT CCAACTCCAA ACCGACGAAC AGAAGAAGAG GGCGGATAAT
GCAGTTGATT TCGTTCAGAG TTTTATAAAT GAGCATTTGA GTGAAGATCT TTCTTTGGTA
AGACTTGCAG AACAGGTTTA CTTGAACCCA TCATATTTGT CACGCCTCTA TAAGCAGGTT
ACAAATACCA ACCTTTCTGA CTTTATTGAC AACGCCCGTA TAGAGCGGGC AAAAGAGCTT
CTGAAAAAAG AAAAGGTGAA GATAAATGAA GTAGCAAAAG CCGTAGGGTA TGAAACTGCA
GCATCCTTTA CAAGGTTCTT CAAGAAACTA ATGGGGTGTT CTCCACAGGA ATATCATGAT
ACCATGCTGT CAGGTAAATA A
 
Protein sequence
MYRLLIVDDE EIIVNGLYEI FRSLKNLDLD VYRAYSGEEA VEWLSRTRMD IVLTDIRMPE 
MDGLQLLDVI YRRWPQCKVI FLTGYNEFEY VYKAIQHQGV SYILKTEDND KVISVVENAI
EEIKKEIKTE DLIHNAKEQI ELALGLFQKD YFIHLLHEEN SSDVNKAQFE QLAIPMYPDH
PVILLLGHID SIPGNLSYWD KIQYINSIKV IITKNLNTNI RSICILDDRY RFIFFIQPNE
LMTTGFSKTE ENAFYDKAIS FVKGTLEVIQ TACMESLNAS ISFSLSGEPC NWEDVSKKYF
SLNQLLNYKI GTGTEMLLID NELKNNILAS GTEVPELDTN DEPLEILLRR KNMDLIQQYM
ESGQKEKYFE VLEELLTPIK SVKSKNSNLA IEAYSMVSLS ILSYINRWKI TEKLAFHIGL
NSLMRIDKHE TWSDAVKYLI NISDIIFQLQ TDEQKKRADN AVDFVQSFIN EHLSEDLSLV
RLAEQVYLNP SYLSRLYKQV TNTNLSDFID NARIERAKEL LKKEKVKINE VAKAVGYETA
ASFTRFFKKL MGCSPQEYHD TMLSGK
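
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is an illustrative example, not the database's own method: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it strips the 10-base spacing used in the listing before processing.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
# ('*' marks a stop codon). Table 11 shares these assignments with the
# standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and final 21 bases of the gene sequence listed above.
print(translate("ATGTATAGGC TATTGATTGT AGATGATGAG"))  # MYRLLIVDDE
print(translate("ACCATGCTGT CAGGTAAATA A"))           # TMLSGK
```

The two fragments reproduce the first ten and last six residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the whole record to be compared against the listed protein and GC content.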