Gene Ccel_3249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3249 
Symbol 
ID7311827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3802738 
End bp3804333 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content39% 
IMG OID643610150 
Producttwo component transcriptional regulator, AraC family 
Protein accessionYP_002507518 
Protein GI220930609 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAACG TAATGATTGT TGATGATGAA CCGGTAATAA AGCAAGGACT TTTATGCTTT 
GTCAATTGGG AGGCCCTTGA TTGCAAAGTG ATTTGTGATG CAGAAAACGG GATAGATGCA
ATGGAAAAAC TGGCTGTCCA TCCCGTGAAT ATCGTCCTTA CAGATATTAA AATGCCTGGA
ATGGATGGCT TGGAATTGTC AAAACTCATT TACGAAAAGT ATCCTTCTAT AAAAGTCATT
ATTCTTACGG CCTTTTCAGA CTTTACCTAT GCACAAGCTG CAATTAAATA CAATGTATTG
GATTTTGTCA TCAAGACCAA CCCAACTGAA AAAATTCCGG ATGCCATTCG TAAAGCAAAG
GACCTAATTG CACAGGAGAA AGAAAAAGAA GAAAAGCTGA AGCTTATGGA AGAAAAGGCA
ATTCTGAGGC TGTCAGAGAT CAAGGAAAAT TTTTTCAAAG ATGTGTTTAA TGGAATCATA
GTGAATGATA CCCTTCTTCA AAGCAAGCTA ATTGAGCTTG AAATAAGTAT CGAAAACTAT
TTTGTTGTTT TATTTGACAT TCACAATGCT TCAAACGGAG ATTCATCAAT CAGTCCTGAG
GACTATAATA AATTCATACA TTCAGTGAAC AATCTTCTTA ATATGGCATT TAAGAGCTAT
CCTCACTATA CTGTGGCAAT GGACAGAAAT CTCCTTGTGG CCGTAATTTC CTTCAAGAAC
TACAATGTCC CTGTATGTAC TCAAACACTG CTGATGACCT GCAATGAAAT TCTATCCATG
GCAGACAGCT TCATGCGGTT TACTATCAGT ATAGGGTTGA GCCAAATGCA CCAAAAGGTC
CAGGCTCTAT CTGTTGCCTA CCAGGAGGCC AGGGAAGCAT TGAGAGGTGG ATTTTATAAT
AACAACCACG TTTCTGTTTT CGTGCCCCGT ACTCCGCCCA TACCATCCTT AATAAATCAA
CCCCACTATA TTGCCGACAA AATTGTAGGC AGCCTCAAAT TGGAAAACCC CTCAGAGGCA
ATTATAAGCC TTGAAAACCT ATTGGAGGAG TACAGGAGTA ATAAGGAACC TATTGAAAAT
ATCAAGGTTT CGTGCATGCT CATCGCCTCT TATTGTTACA GGCTTGTTGC GAGCTATAAG
CTGTTTGCTC CGGAACTTGC CGATAGCGAG CCCGAGGTAT ACAAGCAGAT ACAGTCTAGC
AGAAATATCC AAAATCTATT AAATATTTTA AGGCAGTTAG TAGAGAATGT TTCACAGGTT
GTGGAAAATA ACGGGAATAA ATTCAGTTGC TTGGTAAAGG AAACCCAAAA GTATATTCGC
GATAATTATA ACAAGAATAT CAGCCTCCAG TCCATAGCTG ACCACATCCA TGTCAACAGC
AGCTATCTTA GCCGGTTGTA TAAAAAGGAA ACAGGGGAAT CCATCGTGGA TGCCATTAAT
AAGTATCGGA TTGAAAACGC CAAAAAGCTT TTAAAAGACC CTGTCTACAA AGTGTTTGAG
GTAGCGTGTG CTGTAGGCAT AGAAGACCCT GCCTACTTTA CTCATGTTTT TACCAAATAT
ACTGGAATGA GTCCTAAGGA ATACAAGAGT AAATAA
 
Protein sequence
MYNVMIVDDE PVIKQGLLCF VNWEALDCKV ICDAENGIDA MEKLAVHPVN IVLTDIKMPG 
MDGLELSKLI YEKYPSIKVI ILTAFSDFTY AQAAIKYNVL DFVIKTNPTE KIPDAIRKAK
DLIAQEKEKE EKLKLMEEKA ILRLSEIKEN FFKDVFNGII VNDTLLQSKL IELEISIENY
FVVLFDIHNA SNGDSSISPE DYNKFIHSVN NLLNMAFKSY PHYTVAMDRN LLVAVISFKN
YNVPVCTQTL LMTCNEILSM ADSFMRFTIS IGLSQMHQKV QALSVAYQEA REALRGGFYN
NNHVSVFVPR TPPIPSLINQ PHYIADKIVG SLKLENPSEA IISLENLLEE YRSNKEPIEN
IKVSCMLIAS YCYRLVASYK LFAPELADSE PEVYKQIQSS RNIQNLLNIL RQLVENVSQV
VENNGNKFSC LVKETQKYIR DNYNKNISLQ SIADHIHVNS SYLSRLYKKE TGESIVDAIN
KYRIENAKKL LKDPVYKVFE VACAVGIEDP AYFTHVFTKY TGMSPKEYKS K