Gene Ccel_0147 details

Gene Information

Locus tag: Ccel_0147
Symbol:
ID: 7309057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 167379
End bp: 168956
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 38%
IMG OID: 643607076
Product: two component transcriptional regulator, AraC family
Protein accession: YP_002504515
Protein GI: 220927606
COG category: [T] Signal transduction mechanisms
COG ID: [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain
TIGRFAM ID:
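
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(167379, 168956)
    print(length)                  # 1578
    print(protein_length(length))  # 525
```

Under these assumptions the computed values agree with the Gene Length (1578 bp) and Protein Length (525 aa) reported in the record.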


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGAAAG TACTTCTTGT AGATGATGAA CCATATGTTC TTGAAGGGCT GAAAGTTATG 
CTGGACTGGG AAGCACATGG CTTTAGAATT TGCGGTGAAG CATCAAATGG AGAGGATGCA
CTGGAAATTG TGAGAGTGTG CAACCCTGAT CTTATAATGA CAGACATAAG TATGCCCAGA
ATAGATGGTT TGGAATTAAT CAGGCTGTCA ACCGAAAATC TCAAGTCTAC TGCAAAGTTT
GTCATACTCA GCGGCTATGA CGATTTTTCC TATGCCAAGC GTGCTATGCT GTACAATGCC
AGCAACTATT TGCTCAAACC ACTGGATGAT GTTGAACTCG ATAGTGTTGT TACTAAACTT
GCAGAGCAGA TCAAAAAGGA ACGTAAAGAA ACCGAAAATA TAAATAAACA GCTTTCATTC
ATTGCTAATC AGAGCATTAT AAGACTAATT AATGGAGACA ATAAGCCATC TCTTATAGGT
AGGGTAGGTA TGCTTCTTGA TATTGCAGAG GATGAAGAGT TCAGGTGTAT ACTTTTTGAG
ATTGATTCAG CTGACAGCTG GGTTCAGGGA GAAGAAAGAA CCGAATTGAA TATAAATAGA
ACAACACCGG CAAGGGTCAT AGAAGATGCT CTTGATCCGG CTTTTCAGTT TCGTATATTT
GAGGACGGAA AAGGACGAAT CGGCATAATT GCCAGTGAAA AAATGCCGTT CTTCAATACA
CTTGAGGAGT TTACAAAGAG TCTGCTTTTA CAATTAAATC AAATTTTTGG GGACGTGGTT
TACGCTTCAA TAAGTGATTC CGAAAAGGGC CTATCGTTGA TAAGCAAAAC GTACAAACAG
GCTCTTTTTG CCATTGGATT TAAATTTTAT TCACCTGGTA AAAGCCTCAT AAGTTACGAG
AATGTTAAGG GGCTGAATCT TAATTTCGAA ATGTGTACGG AATATTATAA TGCACTATTA
GATTTCATAA GGGCTAACCG TGTTGAGGAT ATTGAACCTG TTGTATGCAG ACTTTTTAAG
AATTTTTCAG AAAACCACAG TGCTCCACAG ATAATCATTT CTTATCTCAT GAACTTTCAG
ATGGAGTTGG TAAAGCTCAT AATGGAGATA GGAGGAGACT TAAAGGAGTT TCTAACTCTT
GCATTGGGTT TTCAAAAAAC AGCAGAGCAC TTAAATATGT CAAATCTTCA GGGTGAATTT
CTGAAGCAAT GCTTGAGTGC TGCTTCGCAT ATAAATGGTT TTAAACAGGG AAACCCTCAG
TTTATTGTAT GTGAGGTAAA AAACTATATT AAGCAGAATT ATTGCAAGGA CATTAAATTG
AAAGAGGTAG CAAGGCATTT TTACATGAAC TCCGTGTACC TCGGACAGTT GTTTAAAAAG
GTTTCAGGTG TACAGTTTAA CGATTACCTG AATAATGTAC GTGTTGAAGA AGCTAAAAAG
CTTCTTCAGA GAACAGATAT GAAGGTGAGC GAGATTTCAA GTGCGGTGGG CTACAATGAT
CCAAAATATT TTTTAAGTAA ATTTAAAGCA ATTACAAGTC TGCCCCCTTC AGCCTTTAAA
ACAGGGAAAA CTACCTAA
 
Protein sequence
MLKVLLVDDE PYVLEGLKVM LDWEAHGFRI CGEASNGEDA LEIVRVCNPD LIMTDISMPR 
IDGLELIRLS TENLKSTAKF VILSGYDDFS YAKRAMLYNA SNYLLKPLDD VELDSVVTKL
AEQIKKERKE TENINKQLSF IANQSIIRLI NGDNKPSLIG RVGMLLDIAE DEEFRCILFE
IDSADSWVQG EERTELNINR TTPARVIEDA LDPAFQFRIF EDGKGRIGII ASEKMPFFNT
LEEFTKSLLL QLNQIFGDVV YASISDSEKG LSLISKTYKQ ALFAIGFKFY SPGKSLISYE
NVKGLNLNFE MCTEYYNALL DFIRANRVED IEPVVCRLFK NFSENHSAPQ IIISYLMNFQ
MELVKLIMEI GGDLKEFLTL ALGFQKTAEH LNMSNLQGEF LKQCLSAASH INGFKQGNPQ
FIVCEVKNYI KQNYCKDIKL KEVARHFYMN SVYLGQLFKK VSGVQFNDYL NNVRVEEAKK
LLQRTDMKVS EISSAVGYND PKYFLSKFKA ITSLPPSAFK TGKTT
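
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the gene sequence, and compute its GC fraction. It is an illustration rather than the method used to build this record: the helper names are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons; this gene starts with ATG, so that difference does not arise here).

```python
from itertools import product

# Codon assignments of the standard code (shared by table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-character block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, dropping one trailing stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence give the first ten residues.
    print(translate("ATGTTGAAAG TACTTCTTGT AGATGATGAA"))  # MLKVLLVDDE
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_fraction` on the same input can be compared against the reported GC content.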