Gene Ccel_0142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0142 
Symbol 
ID7309053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp159948 
End bp162230 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content32% 
IMG OID643607071 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002504510 
Protein GI220927601 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATT ATTCGGTAAT TGTTAAATTA ATGATTGTAC TTTCTTTATT TATAATTATA 
CCTATTATCA CTATTACATT CATTTTCAAT TATGGCATAA TGAAATATTC TGAAGATGAA
ATAGGTCGTT CTGGCTTGGG AAAGCTGGAT TCTGCAAAAA GTGTTACAGA TCTTCTAAGC
CAGACCATCA ATAACGAAAT CCTTCATTTG TCTTTGGATG AAACTCTTAA CAGATTGTAT
GGAATGGATG ATATAAATCA GGCTATTAAA GACAGTGATA ACAATCTAAT GCTCTATCAG
TTCGTATCCA AGCTATCCGA TATAGTTAAA ACAAATAATG CTTACTCTTC TGTATACCTT
TACCTAGATA ATTCTAATTA TATAGCAACC AGTAATGGCG TTTATCCCAA GGAAACCTTT
CCCGATGAAA CCTGGCTTAA ATACTATATA AAAAGCAAAG AACTCGGAGA ACCGTTATCC
TGGACAGACC CCAGACCAAG TGATACTAAT GGAAACAGCT CAGTCATATC ATATATTTTC
CCTTTGACTT ATACAACTAA TCTTAAAGGG TCAATTACGA TAAATATATA TGAAAGCAGA
CTGAGTAATC TTATTAACAG TAACAATTAT GACATCAGTG ATTTTATAAG CATTATAAAT
GCAAATGGTG AAGTCATATC TTCGGTTGAT AAAACCATAC TTAATAAAAA CTTAACTGAT
GTTCCATACA TATCAAAGAT TTTAAATAGT GAAATCGAAA GAGGACATTT TATTCATAGT
ATTAACGAAA AAAGGTTTCT GATTACTTAT CTGAAAACAG GTGCCGAAAA TTGGACATTT
GTAGGCGTAT TTTCTCTGGA TACCCTAACT ACCAAGGTAA ACTCTTTAAA AATGTCGATA
ATATACATAT CTATTTTTCT TTTAGTGCTT TTTGTTCTGT TGTCATATGT AATTTCTCGT
AGATTGTTCA ACCCTGTAAA AAAACTTGTA CAGGAGATAA AATCCCGAAA GGGTATTGAC
ATTATAGGAA ACGGAAATGA GTTCACTCTT CTGTCCAAAA CCTTTGATGT CATGATAAAA
CAAGAGGATC AGCTATTTCG TACCATTGAA AGGGATCAGA AAAATCTTCG TGAAAATTAT
TTATTAAGCC TTTTAAGGGG AAAGCCTTCA AACTCTGAAG ATGAAATGAA GCTTTTCCCG
TTGAAGAATA CCCTGTGTTG TGTAATATAT ATTGATAAAT ACAATGATTT TATATCAAAC
TTCTCCTATG AACATCAGTA TTACCTTAAA TCAGTAATAT TGAACCTATC CGAAGAAAAA
GTGGGTGAAT CTTTTATTTG TTCCGGTGTT GTTCTTGACG GAGATAAGGT AGTTATTATT
GTTAATATAA CCGATGAGGA CATTATAAAG GTTACTCATG TACTGAAAAG TGCCTTCTCT
ATTGTACAGC AGGAAGCGTC AAAAATAATT GAAACCACCA TATCCGTTTG TCTCGGAGGT
ATTTACGATG ACATTCTAAA AATAAGGAAT TCATATAATG AAGCACAGAA TTTAATTAAA
CAAAGATTTA TAATAGGCCA CGAAGCTTTT ATATATCAAA AGGAGAACAC TGCAGCTACT
AACAAATATT TCTACCCGTT TAATATGGAA AAACAGATAT TCAACAATAT TGATATAGGT
TCCAAAGATG CACTTTTATC TTCTATAAGC AGCTTTTTTG ATGAAATTAA GCATAATAAG
AACCTCAGCT ATGACAATAT CATGCTTATT CTTAATCAGT TGCTTGGAAG CACCATAAAA
TATTTGCTGG ATTTAAATAT AAGCGTCAGC AAGGTCTTCG GAAATGACTT TAACATATAC
AGTAAGCTTG CCGAGATTGA CACTCTGGAC GAAGCTGGTT TGTTCCTCTC AAACATCTAC
TTACAGATTA TTGAGTACAA TGAAATGTTC AAAGTTGAAG GGAAGTCCCA TATAGTAAAG
ATTTTGGACT ATATTCATAA AAACTATAAG AAGGATATTG CAATTAATAT GCTGGCTGAA
CATGTGGGAC TCAGCTATTC CCACGTAAGA AAAATATTTA ATGATGAAAC GGGAGAAAAC
ATTGTTAATT ATATAAATAA TATGAGAATT GAAGAAGCAC AGCGTCTGCT GCGTCAAACA
AACATGAATA TTAACGATAT TGCTCTTAGT CTCGGATATA ACAATAAACA GAGCTTTAAT
AGGTTCTTTA AAAAATATGT TGGTATTAAC CCCGGAGAAT ACAGAAATAT AAAGGCAAAT
TAA
 
Protein sequence
MKNYSVIVKL MIVLSLFIII PIITITFIFN YGIMKYSEDE IGRSGLGKLD SAKSVTDLLS 
QTINNEILHL SLDETLNRLY GMDDINQAIK DSDNNLMLYQ FVSKLSDIVK TNNAYSSVYL
YLDNSNYIAT SNGVYPKETF PDETWLKYYI KSKELGEPLS WTDPRPSDTN GNSSVISYIF
PLTYTTNLKG SITINIYESR LSNLINSNNY DISDFISIIN ANGEVISSVD KTILNKNLTD
VPYISKILNS EIERGHFIHS INEKRFLITY LKTGAENWTF VGVFSLDTLT TKVNSLKMSI
IYISIFLLVL FVLLSYVISR RLFNPVKKLV QEIKSRKGID IIGNGNEFTL LSKTFDVMIK
QEDQLFRTIE RDQKNLRENY LLSLLRGKPS NSEDEMKLFP LKNTLCCVIY IDKYNDFISN
FSYEHQYYLK SVILNLSEEK VGESFICSGV VLDGDKVVII VNITDEDIIK VTHVLKSAFS
IVQQEASKII ETTISVCLGG IYDDILKIRN SYNEAQNLIK QRFIIGHEAF IYQKENTAAT
NKYFYPFNME KQIFNNIDIG SKDALLSSIS SFFDEIKHNK NLSYDNIMLI LNQLLGSTIK
YLLDLNISVS KVFGNDFNIY SKLAEIDTLD EAGLFLSNIY LQIIEYNEMF KVEGKSHIVK
ILDYIHKNYK KDIAINMLAE HVGLSYSHVR KIFNDETGEN IVNYINNMRI EEAQRLLRQT
NMNINDIALS LGYNNKQSFN RFFKKYVGIN PGEYRNIKAN