Gene Information |
Locus tag | Ccel_0142 |
Symbol | |
ID | 7309053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 159948 |
End bp | 162230 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643607071 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002504510 |
Protein GI | 220927601 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
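The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(159948, 162230)
print(length_bp)                  # 2283
print(protein_length(length_bp))  # 760
```

Under these assumptions the computed values agree with the Gene Length (2283 bp) and Protein Length (760 aa) rows above.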
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATT ATTCGGTAAT TGTTAAATTA ATGATTGTAC TTTCTTTATT TATAATTATA CCTATTATCA CTATTACATT CATTTTCAAT TATGGCATAA TGAAATATTC TGAAGATGAA ATAGGTCGTT CTGGCTTGGG AAAGCTGGAT TCTGCAAAAA GTGTTACAGA TCTTCTAAGC CAGACCATCA ATAACGAAAT CCTTCATTTG TCTTTGGATG AAACTCTTAA CAGATTGTAT GGAATGGATG ATATAAATCA GGCTATTAAA GACAGTGATA ACAATCTAAT GCTCTATCAG TTCGTATCCA AGCTATCCGA TATAGTTAAA ACAAATAATG CTTACTCTTC TGTATACCTT TACCTAGATA ATTCTAATTA TATAGCAACC AGTAATGGCG TTTATCCCAA GGAAACCTTT CCCGATGAAA CCTGGCTTAA ATACTATATA AAAAGCAAAG AACTCGGAGA ACCGTTATCC TGGACAGACC CCAGACCAAG TGATACTAAT GGAAACAGCT CAGTCATATC ATATATTTTC CCTTTGACTT ATACAACTAA TCTTAAAGGG TCAATTACGA TAAATATATA TGAAAGCAGA CTGAGTAATC TTATTAACAG TAACAATTAT GACATCAGTG ATTTTATAAG CATTATAAAT GCAAATGGTG AAGTCATATC TTCGGTTGAT AAAACCATAC TTAATAAAAA CTTAACTGAT GTTCCATACA TATCAAAGAT TTTAAATAGT GAAATCGAAA GAGGACATTT TATTCATAGT ATTAACGAAA AAAGGTTTCT GATTACTTAT CTGAAAACAG GTGCCGAAAA TTGGACATTT GTAGGCGTAT TTTCTCTGGA TACCCTAACT ACCAAGGTAA ACTCTTTAAA AATGTCGATA ATATACATAT CTATTTTTCT TTTAGTGCTT TTTGTTCTGT TGTCATATGT AATTTCTCGT AGATTGTTCA ACCCTGTAAA AAAACTTGTA CAGGAGATAA AATCCCGAAA GGGTATTGAC ATTATAGGAA ACGGAAATGA GTTCACTCTT CTGTCCAAAA CCTTTGATGT CATGATAAAA CAAGAGGATC AGCTATTTCG TACCATTGAA AGGGATCAGA AAAATCTTCG TGAAAATTAT TTATTAAGCC TTTTAAGGGG AAAGCCTTCA AACTCTGAAG ATGAAATGAA GCTTTTCCCG TTGAAGAATA CCCTGTGTTG TGTAATATAT ATTGATAAAT ACAATGATTT TATATCAAAC TTCTCCTATG AACATCAGTA TTACCTTAAA TCAGTAATAT TGAACCTATC CGAAGAAAAA GTGGGTGAAT CTTTTATTTG TTCCGGTGTT GTTCTTGACG GAGATAAGGT AGTTATTATT GTTAATATAA CCGATGAGGA CATTATAAAG GTTACTCATG TACTGAAAAG TGCCTTCTCT ATTGTACAGC AGGAAGCGTC AAAAATAATT GAAACCACCA TATCCGTTTG TCTCGGAGGT ATTTACGATG ACATTCTAAA AATAAGGAAT TCATATAATG AAGCACAGAA TTTAATTAAA CAAAGATTTA TAATAGGCCA CGAAGCTTTT ATATATCAAA AGGAGAACAC TGCAGCTACT AACAAATATT TCTACCCGTT TAATATGGAA AAACAGATAT TCAACAATAT TGATATAGGT TCCAAAGATG CACTTTTATC TTCTATAAGC AGCTTTTTTG ATGAAATTAA GCATAATAAG AACCTCAGCT ATGACAATAT CATGCTTATT CTTAATCAGT TGCTTGGAAG CACCATAAAA 
TATTTGCTGG ATTTAAATAT AAGCGTCAGC AAGGTCTTCG GAAATGACTT TAACATATAC AGTAAGCTTG CCGAGATTGA CACTCTGGAC GAAGCTGGTT TGTTCCTCTC AAACATCTAC TTACAGATTA TTGAGTACAA TGAAATGTTC AAAGTTGAAG GGAAGTCCCA TATAGTAAAG ATTTTGGACT ATATTCATAA AAACTATAAG AAGGATATTG CAATTAATAT GCTGGCTGAA CATGTGGGAC TCAGCTATTC CCACGTAAGA AAAATATTTA ATGATGAAAC GGGAGAAAAC ATTGTTAATT ATATAAATAA TATGAGAATT GAAGAAGCAC AGCGTCTGCT GCGTCAAACA AACATGAATA TTAACGATAT TGCTCTTAGT CTCGGATATA ACAATAAACA GAGCTTTAAT AGGTTCTTTA AAAAATATGT TGGTATTAAC CCCGGAGAAT ACAGAAATAT AAAGGCAAAT TAA
|
Protein sequence | MKNYSVIVKL MIVLSLFIII PIITITFIFN YGIMKYSEDE IGRSGLGKLD SAKSVTDLLS QTINNEILHL SLDETLNRLY GMDDINQAIK DSDNNLMLYQ FVSKLSDIVK TNNAYSSVYL YLDNSNYIAT SNGVYPKETF PDETWLKYYI KSKELGEPLS WTDPRPSDTN GNSSVISYIF PLTYTTNLKG SITINIYESR LSNLINSNNY DISDFISIIN ANGEVISSVD KTILNKNLTD VPYISKILNS EIERGHFIHS INEKRFLITY LKTGAENWTF VGVFSLDTLT TKVNSLKMSI IYISIFLLVL FVLLSYVISR RLFNPVKKLV QEIKSRKGID IIGNGNEFTL LSKTFDVMIK QEDQLFRTIE RDQKNLRENY LLSLLRGKPS NSEDEMKLFP LKNTLCCVIY IDKYNDFISN FSYEHQYYLK SVILNLSEEK VGESFICSGV VLDGDKVVII VNITDEDIIK VTHVLKSAFS IVQQEASKII ETTISVCLGG IYDDILKIRN SYNEAQNLIK QRFIIGHEAF IYQKENTAAT NKYFYPFNME KQIFNNIDIG SKDALLSSIS SFFDEIKHNK NLSYDNIMLI LNQLLGSTIK YLLDLNISVS KVFGNDFNIY SKLAEIDTLD EAGLFLSNIY LQIIEYNEMF KVEGKSHIVK ILDYIHKNYK KDIAINMLAE HVGLSYSHVR KIFNDETGEN IVNYINNMRI EEAQRLLRQT NMNINDIALS LGYNNKQSFN RFFKKYVGIN PGEYRNIKAN
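The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the set of permitted start codons. The sketch does not handle alternative start codons, which is not an issue here because the listed gene sequence begins with ATG. It also assumes the sequence is already given in coding orientation, which appears to be the case even though the gene lies on the minus strand, since it starts with ATG and ends with TAA. The name `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional compact layout: bases ordered T, C, A, G,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAAATT ATTCGGTAAT TGTTAAATTA"))  # MKNYSVIVKL
```

The output matches the first ten residues of the protein sequence. Applying the same function to the last 33 bases of the gene sequence yields the final ten residues, PGEYRNIKAN, with the terminal TAA read as the stop codon.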
|
| |