Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0210 |
Symbol | |
ID | 4808628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 255476 |
End bp | 256537 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640105623 |
Product | LacI family transcription regulator |
Protein accession | YP_001036644 |
Protein GI | 125972734 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000188149 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGCA AGGATATAGC AAAAATCGTC GGAGTTTCAA GAAGTACCGT CTCACGGGTA ATAAACAACT ATCCTGACAT TCCTCAGGCA ACCCGGGAAA AGGTATTAAA AGCAATAAAA GAATACAACT ATTATCCTAA TGCTTCGGCC AGAAGACTTG CAGGAATGAA AAGCTCCACA TTGGGAATTT TTATTATAGA CATAAAAGAC AATGAAAAAC CGCACCATGT AATAGAAAAT AATGAAGACC TTTTATACGG CAATTCATAT TTTTCACCTT TTATAAATGC TTTTATTGAC CAGTCAAACA AAGCCCAGTA CCACGTTTTG GTGTCAACTA TATACAGTTC GGATGAGCTT TGGAAGATTC AAAGCGCTTT TTATGAGAAA AGAATTGACG GTGCGGTGAT AATAGGAAGC AGCAGCATTG ATTATAGTAA AATTTTTGAA ATAATGGATA AAGATTCTAT TACTGTCGCA GTTGATTTAG ATATGGAAAA AGAAAATACA GGCACAGTCA TGTCTGTAAA TATCAATAAT TATGGCGGTG TATCCGATGC AATAGATTAT CTTGTTGAAC TTGGGCACAA AGACATTGCC GTAATTACAG GAGATCTGAA CAAGCTTTCT GGTAAGATAA GATTTGAAAG CTTCAAAGAT GCGCTTTTAA GGCACGGCCT TCCGTTAAAC AATGATTTTA TTGCATATGG GGATTTTACT GAAAATAGCG GCTATGAGGG TATGAAAAAA ATATTGGCAT CCGGAAAAAA GCCAACAGCC GTATTTACAA GCAATGATAC CATGGCTATC GGAGCATACA GGGCAATAAA AGAATACGGG CTTAAAATTC CGGAGGATAT TTCGGTTATG GGATTCGACA ATTCATATAT ATCCCAATAC ATGTCCCCGC CTCTTACCAC AGTCAATGTT TCATTGCCTG AGATAGCAAA ATGCTCGATT GAATTATTGC TTGATTCTAT TAATAATAAG GAGATAAAAA ACAGACAAAA AACGGTGAAT GTTCAGATAG TCAAGCGTAA TTCATGTAAA AAAATTGTCT GA
|
Protein sequence | MNSKDIAKIV GVSRSTVSRV INNYPDIPQA TREKVLKAIK EYNYYPNASA RRLAGMKSST LGIFIIDIKD NEKPHHVIEN NEDLLYGNSY FSPFINAFID QSNKAQYHVL VSTIYSSDEL WKIQSAFYEK RIDGAVIIGS SSIDYSKIFE IMDKDSITVA VDLDMEKENT GTVMSVNINN YGGVSDAIDY LVELGHKDIA VITGDLNKLS GKIRFESFKD ALLRHGLPLN NDFIAYGDFT ENSGYEGMKK ILASGKKPTA VFTSNDTMAI GAYRAIKEYG LKIPEDISVM GFDNSYISQY MSPPLTTVNV SLPEIAKCSI ELLLDSINNK EIKNRQKTVN VQIVKRNSCK KIV
|
| |