Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2212 |
Symbol | |
ID | 4811077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2641040 |
End bp | 2641948 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107618 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001038607 |
Protein GI | 125974697 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATAATA GTAATAATAA TATTCTTCCG AAGGCATGGG AAAGTTTTCT CAGTTCATCA ACTTTTATTC CGGTTATAGT GAAAACCATT GAGAGGTTTC ATGATACCAG CTGGCATATG GAACCGAACA AGCATGAATG TTTCGAGATG GTGTATATAA AAAGGGGCAA GGCTGTTTTT GAGATAGCAG GTTATCCTGC GGAGATAGGC CCTAATGACA TTATTATAAT AAAGCCCAAT CAGCCCCATA AATTTATTGT AAAGTCCGAG TCCGGATGCG AGTTTATAGT CCTGAGCTTT AAATTTGTAA ACCGGTTTGA CGGCCAGTAT TCCGATGTGT CCCTTGAAAA CTTTTTGGAC TTTGTAAGCG GGAAGGAAAC AGGACCTTTT ATAACCCTGA AAGTCAGCCA AAAGAACGAT ATTATAGTGC TTTTAAACAG GATACTCAAA GAAAGGGAGA ATCCTGACAT AGGAAGCGAG TTTTTAAACT ATCTTTTGGT CATGGAGCTG TTTGTGCTTA TTTCCCGTGC TTTGAAGATG GAATGGGAGA ACAGCATAAA AAACAAAAGC CCGAAGATAA AGGAGCTTAT ACAGGCTTCT GTAAACTATA TAAACAACAA TTATGAGAGG GATATTTCCT TAAAGGATAT AGCCCGGTAT GTTTTTCTGA GCACAAGTTA CTTTACTCGG GCATTTAAGG AAGAAATGGG AATAAGTCCG ATAAATTATC TTTTAAAAAT AAGAGTGGAA AGGGCAAAGG AACTTCTTAA GGATACCGAC AACAGGATAA GCGACATTGC CCTAAGTGTC GGATTTTCCA ACCAGCAAAG ATTCAATGAT ATTTTCAAAA AGTATGTAAA GCTTACACCT CTTCAGTACA GAAAGAATGT ACAAGTCAAA AAACATTAA
|
Protein sequence | MDNSNNNILP KAWESFLSSS TFIPVIVKTI ERFHDTSWHM EPNKHECFEM VYIKRGKAVF EIAGYPAEIG PNDIIIIKPN QPHKFIVKSE SGCEFIVLSF KFVNRFDGQY SDVSLENFLD FVSGKETGPF ITLKVSQKND IIVLLNRILK ERENPDIGSE FLNYLLVMEL FVLISRALKM EWENSIKNKS PKIKELIQAS VNYINNNYER DISLKDIARY VFLSTSYFTR AFKEEMGISP INYLLKIRVE RAKELLKDTD NRISDIALSV GFSNQQRFND IFKKYVKLTP LQYRKNVQVK KH
|
| |