Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2689 |
Symbol | |
ID | 4808861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3174251 |
End bp | 3175747 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108108 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001039081 |
Protein GI | 125975171 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00169618 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGAAC TGTTTAAGTC TGTAAAACTT GATAAAAATT CAGGCACTCC TTTATATATG CAACTCAGCG ACAAAATCGC CGAAATGATT GAAAACGGGA TACTTCCGGC TGATCTGAAA CTTCCTTCAA TACGTCAAAT GTCGGCCTTG CTCAATGTCA ATTCAGTGAC AATAGTGTCC TGTTACAAAC ACCTGGAAAC AAAAGGCTAT GTATATTCCA GGCCGGGAAG CGGTACGTAT GTTGCCGTTG TTCTGCCCAA GCAAAGCGAA AATTACTCAG ACCGCAACAT TATACTGGAT GAGCTTTATC AAAGCGACGA CCTCAACTTA ATCAACAACG GGCATATTAA AATCAATGAA AATACCATTA ATTTTGCCAG CGCCACCCCC AAGTCAAGTC TGTTTCCGGT GGAAAACTTC AAGCTTGTTT TAAATGAAGT ACTTGACCGT GACCGGGGAA ACGCTTTTGA CTATCAGGAC AGTCAAGGCT ATTACCCTTT GCGTTCGTCA GTCCGCATTC TGCTCGAAAA AAGAAAAATT GCGTGTCATG AGGAAAATAT TCAGATCATT TCCGGCGCGC AGCAGGGAAT AGATATAATT GCCAAAGCCC TGTTGAGGCA GGGTGACTAT GTGATTACCG AAAGTCCCAC CTATACGGGA GCTATAGCGG TATTTAAATC AAGAGGAGCG GAAATCGCCG ATGTTCCCTT GTCTTGTGAC GGTCCGAATC TTAATATCCT TGAGTACAAC CTTAAAAAAT ACAAACCAAA GCTTATTTAC ACAATACCGT CTTTTCAGAA TCCTACAGGG ATATCATATT CCAATGAGAA AAGGAAGGAA ATTTTGGCCC TTGCGGAAAG GTATGACGCT TATATTATCG AGGATGATTA TGTAAGCGGA CTGGACTTTG AAAATATGGG GTTTGCCACC GTTAAATCCA TGGATAAATC CGACAGAGTT ATATTTCTCA AAAGTTTTTC CAAGATTTTT ATGCCGGGAC TCAGGCTTGG CTTTATGGTT GTGCCGTCAA GCCTTAAAAG TTATATAATA GAAGCAAAAC ATGCAACGGA CATATCCACC TCCGGACTTA TCCAGAGGGC TTTTGACCTT TATATAAGAA AAGGAATGTG GGACGGGCAT TTTAAACTAA TGTTCAATAT TTACAAAGAA AGGTATTATA AAACTATAGA AGCTTTGGAA AGGCATCTTC CCCAAAATGT TCAGTTCCAC AAGCCCGGAG GCGGCCTGAA TATTTGGCTC GAGCTTCCAG CAAACTGCTT TACAGGCAGC CTGTTAAAGG CTGCGTCTGC CGAGAATATT GTGTTTGCTC CCGGTAGGAT CTTCTACAGC AGTACGCCAA CCAATCTCAA CAATATAAGG TTAAGCTTTG CCGCAGTTGA CGCTGATGAA ATTGAAAGGG GCATTGAAAA ACTTTCCGAA TTGATTTTAC GCTTTGACAG GAACAAGTCC CTGACAGATA ATATTCCAAT ACTGTAA
|
Protein sequence | MLELFKSVKL DKNSGTPLYM QLSDKIAEMI ENGILPADLK LPSIRQMSAL LNVNSVTIVS CYKHLETKGY VYSRPGSGTY VAVVLPKQSE NYSDRNIILD ELYQSDDLNL INNGHIKINE NTINFASATP KSSLFPVENF KLVLNEVLDR DRGNAFDYQD SQGYYPLRSS VRILLEKRKI ACHEENIQII SGAQQGIDII AKALLRQGDY VITESPTYTG AIAVFKSRGA EIADVPLSCD GPNLNILEYN LKKYKPKLIY TIPSFQNPTG ISYSNEKRKE ILALAERYDA YIIEDDYVSG LDFENMGFAT VKSMDKSDRV IFLKSFSKIF MPGLRLGFMV VPSSLKSYII EAKHATDIST SGLIQRAFDL YIRKGMWDGH FKLMFNIYKE RYYKTIEALE RHLPQNVQFH KPGGGLNIWL ELPANCFTGS LLKAASAENI VFAPGRIFYS STPTNLNNIR LSFAAVDADE IERGIEKLSE LILRFDRNKS LTDNIPIL
|
| |