Gene Information |
Locus tag | Cthe_3164 |
Symbol | |
ID | 4809614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3738385 |
End bp | 3740019 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108597 |
Product | two component AraC family transcriptional regulator |
Protein accession | YP_001039552 |
Protein GI | 125975642 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain |
TIGRFAM ID | |
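The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values reported in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 3738385
END_BP = 3740019


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1635
print(protein_length(length_bp))  # 544
```

Both results agree with the Gene Length (1635 bp) and Protein Length (544 aa) fields.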
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0633186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTGCCG TTATGGTAGT CGATGATGAT CCTAAGATAC GGGAATGGTT TGAAGTTGAA ATAGAATGGG GAAAGATGGG ATTTGATTTT ATCTGTTCAG CCAAGGATGG AATTGACGCG CTCAACAAAT TGAATCAGCA TAATAAGTTG GATCTTGTAA TTTCTGATAT AGATATACCT AAAATGAATG GCATAGAATT GCTTAATTCC ATAAAAGAAT ACAATCTTTC TTCGATGATT GTCTTGTTAA GTGATGAGAA CAACATGGCC CATGTCAAGC AGGGGTTGCT TCTGGGAGCA TTTGATTACA TATTGAAACC TTTGGATAAG GATAAAGTAA TTGATGTTCT CAAAAAAGCC CATGATTCTC TTATGGATAA AAAAATGGAG GAAGAAAGAA ATAAAAACCT GAAGAAGAAG CTGGAGCTGA ATCTTTCTCT TTCCAGAGAT AAAATTTTAC AAGATTTGTT AAGGGGAAAG GAATTTCCTT TTCAGGAGCT TGACTATATT CAGAACGAAT ATAATATCAG TCTTCAAAAA GGAATGGTAC AGGTCGGCAT TATAGAAATC GGAAACTTTG ATGCCGATTC CAGAGAGCTT ATTAAAAGCG GAAGATTTGA CAGCCTGGTG GAGGAAGTCG GAAAAATTAT TTCAAATACT TTGTCGGAAT TTCCGGAGTT AAACTGCAAC ATGGTGGAAA TGGATATCGG GCTTATCAGC GTTATCCTTC AGCCGGTGGG TCAAAAGGAG CTTCAGGACT TTGAAGATAT GACCGCGGAC TTTTTCGAAA AAGTGCTCAA AGGAATAAAG CAGGATGCCA ACATGCGTGC AACCATCGGT ATTGGCGGAG CGTATTCGAG CCTGAGAGAC ATAAGCCAAA GTTACATGGG GGCAAAAGCG GCTCTGCGCC ACAAGTTCAT TTTGGGCGGC AACAGAGTTA TTCATATAAA GACTGATTAT AATGAAAAGC AAAACCTTCT GTACCCCGCC GAAAGGGAAA AAATGCTAGT GGAATCCATA ATGTCCGGGG ATGACAATGC TTTAAAACTT GCCGAGAATA TGTTTGACGA CATTGCGATA GGTACGGGGG ACAATCTTAA AAGGATAGCT TTTGCTGCCA ATCAGCTGGT CTTTAACATA TCGAACTTTA TCGACATGCA GTATGATTTT ATTAATAAAC TGTATGACTT CAGAAAGTTT AACAATATGG ATTTTTCCAA ATTTTCATCA AAGGATGAAA TAAAGGAGTT TTTCCTGTCT TTTGTCACAG AGCTTTTGAA TGTGGTGAAA GAATATAAAC CGGCTCAGAA CAACACTCTT ATAAAGAAGG CGTGTGAGTA TGTTTTAAAT CATATTGACC AGGAAATAAC TCTTATGACA ATAGCCGATT ATTTAAATAT CAGCAAAAAC TATTTCTGCT CTTTGTTTAA GCAGGAGACG GGATATAACT TTTTGGAATA TGTCACCAAA GTAAAAATGG AATGGGCGAA AAAACTCCTG AGGGAAGGAA ATTATAAAAC CTACGAAGTA AGCGAAATGC TTGGCTACAG AGAGGCAAGC TATTTTAGCA GACTCTTTAG GAAATATACG AAGCTAAGTC CTGCAGAGTA CAAGAAAAAT TTTGAAAATC AGTAA
|
Protein sequence | MFAVMVVDDD PKIREWFEVE IEWGKMGFDF ICSAKDGIDA LNKLNQHNKL DLVISDIDIP KMNGIELLNS IKEYNLSSMI VLLSDENNMA HVKQGLLLGA FDYILKPLDK DKVIDVLKKA HDSLMDKKME EERNKNLKKK LELNLSLSRD KILQDLLRGK EFPFQELDYI QNEYNISLQK GMVQVGIIEI GNFDADSREL IKSGRFDSLV EEVGKIISNT LSEFPELNCN MVEMDIGLIS VILQPVGQKE LQDFEDMTAD FFEKVLKGIK QDANMRATIG IGGAYSSLRD ISQSYMGAKA ALRHKFILGG NRVIHIKTDY NEKQNLLYPA EREKMLVESI MSGDDNALKL AENMFDDIAI GTGDNLKRIA FAANQLVFNI SNFIDMQYDF INKLYDFRKF NNMDFSKFSS KDEIKEFFLS FVTELLNVVK EYKPAQNNTL IKKACEYVLN HIDQEITLMT IADYLNISKN YFCSLFKQET GYNFLEYVTK VKMEWAKKLL REGNYKTYEV SEMLGYREAS YFSRLFRKYT KLSPAEYKKN FENQ
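The relationship between the two sequences can be sketched in a few lines of Python. The gene starts with TTG, which translation table 11 accepts as an initiation codon, so the first residue is read as M rather than L. The helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical and not part of any database tool; the codon assignments are those of NCBI translation table 11, which shares its amino acid assignments with the standard code.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE.get(codon, "X")  # X for ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTTTGCCG TTATGGTAGT CGATGATGAT"))  # MFAVMVVDDD
```

Passing the full gene sequence to `translate_cds` allows a comparison with the protein sequence listed above, and `gc_percent` can be compared against the reported GC content.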