Gene Information |
Locus tag | Cthe_0861 |
Symbol | |
ID | 4810479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1035988 |
End bp | 1037124 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106277 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_001037288 |
Protein GI | 125973378 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
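The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions are ours, not stated by the record, but they reproduce the listed lengths. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1035988
end_bp = 1037124

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1137 378
```

Both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields.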
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCAAA AAAAAATATT TTCAAAGAAA AAGAATTTAA CTTTTAAAAA AGCCCTGCTC ATTTGGATGG TTAGTTTGAT AGTTTTAGGG GGAATGATTT TTCTTTTGGG AAAATTCGTT CTAAGTGACA GCAACGGCAG TTCAGACGTG GACAATGACC TTACATATCC ACCTGTTAAA AGTTACGAAC CGATTAGTTT TCCTACACCC GGCAAAATCG GCAATGACAA GCCTTCCGAT GAGGAAAACT CCGATAAATT TCAGGGTCTC ACAAACGGAA TGAAATATCA TGAAAAGCTG GTTGACGAAA GCAGTCGAAA CATATTGTTC ATCGGGCAGG ACAGAATTTC CGGCCTGTAT GATACCATAG GTATATTAAG CATTGACAGA AAAAACAAAA AACTGAAAAT AATAATGATT CCCCGCGACC TGTATATCGA TTACAGTTCC AGAGTCAAAC ACTATTTAGA AGAAAACGGC AGATTAAACG ATCCCGACTT TTATAAAATT AATGCCGCGC ACCTTATAGG GCCGTATATG AAATATGAGG GAAAATTTGG TGCATATTCA ATGAATTTTC TGGCCGAACT GATCAAAGAA ATTTTTGACA TTGAAGTTCA TGACTACGTA AGGGTTAACA CCGAAGGCTT CGTCCAAATT GTCGACCTTT TCGGCGGTGT CGACATTAAC GTTCCGTACG ACATGCATTA CGATGACATA TATCAGGATC TCCACATCCA TATAAACAAA GGATGGAACC ATCTTGACGG AAAAAAAGCC GAAGGATTCG TCCGTTACAG GCAAAGCAAC GACGAAATGG GAAACATCAC CCATTCGATC GGCGATTATG AAAGAAAGAA AAACCAGATT AATTTTATAA AAGCATTTAT TGAGCAGCAT GGGACAATGT CCAATATCGA CAAGCTCCCC AGTCTGATTT CTACGTTAAA CAAATACATG AAACACAGCA TCGGAGTCGG AGACGTTCTG ACAACTTACA TTGGCTATGC AAAAGATGTC GTTCTATACA AATACCCGAT AGAAACTTAT ACAGTTACCG GAAAAGACAA ATATATGAAC CAAAGATATT ATATTGTAAT TGAAAATGAC AACAAGACCA GCAAAGTTGT TGACTAA
|
Protein sequence | MGQKKIFSKK KNLTFKKALL IWMVSLIVLG GMIFLLGKFV LSDSNGSSDV DNDLTYPPVK SYEPISFPTP GKIGNDKPSD EENSDKFQGL TNGMKYHEKL VDESSRNILF IGQDRISGLY DTIGILSIDR KNKKLKIIMI PRDLYIDYSS RVKHYLEENG RLNDPDFYKI NAAHLIGPYM KYEGKFGAYS MNFLAELIKE IFDIEVHDYV RVNTEGFVQI VDLFGGVDIN VPYDMHYDDI YQDLHIHINK GWNHLDGKKA EGFVRYRQSN DEMGNITHSI GDYERKKNQI NFIKAFIEQH GTMSNIDKLP SLISTLNKYM KHSIGVGDVL TTYIGYAKDV VLYKYPIETY TVTGKDKYMN QRYYIVIEND NKTSKVVD
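The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch of that cleaning step plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Illustration only: the first two blocks of the "Gene sequence" field.
# Paste the full field here to compare against the record.
example = "ATGGGTCAAA AAAAAATATT"
print(len(clean_sequence(example)), gc_fraction(example))
```

Run on the complete "Gene sequence" field, the cleaned length and GC fraction can be compared with the Gene Length and GC content fields of the record.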