Gene Information |
Locus tag | Cthe_2287 |
Symbol | |
ID | 4809876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2722448 |
End bp | 2723380 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640107693 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001038682 |
Protein GI | 125974772 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000984991 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTAT TTACAAGAAG TTACGATAAA GACTGGAACC GCATTGTATA CTTAAAAGTA TTGGAAAATC AAGAGGCATG CCTTCATCTG AAGGATGATA AGACTTGTAA ATTAATCCTT TTGCTCGAAG GAAATCCCAC CATAAGATAT AATAATCAAA CAGCTTTTAT TATCGCACCC TCTGTTCTTT GCCTGAATCA TTTGGACACC GTAGAATTTG ACTACAACCC CAACTGCAAG ATGACAGTTC TCTTCTTTAG ACCAAGTGCC TTAAATGATC AGCTGGAATA CAGTGCATTT CATTCAAATC TTTATGAAAA AATGGTTGGA ACAACGCTGT TTCAAGATAT GGTATTATTG AGTTCATTTT ATGATATTAA CGATTCCAAA CGTGAGCTCA TCTTATTAGA CACTGTCTCA GCTGTTAGTC TGATGCAATT ACTCAACAAA ATAAACCAAG AGCTGACAGA GCAGAAAGAT GGCTATTGGC CATGCAGAAG CAGATCTTAT TTTATCGAAC TGCTCTTTTT CCTGGAAGGA TTACGCTGTA ACGAGTCATT GCAGAAAATG CGCATCATAT TGGGCAAGAA TAAAAATAGT ATTGTACATA ACATCATTCA GTATCTCAAT CAAAACATCG ACAAGAAAAT TACCCTTGAA CATTTGGAAA AAACATTTGC ATGCAACCGA AATCAGATTA ATAAAGAATT TCAAAAGGAA TTAAATACGA CAGTAATGAA ATATTTTACT CAAATGCGAA TGCAGCTGGC CAGCATCTTG TTAAGAGACA CCGAAATACC GATACTCGAA GTTGCGCTAA GAGTAGGGTA TTCCGATGCC GGTTATTTTT CCAAAACATT TAAATTATAT AGCGGTATAT CCCCCAGTGA ATATCGTAAT TCCTTTTATT CTACGATATC ACCGGCTTTA TAG
Protein sequence | MDLFTRSYDK DWNRIVYLKV LENQEACLHL KDDKTCKLIL LLEGNPTIRY NNQTAFIIAP SVLCLNHLDT VEFDYNPNCK MTVLFFRPSA LNDQLEYSAF HSNLYEKMVG TTLFQDMVLL SSFYDINDSK RELILLDTVS AVSLMQLLNK INQELTEQKD GYWPCRSRSY FIELLFFLEG LRCNESLQKM RIILGKNKNS IVHNIIQYLN QNIDKKITLE HLEKTFACNR NQINKEFQKE LNTTVMKYFT QMRMQLASIL LRDTEIPILE VALRVGYSDA GYFSKTFKLY SGISPSEYRN SFYSTISPAL
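
The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed Gene Length) and that the Protein Length excludes the terminal stop codon (the gene sequence above ends in TAG). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Cthe_2287.
start_bp, end_bp = 2722448, 2723380
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints: 933 310
```

Both results agree with the Gene Length (933 bp) and Protein Length (310 aa) fields listed in the Gene Information table.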