Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2890 |
Symbol | |
ID | 4809097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3415714 |
End bp | 3416883 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108309 |
Product | putative transcriptional regulator |
Protein accession | YP_001039281 |
Protein GI | 125975371 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTGCC ATAAGCTGAG GCGGCTGCTT AAAAAGGAAG AGGGTCCAAA GCTTGACTTT AAGGCAGAAT ATGATTTATC CACTGAAAGC GGGAAAAAAG AGCTCACCAA AGACGTTATT GCCATAGCCA ATTCCAGGGG CGGGCGAGGT TACATTATCC TGGGCATTGA AGACAAAACC AAAAGAGTGC TGGGAATTGA GCCTAAAAGA TATACTGAGG AGCAGATTCA GCAGATAATT TACAACCGCT GTGACCCACC TGTGCCGATA TCAGTGGACT TTGTCGGGCT GGATGGAAAA ACGGTCGGGG TTATCACTGT CTACAGAAGC AGTCATAAGC CCCATCAGAT GATACAAAAC GGTGCGTTTT ACATAAGAAG AGGTTCAACC ACCGACATAG CAAGGCGCAG TGAAATTGCA AATCTTTTTC AGGAAAACGG TCTTATGACC TATGAGACGG TGATATTAAA AAATGTCGGT ATGGAGGAAC TGGACTTCGG GCTTATTAAG GATTACTTCA AAACTCTGAA TGTTTTTAGC GACAATCCCA GTGAGTTGAT TCTTGAGGCT TTGGGCATAA TCGGACAAAA GTCCGATACG GAGGAATACC ATCCAACCAT AGGGGGGCTT TTGCTGTTTG GGAAAAATCC TTCTTTGTAC CTTCCCCATG TTTATGTAAA AGTCGTTTAC AATGGAGAGG CCCGGCTGTT TTTCGGGAAT ATTTTAAAGA TGCTCGATGA TGTATCCGAC TATATGAGAA GCATAATAAA GGTGGAAGGA TATCCTTTTA ATGCTTTGGA AGAGGTAATT GCCAATGCTC TTGTTCACAG GGACTATTTG GATGTTTCAA AAGGAATTGT AATAACGGTT ACGGATAAAA ATATTGAAAT TAGCAATCCC GGAGCTCTTA TTGCCGGCAA CAGTGTCTAC AGTTTTTCAA GAGAGATCAA TCCCGACAGG CGAAATCCAT GGCTGTATCA AAGACTTCTG ACCTTGGATC CCAAAAAAAG GTTTATGAAA TCGGGAGTTG GATTAAAAAG GGTGAAAAAA TCTTTTACCG GCATTGGACC TGTGAAATTT ATAAATATTG GTTCCCAGAA TTTGTTTAAG GTGCTCCTGC CTATTGGAAA AAAACAAGAG TCAACCTCTG ATACAAGCGG TTTATTTTAA
|
Protein sequence | MDCHKLRRLL KKEEGPKLDF KAEYDLSTES GKKELTKDVI AIANSRGGRG YIILGIEDKT KRVLGIEPKR YTEEQIQQII YNRCDPPVPI SVDFVGLDGK TVGVITVYRS SHKPHQMIQN GAFYIRRGST TDIARRSEIA NLFQENGLMT YETVILKNVG MEELDFGLIK DYFKTLNVFS DNPSELILEA LGIIGQKSDT EEYHPTIGGL LLFGKNPSLY LPHVYVKVVY NGEARLFFGN ILKMLDDVSD YMRSIIKVEG YPFNALEEVI ANALVHRDYL DVSKGIVITV TDKNIEISNP GALIAGNSVY SFSREINPDR RNPWLYQRLL TLDPKKRFMK SGVGLKRVKK SFTGIGPVKF INIGSQNLFK VLLPIGKKQE STSDTSGLF
|
| |