Gene Cthe_1824 details


Gene Information

Locus tag: Cthe_1824
Symbol:
ID: 4809808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2158791
End bp: 2159879
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 38%
IMG OID: 640107238
Product: two component AraC family transcriptional regulator
Protein accession: YP_001038238
Protein GI: 125974328
COG category: [T] Signal transduction mechanisms
COG ID: [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain
TIGRFAM ID:
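
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2158791
end_bp = 2159879
gene_length_bp = 1089
protein_length_aa = 362

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(gene_length_bp % 3 == 0)                       # CDS length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks print True for this record.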


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000196685
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACAGAG TATTGGTCGT GGATGACGAT GTGGCTGTAA GATATATGCT AAAAAGATAT 
AAGGGCTGGG AGTCCTTTGG CTTTGTATTG GCCGGGGAAG CTTCCGACGG CAGGGAAGCA
TTAAGAAAAC TTGATAAAGA ACCCTTTGAT GTGGTCATAT CCGATATAAA AATGCCGGGC
ATGGACGGAA TTGAGCTTTT AAGCGAATTG AGAAACAACG GAAACGACAT ATGTGTCCTG
TTTTTGAGCA CCCACAGTGA TTTTTCTTAC GCAAAGCAGG GGATAAGACT GGGGGTTTTT
GATTACCTGA CAAAGCCTTT TAGCGATGAA ACTTTGGGTG AAGCCCTGGA CCGGGTCAAG
GTTTATCTTG ATGAAAAAAA GAAGCAAAAA GCATTAGTAA ATATATATAA CAAAAATGCC
CTGGAAAGCA GTCAGGTTTA TTATTCAAAG AATGATGAAA AAAAGCTTGT TTCGGTTATA
TTGTCCGGGA GTTTGGAAGC GATAAATTTG GGAGACCGGC TCTTTGAAAA AATGGCGCAG
TTTACCGAAG GTGACGCAAA GAAGCTGGCA ATATTGATGG AAAATATTTT GCTGGAAGTT
GACGAAGGTA TTTATAAAGC TATCCCATGG ATTAAAAATT TGAAGAACAG GCCCTCCAAT
AAGAGTTTTG AAGGAATGGA CGAAAAAGAG CTGAAGGAAC GGTTTATCGA TCATATAAGC
GGTTGGGTGA GACTTGTTGT GAAATTTGAA CTGCATCAAT CGGACAGCTT GATGCGGAAA
ATTTGCGAGT ATGTAATAAA TCATGTGGAA GAGGATATAA AAATTGAAAA TATTGCTAAT
GAACTCTATG TCAGCAGGGA TTATATCGGC AAGTTATTTA AGCAAAAAGC GGGATATAAC
TTGAGTGAGT ATATTACAAA GGTGAAAATG GAACATGCCA AATATCTGAT TTCAAAAGGT
GAATATAAAA ATTATGAAAT AAGCGAGATA TTAGGCTACA AAAAAGCCGA TTATTTTTCA
CAGGTTTTTA AGAGTTATGT CGGCTGTACG CCCAGTGAAT ACAGGAAAAA TGCAGGATTT
GATTTTTAA
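
The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation. Below is a minimal sketch, using only the first printed line as sample input, of how the blocks could be joined and a GC fraction computed. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove whitespace between the 10-base blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed above (sample input only).
first_line = "ATGTACAGAG TATTGGTCGT GGATGACGAT GTGGCTGTAA GATATATGCT AAAAAGATAT"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 2))  # GC fraction of that line only
```

Applying the same two functions to the full 1089 bp sequence would give a value comparable to the GC content field of the record.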
 
Protein sequence
MYRVLVVDDD VAVRYMLKRY KGWESFGFVL AGEASDGREA LRKLDKEPFD VVISDIKMPG 
MDGIELLSEL RNNGNDICVL FLSTHSDFSY AKQGIRLGVF DYLTKPFSDE TLGEALDRVK
VYLDEKKKQK ALVNIYNKNA LESSQVYYSK NDEKKLVSVI LSGSLEAINL GDRLFEKMAQ
FTEGDAKKLA ILMENILLEV DEGIYKAIPW IKNLKNRPSN KSFEGMDEKE LKERFIDHIS
GWVRLVVKFE LHQSDSLMRK ICEYVINHVE EDIKIENIAN ELYVSRDYIG KLFKQKAGYN
LSEYITKVKM EHAKYLISKG EYKNYEISEI LGYKKADYFS QVFKSYVGCT PSEYRKNAGF
DF
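
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below shows one way to reproduce that translation in plain Python. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, no special start handling is included. The `translate` function is a hypothetical helper for illustration, not a tool used by the database.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... matches the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGTACAGAG TATTGGTCGT GGATGACGAT"))  # MYRVLVVDDD
print(translate("GATTTTTAA"))                         # DF
```

The two outputs match the first ten and last two residues of the protein sequence shown above.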