Gene Cthe_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2689 
Symbol 
ID4808861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3174251 
End bp3175747 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content42% 
IMG OID640108108 
ProductGntR family transcriptional regulator 
Protein accessionYP_001039081 
Protein GI125975171 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00169618 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGAAC TGTTTAAGTC TGTAAAACTT GATAAAAATT CAGGCACTCC TTTATATATG 
CAACTCAGCG ACAAAATCGC CGAAATGATT GAAAACGGGA TACTTCCGGC TGATCTGAAA
CTTCCTTCAA TACGTCAAAT GTCGGCCTTG CTCAATGTCA ATTCAGTGAC AATAGTGTCC
TGTTACAAAC ACCTGGAAAC AAAAGGCTAT GTATATTCCA GGCCGGGAAG CGGTACGTAT
GTTGCCGTTG TTCTGCCCAA GCAAAGCGAA AATTACTCAG ACCGCAACAT TATACTGGAT
GAGCTTTATC AAAGCGACGA CCTCAACTTA ATCAACAACG GGCATATTAA AATCAATGAA
AATACCATTA ATTTTGCCAG CGCCACCCCC AAGTCAAGTC TGTTTCCGGT GGAAAACTTC
AAGCTTGTTT TAAATGAAGT ACTTGACCGT GACCGGGGAA ACGCTTTTGA CTATCAGGAC
AGTCAAGGCT ATTACCCTTT GCGTTCGTCA GTCCGCATTC TGCTCGAAAA AAGAAAAATT
GCGTGTCATG AGGAAAATAT TCAGATCATT TCCGGCGCGC AGCAGGGAAT AGATATAATT
GCCAAAGCCC TGTTGAGGCA GGGTGACTAT GTGATTACCG AAAGTCCCAC CTATACGGGA
GCTATAGCGG TATTTAAATC AAGAGGAGCG GAAATCGCCG ATGTTCCCTT GTCTTGTGAC
GGTCCGAATC TTAATATCCT TGAGTACAAC CTTAAAAAAT ACAAACCAAA GCTTATTTAC
ACAATACCGT CTTTTCAGAA TCCTACAGGG ATATCATATT CCAATGAGAA AAGGAAGGAA
ATTTTGGCCC TTGCGGAAAG GTATGACGCT TATATTATCG AGGATGATTA TGTAAGCGGA
CTGGACTTTG AAAATATGGG GTTTGCCACC GTTAAATCCA TGGATAAATC CGACAGAGTT
ATATTTCTCA AAAGTTTTTC CAAGATTTTT ATGCCGGGAC TCAGGCTTGG CTTTATGGTT
GTGCCGTCAA GCCTTAAAAG TTATATAATA GAAGCAAAAC ATGCAACGGA CATATCCACC
TCCGGACTTA TCCAGAGGGC TTTTGACCTT TATATAAGAA AAGGAATGTG GGACGGGCAT
TTTAAACTAA TGTTCAATAT TTACAAAGAA AGGTATTATA AAACTATAGA AGCTTTGGAA
AGGCATCTTC CCCAAAATGT TCAGTTCCAC AAGCCCGGAG GCGGCCTGAA TATTTGGCTC
GAGCTTCCAG CAAACTGCTT TACAGGCAGC CTGTTAAAGG CTGCGTCTGC CGAGAATATT
GTGTTTGCTC CCGGTAGGAT CTTCTACAGC AGTACGCCAA CCAATCTCAA CAATATAAGG
TTAAGCTTTG CCGCAGTTGA CGCTGATGAA ATTGAAAGGG GCATTGAAAA ACTTTCCGAA
TTGATTTTAC GCTTTGACAG GAACAAGTCC CTGACAGATA ATATTCCAAT ACTGTAA
 
Protein sequence
MLELFKSVKL DKNSGTPLYM QLSDKIAEMI ENGILPADLK LPSIRQMSAL LNVNSVTIVS 
CYKHLETKGY VYSRPGSGTY VAVVLPKQSE NYSDRNIILD ELYQSDDLNL INNGHIKINE
NTINFASATP KSSLFPVENF KLVLNEVLDR DRGNAFDYQD SQGYYPLRSS VRILLEKRKI
ACHEENIQII SGAQQGIDII AKALLRQGDY VITESPTYTG AIAVFKSRGA EIADVPLSCD
GPNLNILEYN LKKYKPKLIY TIPSFQNPTG ISYSNEKRKE ILALAERYDA YIIEDDYVSG
LDFENMGFAT VKSMDKSDRV IFLKSFSKIF MPGLRLGFMV VPSSLKSYII EAKHATDIST
SGLIQRAFDL YIRKGMWDGH FKLMFNIYKE RYYKTIEALE RHLPQNVQFH KPGGGLNIWL
ELPANCFTGS LLKAASAENI VFAPGRIFYS STPTNLNNIR LSFAAVDADE IERGIEKLSE
LILRFDRNKS LTDNIPIL