Gene Cthe_1861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1861 
Symbol 
ID4809412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2205977 
End bp2207062 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content36% 
IMG OID640107280 
ProductCdaR family transcriptional regulator 
Protein accessionYP_001038275 
Protein GI125974365 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3835] Sugar diacid utilization regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0198483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGTAA AGATATATCA GAATCTTGTA AATCAAATAA AGGACGTAAT AGATTCGGAG 
TTTGGGATAA TGGATGACAC AGGTCTTATT TTGGCCTGCT CAGATGAAAA GAAAGTGGGA
CAGAGCAGTT CACTGGTATC TGAGATAATG AGGTCCAAAG ATCAGTTTGT GGTGATTGAC
GGACAAACGT TTCAGAAGGT ATACATAAAG AACAAACTTG AATTTATTAC GTTTATTGAT
TCGGATTCTG AAAACAGCCA AAAATTTCTT GCGTTGATAT CGATTAATAC CATTAACGTA
AAGAATTATT TTGACGAGAA ATATGATAAG ATCAGTTTTA TAAAAGGAAT TATAATGGAT
AATATTCTTC CAGGAGATAT CACTTTGAGG GCAAAGGAGT TGCACCTTCA AAATAATGTA
AACAGAGTGG TTTTTCTTGT GGAAACCGAA AAGGCAAAAG ATATTTATGC CCACGAGATA
ATTGAAGGGC TTTTTCCGGT TAAAAACAAA GACTTTGTTG TAGTGCTTGA CGATGAGAAA
GTTGTGCTTA TAAAAGAGTT GAAGCCGGAC TATGACTACA AGGAGATAAA CAAAATTTCC
AAAGTTATTA TTGATACTTT GTCCACGGAG GGAATGATTA AAGCCAGGGT TGGAATCGGC
ACGGTTGTTG ACAATATAAA GGATATAGGA CGTTCTTTCA AAGAAGCACA GATGGCGCTG
CTTATAGGAG GCATTTTTGA CAGCGAAAAG AGTATTGTGG ATTACAACAG ACTTGGGATA
GGAAGGCTCA TATATCAGCT TCCTCCGACA TTGTGCAAGC TGTTTTTAAA AGAGGTGTTC
AAGGAAGGCT CCTTTGAAGC TTTGGATTCC GAGACGATGT ATACGATTAA CAAATTTTTT
GAAAACAATC TTAATGTAAG TGAAACTTCA AGGCAGCTTT ATGTTCATCG AAACACCCTT
GTGTACAGAC TGGATAAGAT TCAAAAAATT ACAGGCCTTG ACCTTAGATT GTTTGACGAT
GCAATAATAT TCAAAGTTGC CATGCTGGTA AAAAAATATC TTGACAGTAA TCAAGCCCTT
GTATAG
 
Protein sequence
MSVKIYQNLV NQIKDVIDSE FGIMDDTGLI LACSDEKKVG QSSSLVSEIM RSKDQFVVID 
GQTFQKVYIK NKLEFITFID SDSENSQKFL ALISINTINV KNYFDEKYDK ISFIKGIIMD
NILPGDITLR AKELHLQNNV NRVVFLVETE KAKDIYAHEI IEGLFPVKNK DFVVVLDDEK
VVLIKELKPD YDYKEINKIS KVIIDTLSTE GMIKARVGIG TVVDNIKDIG RSFKEAQMAL
LIGGIFDSEK SIVDYNRLGI GRLIYQLPPT LCKLFLKEVF KEGSFEALDS ETMYTINKFF
ENNLNVSETS RQLYVHRNTL VYRLDKIQKI TGLDLRLFDD AIIFKVAMLV KKYLDSNQAL
V