Gene Cthe_1729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1729 
Symbol 
ID4810159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2050674 
End bp2051909 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content36% 
IMG OID640107142 
ProductDNA methylase N-4/N-6 
Protein accessionYP_001038143 
Protein GI125974233 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase
[COG1475] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTTTA GGAAGTTAAA AATAGACAGC CTTATACCTG CTAAATATAA TCCGAGAAAA 
GATTTAAAGC CGGGTGATAA GGAATATGAA AAGATAAAAA ACAGTTTAAC TGAATTTGGA
TATGTAGATC CCATTATTGT AAATTCAGAC CTTACAATTA TTGGCGGTCA TCAAAGATGG
AAGGTTTTAA AAAGCTTAGG CTATACAGAA GTTGATTGTG TTGTTATTGA TATAGATAAA
ACAAAAGAAA AGGCTTTGAA TGTGGCACTT AATAAAATAA GCGGAGAGTG GAATGAAGCA
CTTCTTGCTG AGCTTATTAA GGATTTGCAG AGTATAGATT ATGATGTTTC CTTTACAGGT
TTTGAACCGC CGGAGATAGA AGAACTGTTT AGCAATGTTC ATGACAAGGA AATAAAAGAA
GATGATTTTG ATGTTGAAGA TGCTTTAAAA GAACCTGTAA TTTCAAAGCA GGGAGATTTG
TGGCTGCTTG GAAGGCACAG GCTTATTTGC GGGGATAGTA CTAAAGCTGA AACATATGAG
GCTTTAATGG ATGGTAAAAA AGCTAATTTA GTGGTTACAG ACCCTCCCTA CAATGTTGCA
TATGAAGCAA AAGCCGGAAA GATTCAAAAT GATAACCTTA AAGATGAGGA GTTTTATAAT
TTCCTTTATA AGGCGTTCAC TAATATGTAT GATGCTATGG AGAAAGATGC TTCAATTTAT
GTATTCCATG CAGATACAGA AGGATTAAAC TTTAGAAAGG CTTTTAAAGC TGTTGGATTT
TATTTATCCG GAGTTTGTAT CTGGGCAAAG CAAAGCTTGG TACTGGGCAG AAGTCCTTAT
CAGTGGAAAC ATGAACCTGT ACTCTTTGGT TGGAAGAAGG AAGGCAGGCA TAATTGGTAC
TCTGATAGAA AACAAAGTAC TATATGGAGC TTTGACAGAC CATCTAAGAA TGCTCTCCAT
CCAACAATGA AGCCAGTAGC TCTTTGTGCT TATCCAATTC AAAACAGCAG CATGAGCAAT
TGTATTGTTC TTGACCCTTT TGGCGGCAGT GGTTCTACTT TGATTGCCTG TGAGCAGACT
AATAGAATCT GCTATACCAT AGAGCTTGAT GAAAAGTATG CGGATGTTAT TGTAAAAAGA
TATATAGAGC AGGTTGGTAC AGATGAAGAA GTATTTTTAG TTAGAGATGG AGTTAAAATT
AAATATGCTG ATATAAAAAA GGAAGGTTGT GATTAA
 
Protein sequence
MQFRKLKIDS LIPAKYNPRK DLKPGDKEYE KIKNSLTEFG YVDPIIVNSD LTIIGGHQRW 
KVLKSLGYTE VDCVVIDIDK TKEKALNVAL NKISGEWNEA LLAELIKDLQ SIDYDVSFTG
FEPPEIEELF SNVHDKEIKE DDFDVEDALK EPVISKQGDL WLLGRHRLIC GDSTKAETYE
ALMDGKKANL VVTDPPYNVA YEAKAGKIQN DNLKDEEFYN FLYKAFTNMY DAMEKDASIY
VFHADTEGLN FRKAFKAVGF YLSGVCIWAK QSLVLGRSPY QWKHEPVLFG WKKEGRHNWY
SDRKQSTIWS FDRPSKNALH PTMKPVALCA YPIQNSSMSN CIVLDPFGGS GSTLIACEQT
NRICYTIELD EKYADVIVKR YIEQVGTDEE VFLVRDGVKI KYADIKKEGC D