Gene Cthe_3162 details

Gene Information

Locus tag: Cthe_3162
Symbol:
ID: 4809612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3735872
End bp: 3737041
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 40%
IMG OID: 640108595
Product: SAM dependent methyltransferase
Protein accession: YP_001039550
Protein GI: 125975640
COG category:
COG ID:
TIGRFAM ID:
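
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon, and the function names are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3735872
end_bp = 3737041
gene_length_bp = 1170
protein_length_aa = 389


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```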


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAC AAAGTTTGAA CAAATTGTCC TTGTTTTTAA CCGGTATGGA GCAAAGACTA 
ATTGACAACT GGGAGTTTTT CAAAGGCATA ACCGCGGTTT TCAAGTCCGG AACCAGGGAA
TTTCCTGCAA AAGTGTGGCA GGACGGAAAT AAACTTAAAA TGAATTTCAG CGGCAGTACC
GAAACGTTGG AATCAAACTG GCTTTGTGCA AGGATTGCAA AAATTGCGCA AAACTACGAC
AGTGTTGTAA TCAACTATGA AGAAAGAGGC ACAACAATTA TTATCGAGGC CGACGACAAA
AACGTGAGGA TGAAGACCCA GGAAGCAAAG GAACAGAAAG AGGCGATAAT AGCCCATAGT
GAAACTTCAC ATATTTCAAA CAGGGATTAT TACATCAAAG TTGGACAGGC CGATGAACTT
CTTAGGGAAA TCGGAATATT GGGCAGCAAC GGCAAAATAA AAAATGACAT GATAAGAAAG
TACAATCAGA TAGATCACTT TGTGGAACTT ATCGATGATA TGTTAAAAGA AGCTTTCAGA
GAGAATGAGT CGCTGACGAT TTTGGACTGC GGATGCGGCA AATCCTATCT TACTTTCGTA
TTGAACTATT ATATAAGGGA AGTGTTGAAA AAGCCCTGCC GTTTTATCGG ACTTGACTAC
TCAAGCACGG TAATTGAAGC GTCAAAAAAG ATTGCCCAAA ACCTTGGCTA TCGAAATATG
GAGTTTAAAG TGACGGATAT AAGAAATTTT CACACTTCGG AAAAGATACA CATGGTTATA
AGTCTTCATG CCTGCAACAC GGCAACGGAT GAAGCCATAG CTTTGGCTGT AAACAACAAT
GTAAAAGCCA TGGTCATGGT GCCGTGCTGT CAGCAGGAGA TTTTAAAGCA ATATTCATAT
CCACCCTTTG AACCTATAAT AAAACACGGA ATTTTAAAAG CAAGAATGGC GGATGTGATT
ACCGACGGTA TAAGGGCGCT GATTTTAGAG GCTTTGGGTT ACAAAGTTTC CATTGTGGAA
TACATATCAC CGACGGAGAC ACCGAAAAAC CTTATGCTGA GGGCAGTTAA AACTCAAGGT
CCTGACGAGA AGGCACTTGC GGAATATAAA AAATTGAAAG AAATGCTTGG GATTAACCCA
ACATTGGAAA AATTGATTTA CTTAAAATAA
 
Protein sequence
MNKQSLNKLS LFLTGMEQRL IDNWEFFKGI TAVFKSGTRE FPAKVWQDGN KLKMNFSGST 
ETLESNWLCA RIAKIAQNYD SVVINYEERG TTIIIEADDK NVRMKTQEAK EQKEAIIAHS
ETSHISNRDY YIKVGQADEL LREIGILGSN GKIKNDMIRK YNQIDHFVEL IDDMLKEAFR
ENESLTILDC GCGKSYLTFV LNYYIREVLK KPCRFIGLDY SSTVIEASKK IAQNLGYRNM
EFKVTDIRNF HTSEKIHMVI SLHACNTATD EAIALAVNNN VKAMVMVPCC QQEILKQYSY
PPFEPIIKHG ILKARMADVI TDGIRALILE ALGYKVSIVE YISPTETPKN LMLRAVKTQG
PDEKALAEYK KLKEMLGINP TLEKLIYLK
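
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch (not the pipeline that produced this record), the code below strips the 10-base block spacing and translates codon by codon. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits; since this gene begins with ATG, no special start handling is included. The helper names clean, translate and gc_fraction are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGAACAAAC AAAGTTTGAA CAAATTGTCC TTGTTTTTAA CCGGTATGGA GCAAAGACTA"
print(translate(first_line))  # MNKQSLNKLSLFLTGMEQRL
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to translate and gc_fraction would allow the same comparison against the Protein Length and GC content fields.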