Gene Information |
Locus tag | Cthe_3162 |
Symbol | |
ID | 4809612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3735872 |
End bp | 3737041 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108595 |
Product | SAM dependent methyltransferase |
Protein accession | YP_001039550 |
Protein GI | 125975640 |
COG category | |
COG ID | |
TIGRFAM ID | |
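The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 3735872
end_bp = 3737041
stated_gene_length = 1170      # bp
stated_protein_length = 389    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length matches:", gene_length == stated_gene_length)
print("whole number of codons:", remainder == 0)
print("protein length matches:", protein_length == stated_protein_length)
```

Under these assumptions the three checks agree with the table: 1170 bp is 390 codons, one of which is the stop codon.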
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAC AAAGTTTGAA CAAATTGTCC TTGTTTTTAA CCGGTATGGA GCAAAGACTA ATTGACAACT GGGAGTTTTT CAAAGGCATA ACCGCGGTTT TCAAGTCCGG AACCAGGGAA TTTCCTGCAA AAGTGTGGCA GGACGGAAAT AAACTTAAAA TGAATTTCAG CGGCAGTACC GAAACGTTGG AATCAAACTG GCTTTGTGCA AGGATTGCAA AAATTGCGCA AAACTACGAC AGTGTTGTAA TCAACTATGA AGAAAGAGGC ACAACAATTA TTATCGAGGC CGACGACAAA AACGTGAGGA TGAAGACCCA GGAAGCAAAG GAACAGAAAG AGGCGATAAT AGCCCATAGT GAAACTTCAC ATATTTCAAA CAGGGATTAT TACATCAAAG TTGGACAGGC CGATGAACTT CTTAGGGAAA TCGGAATATT GGGCAGCAAC GGCAAAATAA AAAATGACAT GATAAGAAAG TACAATCAGA TAGATCACTT TGTGGAACTT ATCGATGATA TGTTAAAAGA AGCTTTCAGA GAGAATGAGT CGCTGACGAT TTTGGACTGC GGATGCGGCA AATCCTATCT TACTTTCGTA TTGAACTATT ATATAAGGGA AGTGTTGAAA AAGCCCTGCC GTTTTATCGG ACTTGACTAC TCAAGCACGG TAATTGAAGC GTCAAAAAAG ATTGCCCAAA ACCTTGGCTA TCGAAATATG GAGTTTAAAG TGACGGATAT AAGAAATTTT CACACTTCGG AAAAGATACA CATGGTTATA AGTCTTCATG CCTGCAACAC GGCAACGGAT GAAGCCATAG CTTTGGCTGT AAACAACAAT GTAAAAGCCA TGGTCATGGT GCCGTGCTGT CAGCAGGAGA TTTTAAAGCA ATATTCATAT CCACCCTTTG AACCTATAAT AAAACACGGA ATTTTAAAAG CAAGAATGGC GGATGTGATT ACCGACGGTA TAAGGGCGCT GATTTTAGAG GCTTTGGGTT ACAAAGTTTC CATTGTGGAA TACATATCAC CGACGGAGAC ACCGAAAAAC CTTATGCTGA GGGCAGTTAA AACTCAAGGT CCTGACGAGA AGGCACTTGC GGAATATAAA AAATTGAAAG AAATGCTTGG GATTAACCCA ACATTGGAAA AATTGATTTA CTTAAAATAA
|
Protein sequence | MNKQSLNKLS LFLTGMEQRL IDNWEFFKGI TAVFKSGTRE FPAKVWQDGN KLKMNFSGST ETLESNWLCA RIAKIAQNYD SVVINYEERG TTIIIEADDK NVRMKTQEAK EQKEAIIAHS ETSHISNRDY YIKVGQADEL LREIGILGSN GKIKNDMIRK YNQIDHFVEL IDDMLKEAFR ENESLTILDC GCGKSYLTFV LNYYIREVLK KPCRFIGLDY SSTVIEASKK IAQNLGYRNM EFKVTDIRNF HTSEKIHMVI SLHACNTATD EAIALAVNNN VKAMVMVPCC QQEILKQYSY PPFEPIIKHG ILKARMADVI TDGIRALILE ALGYKVSIVE YISPTETPKN LMLRAVKTQG PDEKALAEYK KLKEMLGINP TLEKLIYLK
|
| |
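The sequences above are printed in blocks of ten characters, so the spaces need to be removed before any analysis. The sketch below is a minimal, standard-library-only illustration of how the gene sequence relates to the protein sequence and to the GC content field. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The helper names clean, translate, and gc_percent are hypothetical and are not part of any database tool.

```python
# Standard genetic code laid out in T, C, A, G order. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAACAAAC AAAGTTTGAA CAAATTGTCC"
print(translate(first_blocks))  # MNKQSLNKLS
```

The ten residues printed match the first block of the protein sequence in the record. Passing the full gene sequence to translate and gc_percent would allow the same comparison against the complete protein sequence and the GC content field.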