Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1639 |
Symbol | |
ID | 4809334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1965665 |
End bp | 1966963 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107054 |
Product | DNA methylase N-4/N-6 |
Protein accession | YP_001038055 |
Protein GI | 125974145 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATAC AAAAAATATC TGTTGAAAAA CTTAATCCAG CAGCATACAA CCCGCGCAAG GATTTAAAAC CTGGCGATAA GGAATATGAA AAGCTAAAAC GGTCAATAGA GGAATTTGGC TATGTGGAGC CTGTTATCTG GAACCAAAAA ACAGGTAATG TGGTAGGCGG GCATCAACGC TTAAAGGTTT TGCTGGACTT GGGACAGACT GAGATAGACT GCGTTGTAGT GGATCTTGAC CCGCAGAGAG AAAAAGCGCT TAATCTTGCT CTCAATAAGA TTCAAGGAGA GTGGGACGAG AATAAACTGG CCGAACTGAT GGCTGAGTTG GACGCAGGTG CATTTGATGT TTCGCTTACA GGGTTTGACG CCTCTGAAAT AGACGAACTA CTTAACCGAT GGTACTCCAA AGAGGCGGTA CAAGACAGCT TTGACATAGA TAAAGCGCAT GAGGAAATCG TGCAGCGCGA GCCGGTAACG AAGCGGGGCG ATATCTGGCT TCTCGGGAAT CATCGCTTGA TGTGCGGCGA CTCTACGAAG AATGAGGATT TTGAGAAGTT GATGGAAGGG TGTCACGCAC AGATGGCAGT GACTTCCCCG CCTTATGGGG TAGGCAAAGA ATATGAAAAA GCCGGGATTG AGCCATGGTT CGAGACAGTA CGCCCAGTGA TTAGAAACTT ATGCAGGTAT GCAGATATTG TCTGCTGGAA CTTAGGTGAT TTATATGCAA CCGGCTCCCA GTTTATTGAA CCAACCAGCG TGTACAGCGT GAATATGTTT TTGGACAATG GCTATCGACC TATCTGGATC CGTATTTGGA AGAAGCAGGG GCAGAACTTT GGTGTAGGAC CCTATCATCT TGTTTCAAAC AAGCCGGTTC AGCAGTATGA GTATATTTCG GCCTTCAGCA ATAAAGGAGA AGTTGAGGAA TATAACGATC AGGAATATGT ATGGCTTTCA GCCTTTGCGG GACACAGTTA TAAATTTGTG AAACGGCTTA CAAAGGAAGA ACGCAAGAAA TGGGGTTATG CTGGGATATG GGAGATGACC ACTGTACGGG CAAACAAGGA GCATCCTGCA ATGTTCCCTG TGGAGCTTCC ATGGCGGTGC ATCAAAATGC ACAGCGACAA AGGCGGTATT GTGCTTGAGC CGTTCTCTGG TAGCGGAACT ACTATAATTG CGGCTGAACA GACCGAGCGT AAATGCTACG CAATGGAGTT ATCCCCTGTT TACTGTGATT TAGCTGTTAA GCGCTGGGAG GAATTCACCG GCGAAAAAGC CATCAAACTG GAGGGTTAA
|
Protein sequence | MNIQKISVEK LNPAAYNPRK DLKPGDKEYE KLKRSIEEFG YVEPVIWNQK TGNVVGGHQR LKVLLDLGQT EIDCVVVDLD PQREKALNLA LNKIQGEWDE NKLAELMAEL DAGAFDVSLT GFDASEIDEL LNRWYSKEAV QDSFDIDKAH EEIVQREPVT KRGDIWLLGN HRLMCGDSTK NEDFEKLMEG CHAQMAVTSP PYGVGKEYEK AGIEPWFETV RPVIRNLCRY ADIVCWNLGD LYATGSQFIE PTSVYSVNMF LDNGYRPIWI RIWKKQGQNF GVGPYHLVSN KPVQQYEYIS AFSNKGEVEE YNDQEYVWLS AFAGHSYKFV KRLTKEERKK WGYAGIWEMT TVRANKEHPA MFPVELPWRC IKMHSDKGGI VLEPFSGSGT TIIAAEQTER KCYAMELSPV YCDLAVKRWE EFTGEKAIKL EG
|
| |