Gene Information |
Locus tag | Cthe_0519 |
Symbol | |
ID | 4808268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 635692 |
End bp | 637524 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640105934 |
Product | DNA methylase N-4/N-6 |
Protein accession | YP_001036949 |
Protein GI | 125973039 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2189] Adenine specific DNA methylase Mod |
TIGRFAM ID | |
|
|
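The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length = gene_length(635692, 637524)
print(length)                  # 1833
print(protein_length(length))  # 610
```

Under these assumptions, both values agree with the Gene Length (1833 bp) and Protein Length (610 aa) fields in this record.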
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGAAA CAACTCTTAC GGGAAAAACC CCGGACATAG GGGAAGAAAA TATCAAGAAG TTAATGACCA TGTTTCCTGA AGTTGTTACA GAGGGAAAGG TAGATTTTGA AAAGCTGAAG CAACTTTTAG GTGAATATGT AGATGATAGT AACGAACGCT ATAATTTCAC CTGGAACGGT AAAGGGCGAG CCTTGCGTTT ATCCCAAACA CCTTCACTGG GAACGTTAAG ACCATGTAAA GAAGAAAGTA AAAACTGGGA TACAACCCAG AACCTTTACA TAGAAGGCGA TAACCTAGAG GTATTGAAGC TTTTGCAAAA GAGCTACTAT GGCAAGATTA AAATGATTTA CATAGACCCG CCTTACAACA CGGGGAAAGA CTTTGTATAT AGGGATGATT TCCATGATAG TCTTGAAAAC TATAAGAGAA TTACAGGGCA AGTAGATGGC AATGGCAAAG CAATAAGCAC AAATACTGAA ACCAGTGGTC GATACCACAC CGACTGGCTG AATATGATGT ACCCAAGGCT TAGGCTTGCA AGGAATTTAC TTTCAGATGA TGGTGTTATT TTTATTAGTA TTGATGATAA TGAGGTAGAT AACTTAAAGA AGATTTGTAA TGAGATTTTT GGAGAAGATA ATTTTATTGC TAACTGTGTA AGGAAACGCC GGGATAGTCA AGCTAATTTA TCTCAGAATA TCTCCCCAAT CCATGAGTAT GTGCTTATCT ACGCAAAGCG ATTTGGTAAT ATCCTTAACA AAGTTACACC TTCCCTGGAT ATGGGAAGTT ATAAGAATCC AGATAATGAT CCCCGTGGTC CATACACAAC AATGCCTTGT ACGAATGTTG GAGGAGCGGT TTATTCTATA GTTACACCAA CAGGCAAAAC AATAACCGAT GAATGGCGCT TTAAGAAAGA AACATTTGAG AAATTGCTGT TAGATAATAG AATTGTTTTT CCACGTAATG GAGAAGGAAA ACCACGCTAT AAGATATTTT TATCAGAAAA AATGGCTGAA GGAGTTTTAG CAAATACATG GTTAGACAAA ATCGCCTCAA ATCAAGAAGG AACACGTGAA ATAAAGGAAC TGTTTGGGGG ATTGTTATTT AACAATCCTA AGCCAACAGG TTTGTTGAAG TTTTTATTAG AGTTGGGATC AAGCAAAGAT TCTATAATCC TCGACTTCTT CTCTGGATCT GCCACTACAG CCCACGCCGT AATGCAGCTT AATGCTGAAG ATGGTGGCAA CCGTAGATTT ATTATGGTAC AGCTCCCAGA GCCAACGGAT GAAAATAGCG AAGCTTATAA GGCCGGATAT ATGAATATTT CTGAGATAGG CAAAGAGCGT ATCCGCCGTG CAGGAGAAAA AATCAAAGAG GAATATAAAG ATAAAGGGAA TATAGAAAAC CTTGATATCG GCTTTAAGGT GTTCAAGCTC GATACTTCAA ATATCAGAAA ATGGCAGCCG GATTATGATA ATTTAGAGCA ATCTTTAATG GATTATGTAG ATAACTTTGT GGAAGGCAGG ACGGAACTTG ATGTTGTTTA TGAGATAATG CTCAAATACG GTCTTGACCT GACTTATCCA GTTGATGAGT TTACAATTGC CGGTAAGAAA GTCTATTCTA TTGGCTATGG CATGCTGATG ATTTGCCTTG ATAATGAAAT TACAACAGAG GTTGCTAAGG GTATTTTGAC AAAAATAAAA GAATTATCAC CTGAAAGCAG CAGAGTTGTA TTTAAGGATA ATGGATTTAA GACAGACAGT AACAAGACCA ATATCAAGGA AATACTTAAA 
TCCGGCGGAA TTGAAGAATT TATAACTATA TAG
|
Protein sequence | MIETTLTGKT PDIGEENIKK LMTMFPEVVT EGKVDFEKLK QLLGEYVDDS NERYNFTWNG KGRALRLSQT PSLGTLRPCK EESKNWDTTQ NLYIEGDNLE VLKLLQKSYY GKIKMIYIDP PYNTGKDFVY RDDFHDSLEN YKRITGQVDG NGKAISTNTE TSGRYHTDWL NMMYPRLRLA RNLLSDDGVI FISIDDNEVD NLKKICNEIF GEDNFIANCV RKRRDSQANL SQNISPIHEY VLIYAKRFGN ILNKVTPSLD MGSYKNPDND PRGPYTTMPC TNVGGAVYSI VTPTGKTITD EWRFKKETFE KLLLDNRIVF PRNGEGKPRY KIFLSEKMAE GVLANTWLDK IASNQEGTRE IKELFGGLLF NNPKPTGLLK FLLELGSSKD SIILDFFSGS ATTAHAVMQL NAEDGGNRRF IMVQLPEPTD ENSEAYKAGY MNISEIGKER IRRAGEKIKE EYKDKGNIEN LDIGFKVFKL DTSNIRKWQP DYDNLEQSLM DYVDNFVEGR TELDVVYEIM LKYGLDLTYP VDEFTIAGKK VYSIGYGMLM ICLDNEITTE VAKGILTKIK ELSPESSRVV FKDNGFKTDS NKTNIKEILK SGGIEEFITI
|
| |