Gene Information |
Locus tag | Cthe_1728 |
Symbol | |
ID | 4810158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2049221 |
End bp | 2050672 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107141 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_001038142 |
Protein GI | 125974232 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
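The length fields above follow from the coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon, the arithmetic can be checked as follows. The function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 2049221
END_BP = 2050672


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1452
print(protein_length(length_bp))  # 483
```

Both results match the Gene Length (1452 bp) and Protein Length (483 aa) rows of the record.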
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.770689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTCC TTGATTTCTT TGCAGGAATA GGCGGCTTTC GATTGGGGTT AGAACTTGCA GGACATAAAT GTATAGGATT TTGTGAAAAA GATAAATTTG CAGTTAAAAG CTATAGAGCA ATGTTTGATA CGGAAGGAGA GTGGTATGCA GATGACGTTA CAAAACTTAA AAGCGAAGAT ATCCCATATG CAGACATCTG GTGTTTTGGA TTCCCATGTC AAGACATCTC AGTTGCAGGA AAACAGCGAG GAATCAGAGG AAAAAGAAGC GGCTTATATT TTAGAATTAT TGACCTTATC AAAGGCAAAG AAGAAAAAGA TAAACCCTCA TACCTTCTTA TTGAGAACGT TAAAAACCTG CTGTCAATTA ATAACGGATG GGACTTTGCA GCCGTTCTCT CTGAGCTGGA TGAAGCAGGG TATGATGCTT TCTGGCAGGT GCTTAACTCT AAAGATTTCG GAGTACCCCA AAACAGAGAG CGTGTGTTCA TTACTGCAAA TCTTAGAAGC AGAGGCAGAC GAGAAATACT ACCTGTCAGA GGAGAAAACA CAGCAGCTCT TAAGCAGATT ATAGGCGGCT GTCAAGGTGA GCGAGTTTAT GATGCAGAAG GAGTAGCTTG CACACTTACA GGTTGTGGTG GAGGTGGAGG AGCAAAGACT GGTCTTTACT GCGTAGGAAA TATAAATCCA AGCGGCAAAG GTATGAATGG AAATGTTTAT TCATCAGAAG GAATTGCTCC AGCAGTTACA ACTAATAAGG GCGAAGGAAG CAAAATTTTT ATAGACCAGT CTTACACAAA GCCAAAGCTC ACAGATACAT CAAGATGTAT TACATCACGT TATACAGCAG GAGTGGTTAA TAGAACTGCA ATGAATAGCG GAGTTTTAGA GAGTGTTCCA ATTGAGTTTA ATAGAAATGA TGGTATTCTT GATGAAATAA AAATAGCACA TACAATCAGT GCCAGTGATT GGCGAGGACT AAACAGAAAT CAAACACAAA ATGCTGTTCT TGAAGCAAGG GCTGTTATAA CTCCTGAAAG AGAAAATAAA CGTCAGAATG GAAGGAGATT TAAAAATGCA GATGAACCAA TGTTTACTTT AACAAGTCAG GACAGACATG GAGTTGCAGT TAAAGAAGCC ACAAAAAAAG GCTATGCTGA AGCTGAGATT GGAGACAGCA TAAATATTTC AGTACCTAAT TCAAAAACAA GAAGAGGCAG GGTTGGAAAA GGTATATCAA ATACTTTAGA TACAGGATGC CAAATGGCTA CATTGGACAA AAATTACCGT ATTAGAAGGC TTACACCAAA GGAATGCTTC AGACTTCAAG GCTTCCCAGA TGAATTATTT GAAAAAGCAA GGGCTGTAAA TTCAGATGCT CAGCTATATA AGCAGGCAGG TAATGCAGTA ACTGTAAATG TAGCTTTTGC AGTAGCACAA TCTATTAAAT AA
|
Protein sequence | MTFLDFFAGI GGFRLGLELA GHKCIGFCEK DKFAVKSYRA MFDTEGEWYA DDVTKLKSED IPYADIWCFG FPCQDISVAG KQRGIRGKRS GLYFRIIDLI KGKEEKDKPS YLLIENVKNL LSINNGWDFA AVLSELDEAG YDAFWQVLNS KDFGVPQNRE RVFITANLRS RGRREILPVR GENTAALKQI IGGCQGERVY DAEGVACTLT GCGGGGGAKT GLYCVGNINP SGKGMNGNVY SSEGIAPAVT TNKGEGSKIF IDQSYTKPKL TDTSRCITSR YTAGVVNRTA MNSGVLESVP IEFNRNDGIL DEIKIAHTIS ASDWRGLNRN QTQNAVLEAR AVITPERENK RQNGRRFKNA DEPMFTLTSQ DRHGVAVKEA TKKGYAEAEI GDSINISVPN SKTRRGRVGK GISNTLDTGC QMATLDKNYR IRRLTPKECF RLQGFPDELF EKARAVNSDA QLYKQAGNAV TVNVAFAVAQ SIK
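The sequences above are printed in blocks of 10 characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate the gene sequence. It rests on two assumptions. First, the gene sequence is already given in coding orientation: it begins with ATG and ends with TAA although the strand is "-", so no reverse complement is applied. Second, the amino acid assignments of translation table 11 match the standard code, so a single codon table is used and alternative start codons are not modeled. All function names are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spacing between 10-character blocks and uppercase."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean_sequence("ATGACCTTCC TTGATTTCTT TGCAGGAATA")
print(translate(prefix))  # MTFLDFFAGI
print(f"{gc_fraction(prefix):.2f}")
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` can be used in the same way to compare against the GC content and Protein sequence rows.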