Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3084 |
Symbol | |
ID | 4809958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3638500 |
End bp | 3639636 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108508 |
Product | putative RNA methylase |
Protein accession | YP_001039473 |
Protein GI | 125975563 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0116] Predicted N6-adenine-specific DNA methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAA TTGAACTTAT AGCAACAGCG GCAGCCGGTG TGGAATCTGT GGTAAAACAT GAAGTAAAAA AACTGGGTTT TAAAGATATA ACCGTAGATA ACGGCAAAAT AATGTTTAAA GGGGATATAT CAAGTATTCC CCGGGCAAAT TTATGGTTAA GATCCGCCGA CAGGGTATTG CTTAAGATGG GTGAATTTGA AGCTTTAACT TTTGAAGAGC TTTTCGATAA AACCTACGCC CTCCCCTGGG ATCAGTGGAT AACCAGCGAC GGTAAATTCA CCGTATTGGG TAAATCGGTA AAGTCAAAAC TTTTCAGCAT CTCCGATTGT CAGGCAATTG TGAAAAAAGC AGTGGTAGAA AAGCTAAAGT CCAAATATCA TGTGGAATGG TTTGAAGAAA GCGGCCCCGA GTATACCATA CAAGTGGCCC TTCACAAGGA CATTGCAACT TTGACCATTG ATACAAGCGG TACAGCACTG CACAAAAGAG GCTACAGGGC CAAAAATGTT GAGGCCCCCA TAAAGGAAAC TCTTGCATCC GCAATGATTT TGATTAGCTA TTGGAACAAA AGAAAGCCTC TGTGGGATTG CTTTTGCGGT TCAGGAACAA TTCCAATTGA AGCTGCTCTC ATCGGACGCA ACATTGCTCC GGGTCTTAAC AGAACTTTCG CCTCTGAAGA ATGGCCGGCT ATAGGCAAGG ATATATGGAA ACAGGAAAGA GCTTTGGCCT TAAGGGCAAT AGACCACGAT ATTAAACTTA AAATCTACGG TTCGGATATA AATCCCGACG CTATTGAACT GGCAAAAGAA AACGCCTGTC TCGCAGGAGT CGATGATTGT ATTGAATTCT TCGTCAGTGA TTTCAGAAAT GTTAACATAA AGGAGGATTA CGGCGTAATC ATATGCAATC CTCCCTATGG CGAAAGAATA GGTGAACAAC AAGAAGTTGA AACAATAAAC AAGGATATGG GGAAAACATT CTCCAAATAT GATACCTGGT CAAAATACAT AATTACCCCG TTTGAAAACT TTGAACGTCT TTACGGTAAA AAAGCCGATA AAAAGCGCAA ATTTTACAAT GGAAACATAA AAGTGGATTA TTACCAATAC TTCGGTCCAA AACCAACCAA CACCTAA
|
Protein sequence | MKEIELIATA AAGVESVVKH EVKKLGFKDI TVDNGKIMFK GDISSIPRAN LWLRSADRVL LKMGEFEALT FEELFDKTYA LPWDQWITSD GKFTVLGKSV KSKLFSISDC QAIVKKAVVE KLKSKYHVEW FEESGPEYTI QVALHKDIAT LTIDTSGTAL HKRGYRAKNV EAPIKETLAS AMILISYWNK RKPLWDCFCG SGTIPIEAAL IGRNIAPGLN RTFASEEWPA IGKDIWKQER ALALRAIDHD IKLKIYGSDI NPDAIELAKE NACLAGVDDC IEFFVSDFRN VNIKEDYGVI ICNPPYGERI GEQQEVETIN KDMGKTFSKY DTWSKYIITP FENFERLYGK KADKKRKFYN GNIKVDYYQY FGPKPTNT
|
| |