Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1519 |
Symbol | |
ID | 4810557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1843585 |
End bp | 1844814 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640106939 |
Product | SAM-dependent methyltransferase |
Protein accession | YP_001037940 |
Protein GI | 125974030 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.197853 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGC AACGGCAGTA TCCCAAAATT ATGATATCCC GTAAGGCGGA GCGCAGCGTA AAAGACGGAC ATCCATGGAT ATACGGAGAA GAAATTCTCA AAACAGAGGG CGAACTCCAA AACGGCGGTC TGGTGGATGT GTTTGCCGGA AACGCCTTTA TGGGGACAGG TTTTTACAAT AGTGTCAGCA AGATTACCGT TCGACTTATT TCACGCAACG CCAATGATGT ATTTGACGCC CGCTTTTGGC GCCGCCGGGT GGAATATGCC GTTCGCTACC GCAAAACCGT CATACCCGGT GCAGACTTTG CCTGTTGCCG CCTGATTCAC GGTGAGGCTG ACCATATGCC GGGTCTGACA GTGGACCGCT ATGGAAGCCT TTTGTCCGTA CAAATTACCT GTCTTGGCAT GGAACTTGTC AAGGATACGG TTTACCGCGC TCTCTGGGAT GTGCTTACCG AAATGGATGA AACCATCACA GGTATTTATG AACGCAATGA CATTGCCCTG CGTACACGGG AAGGACTGCC GGAATATAAA GGCTGGTACC TGTTCGATGG CATACCCAGG CCGGAATCTG CTGTGACTGA AATATGCGAA AACGGCATAA AGTATCTGGT GGATGTAGAG AACGGGCAGA AAACCGGCTT TTTCCTGGAT CAGAAATATA ACCGTGCCGC TGTGGCCCGA ATTGCCAAGG GCAAACGTGT GCTGGACTGT TTTACCCATA CCGGTTCCTT CGGCCTTAAT GCAGCCCTGG GGGGAGCCGA GCATGTGACC TGTGTTGACA TCTCACAGTC AGCCATAGAC ATGGCAAAAG CAAATGCCGT ACGCAACGGT CTGGACGGAA AAATGGATTT TGTCTGTGAA GATGTTTTTG ATTTGCTCAC AAAACTGGCA GAACAAAAAT GCCATGACTA TGATTATATC ATTCTTGACC CGCCGGCTTT TACCAAATCC CGCAAGACGG TGCAGTCTGC CGCCCGCGGA TATAAAGAAA TTAACCTCAA AGCCATGAAA CTGCTGCCCC GCGGCGGATA TCTGGCTACT TGCAGCTGCA GTCATTTCAT GACGGATGAC TTGTTCCGCA AAACACTGGC AAGCGCCGCA AAGGATGCTT CCGTATCCTT AAGGCAGATT GAAGCCCGTC AGCAGGCCCC GGACCATCCC ATCCTGTGGA ATGTTCCGGA GACGGACTAT TTAAAGTTTT ATATTTTTCA GGTTGTATAA
|
Protein sequence | MKQQRQYPKI MISRKAERSV KDGHPWIYGE EILKTEGELQ NGGLVDVFAG NAFMGTGFYN SVSKITVRLI SRNANDVFDA RFWRRRVEYA VRYRKTVIPG ADFACCRLIH GEADHMPGLT VDRYGSLLSV QITCLGMELV KDTVYRALWD VLTEMDETIT GIYERNDIAL RTREGLPEYK GWYLFDGIPR PESAVTEICE NGIKYLVDVE NGQKTGFFLD QKYNRAAVAR IAKGKRVLDC FTHTGSFGLN AALGGAEHVT CVDISQSAID MAKANAVRNG LDGKMDFVCE DVFDLLTKLA EQKCHDYDYI ILDPPAFTKS RKTVQSAARG YKEINLKAMK LLPRGGYLAT CSCSHFMTDD LFRKTLASAA KDASVSLRQI EARQQAPDHP ILWNVPETDY LKFYIFQVV
|
| |