Gene Cthe_1519 details


Gene Information

Locus tag: Cthe_1519
Symbol:
ID: 4810557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1843585
End bp: 1844814
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 50%
IMG OID: 640106939
Product: SAM-dependent methyltransferase
Protein accession: YP_001037940
Protein GI: 125974030
COG category: [R] General function prediction only
COG ID: [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID:
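
The gene and protein lengths above follow from the start and end coordinates. As a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein, the arithmetic can be checked as follows. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1843585, 1844814)
    print(length)                     # 1230
    print(protein_length_aa(length))  # 409
```

Both values agree with the Gene Length and Protein Length fields of this record.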


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.197853
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAGC AACGGCAGTA TCCCAAAATT ATGATATCCC GTAAGGCGGA GCGCAGCGTA 
AAAGACGGAC ATCCATGGAT ATACGGAGAA GAAATTCTCA AAACAGAGGG CGAACTCCAA
AACGGCGGTC TGGTGGATGT GTTTGCCGGA AACGCCTTTA TGGGGACAGG TTTTTACAAT
AGTGTCAGCA AGATTACCGT TCGACTTATT TCACGCAACG CCAATGATGT ATTTGACGCC
CGCTTTTGGC GCCGCCGGGT GGAATATGCC GTTCGCTACC GCAAAACCGT CATACCCGGT
GCAGACTTTG CCTGTTGCCG CCTGATTCAC GGTGAGGCTG ACCATATGCC GGGTCTGACA
GTGGACCGCT ATGGAAGCCT TTTGTCCGTA CAAATTACCT GTCTTGGCAT GGAACTTGTC
AAGGATACGG TTTACCGCGC TCTCTGGGAT GTGCTTACCG AAATGGATGA AACCATCACA
GGTATTTATG AACGCAATGA CATTGCCCTG CGTACACGGG AAGGACTGCC GGAATATAAA
GGCTGGTACC TGTTCGATGG CATACCCAGG CCGGAATCTG CTGTGACTGA AATATGCGAA
AACGGCATAA AGTATCTGGT GGATGTAGAG AACGGGCAGA AAACCGGCTT TTTCCTGGAT
CAGAAATATA ACCGTGCCGC TGTGGCCCGA ATTGCCAAGG GCAAACGTGT GCTGGACTGT
TTTACCCATA CCGGTTCCTT CGGCCTTAAT GCAGCCCTGG GGGGAGCCGA GCATGTGACC
TGTGTTGACA TCTCACAGTC AGCCATAGAC ATGGCAAAAG CAAATGCCGT ACGCAACGGT
CTGGACGGAA AAATGGATTT TGTCTGTGAA GATGTTTTTG ATTTGCTCAC AAAACTGGCA
GAACAAAAAT GCCATGACTA TGATTATATC ATTCTTGACC CGCCGGCTTT TACCAAATCC
CGCAAGACGG TGCAGTCTGC CGCCCGCGGA TATAAAGAAA TTAACCTCAA AGCCATGAAA
CTGCTGCCCC GCGGCGGATA TCTGGCTACT TGCAGCTGCA GTCATTTCAT GACGGATGAC
TTGTTCCGCA AAACACTGGC AAGCGCCGCA AAGGATGCTT CCGTATCCTT AAGGCAGATT
GAAGCCCGTC AGCAGGCCCC GGACCATCCC ATCCTGTGGA ATGTTCCGGA GACGGACTAT
TTAAAGTTTT ATATTTTTCA GGTTGTATAA
 
Protein sequence
MKQQRQYPKI MISRKAERSV KDGHPWIYGE EILKTEGELQ NGGLVDVFAG NAFMGTGFYN 
SVSKITVRLI SRNANDVFDA RFWRRRVEYA VRYRKTVIPG ADFACCRLIH GEADHMPGLT
VDRYGSLLSV QITCLGMELV KDTVYRALWD VLTEMDETIT GIYERNDIAL RTREGLPEYK
GWYLFDGIPR PESAVTEICE NGIKYLVDVE NGQKTGFFLD QKYNRAAVAR IAKGKRVLDC
FTHTGSFGLN AALGGAEHVT CVDISQSAID MAKANAVRNG LDGKMDFVCE DVFDLLTKLA
EQKCHDYDYI ILDPPAFTKS RKTVQSAARG YKEINLKAMK LLPRGGYLAT CSCSHFMTDD
LFRKTLASAA KDASVSLRQI EARQQAPDHP ILWNVPETDY LKFYIFQVV
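
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is one way to reproduce that from the blocked text shown here. It assumes the sequence is pasted with its spaces and line breaks intact. Table 11 assigns the same amino acids as the standard code and differs only in which start codons are permitted, so the standard codon-to-residue mapping is used. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First and last codons of the gene sequence in this record.
    print(translate("ATGAAACAGC AACGGCAG"))  # MKQQRQ
    print(translate("GTTGTATAA"))            # VV
```

Applied to the full gene sequence, `translate` is expected to yield the protein sequence listed in this record, and `gc_content` gives a value to compare against the GC content field.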