Gene Cthe_0498 details


Gene Information

Locus tag: Cthe_0498
Symbol:
ID: 4808351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 609535
End bp: 611163
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 42%
IMG OID: 640105911
Product: hypothetical protein
Protein accession: YP_001036928
Protein GI: 125973018
COG category: [L] Replication, recombination and repair
COG ID: [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain
TIGRFAM ID:
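
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 609535
END_BP = 611163
GENE_LENGTH_BP = 1629
PROTEIN_LENGTH_AA = 542


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1629
print(expected_protein_length(GENE_LENGTH_BP))  # 542
```

Under these assumptions both results agree with the listed Gene Length and Protein Length.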


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.578621
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGATG TTACGAAAGG CTCATTCAAT GACAGTCCGA ATAACGGTTT TTTTGAGATC 
CAGTATAAGG AAGACGGAGT ATATCTTACA GTGCATCCGC CAATAGGAAA AGGCAAGGCG
GTTGAAGTAA ACGATGTAAT AAGCAGGCTT ACGCAGAAGA AAATTGTCTA TGATAAAGAA
ATGGTTGAAT TGGCTGTTCA GAGAGCGTCA AACGTACCTG TGAAAATTGG CGAACCTCAG
GAGGAACTTA AACTTGATGC GACAATAGAT GTTAACATTT CCCCGGACAA AATGAAAGCA
ACAATGGTAA TAAGACCCCC TGACGGTGGA AGAATGCTTA CTAAAGACGA GATGATGGAG
ATTTTGAAAA ACAGCGGGGT AAGATACGGA ATAAACGAGT CAATGCTTGA GAATGTTTCA
AAATATCCTG TCTATAATGA GATTATAGTA ATTGCCGAAG GTACGCCTCC CATAAACGGA
CAGAATGGAA AAGTGGAATT CCATTTTGAT TTGAAAAAAG AAAGAAAACC TACTATCCTT
GAGGATGGAA GGGTGGATTT CAGAGAACTG AATCTTATTG AAAGTGTAAA AAAAGGACAG
GTTCTCTGTA CACTGGTTCC TCCGCTTCCG GGTACACCGG GCAGAACGGT GGAGGATATC
GAGGTTCCGG CTTTGGACGG AAAACCTGCC GTGCTTCCAA AAGGGAAAAA TGTTGAAATA
AGTGAAGACG GACAAAGTCT TATTGCCGGC ATAGACGGAC AGGTAAATTA TATAGACGGC
AAGGTAAGTG TTTTTGCCAA TTATGAAGTT CCTGCAGACG TTGACAACTC CACCGGAAAC
ATAAGTTTTG TAGGCAATGT TATCATAAGA GGAAATGTTT TGTCCGGTTT TACCGTTGAA
GCCGGAGGCA GTGTTGAGGT AATGGGAGTG GTGGAAGCTG CCGTTATAAA GGCCGATGGT
GACATTATTC TAAGAAGGGG AATGCAGGGG CTTGGAAGAG GAATATTAAA AAGCGGCGGT
GACATAATTG CAAAATATAT AGAAAACAGC ATTATTGAAG CCAAAGGTGA CATAAAAGCC
GAGGCAATAA TGCACAGCAA CGTAAAATGC GGAAACAAGC TGGAGCTTTC CGGCAAGAAA
GGTCTTTTGA TAGGCGGAAA ATGCAAAGTG GGAAAAGAAA TAGTAGCGAA GGTTATCGGT
TCGTATCTTG CCACTCACAC CGATATTGAG GTGGGTGTTG ATCCGCAGAT TAAAGAGCGC
TACAAGGAGC TTCGGGATGA GATTCGGAAA ATAGAAGAGG ATTTGGTTAA AGCGGAACAG
GCCATAACAA TATTAAAGAA GCTTGAGGCC GCAGGAAAGC TTACTCCGGA GAAGCAGGAA
CTGATGGCCA GAAGCATTAG AACAAAGATT TATTATTCGA ACAGGCTTGG TGAATTAAAA
GAAGAATTGA TAATAACAGA GCAAAGGCTT CAGAAGGAGG CTGACGGAAA AATCAGGGTA
TTTGATCATA TATATCCGGG AACAAAAGTT ACAATAGGGA CGAGCATGAT GTATGTCAAA
GAGGACCTGC AATATTGTAC ATTATACAGG GACGGGGCTG ATATAAGAGT TGGGCCTATT
GACAAATAA
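
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before its length or GC content can be compared with the fields in the record. The following is a minimal sketch, assuming the sequence text has been pasted into a string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed row is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed row of the gene sequence, used only as sample input.
first_row = "ATGAGAGATG TTACGAAAGG CTCATTCAAT GACAGTCCGA ATAACGGTTT TTTTGAGATC"
seq = clean_sequence(first_row)
print(len(seq))
print(round(gc_fraction(seq), 3))
```

Applying the same two functions to the full sequence text should give values comparable to the listed Gene Length and GC content, provided the whole sequence is pasted without omissions.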
 
Protein sequence
MRDVTKGSFN DSPNNGFFEI QYKEDGVYLT VHPPIGKGKA VEVNDVISRL TQKKIVYDKE 
MVELAVQRAS NVPVKIGEPQ EELKLDATID VNISPDKMKA TMVIRPPDGG RMLTKDEMME
ILKNSGVRYG INESMLENVS KYPVYNEIIV IAEGTPPING QNGKVEFHFD LKKERKPTIL
EDGRVDFREL NLIESVKKGQ VLCTLVPPLP GTPGRTVEDI EVPALDGKPA VLPKGKNVEI
SEDGQSLIAG IDGQVNYIDG KVSVFANYEV PADVDNSTGN ISFVGNVIIR GNVLSGFTVE
AGGSVEVMGV VEAAVIKADG DIILRRGMQG LGRGILKSGG DIIAKYIENS IIEAKGDIKA
EAIMHSNVKC GNKLELSGKK GLLIGGKCKV GKEIVAKVIG SYLATHTDIE VGVDPQIKER
YKELRDEIRK IEEDLVKAEQ AITILKKLEA AGKLTPEKQE LMARSIRTKI YYSNRLGELK
EELIITEQRL QKEADGKIRV FDHIYPGTKV TIGTSMMYVK EDLQYCTLYR DGADIRVGPI
DK