Gene Information |
Locus tag | Cthe_0498 |
Symbol | |
ID | 4808351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 609535 |
End bp | 611163 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105911 |
Product | hypothetical protein |
Protein accession | YP_001036928 |
Protein GI | 125973018 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain |
TIGRFAM ID | |
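The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the usual 1-based, inclusive coordinate convention (the record does not state it) and assumes the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 609535
END_BP = 611163
GENE_LENGTH_BP = 1629
PROTEIN_LENGTH_AA = 542


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by a CDS whose final codon is assumed to be a stop."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, End bp minus Start bp plus one gives the 1629 bp gene length, and 1629 / 3 = 543 codons, one of which is the stop, gives the 542 aa protein length.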
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.578621 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGATG TTACGAAAGG CTCATTCAAT GACAGTCCGA ATAACGGTTT TTTTGAGATC CAGTATAAGG AAGACGGAGT ATATCTTACA GTGCATCCGC CAATAGGAAA AGGCAAGGCG GTTGAAGTAA ACGATGTAAT AAGCAGGCTT ACGCAGAAGA AAATTGTCTA TGATAAAGAA ATGGTTGAAT TGGCTGTTCA GAGAGCGTCA AACGTACCTG TGAAAATTGG CGAACCTCAG GAGGAACTTA AACTTGATGC GACAATAGAT GTTAACATTT CCCCGGACAA AATGAAAGCA ACAATGGTAA TAAGACCCCC TGACGGTGGA AGAATGCTTA CTAAAGACGA GATGATGGAG ATTTTGAAAA ACAGCGGGGT AAGATACGGA ATAAACGAGT CAATGCTTGA GAATGTTTCA AAATATCCTG TCTATAATGA GATTATAGTA ATTGCCGAAG GTACGCCTCC CATAAACGGA CAGAATGGAA AAGTGGAATT CCATTTTGAT TTGAAAAAAG AAAGAAAACC TACTATCCTT GAGGATGGAA GGGTGGATTT CAGAGAACTG AATCTTATTG AAAGTGTAAA AAAAGGACAG GTTCTCTGTA CACTGGTTCC TCCGCTTCCG GGTACACCGG GCAGAACGGT GGAGGATATC GAGGTTCCGG CTTTGGACGG AAAACCTGCC GTGCTTCCAA AAGGGAAAAA TGTTGAAATA AGTGAAGACG GACAAAGTCT TATTGCCGGC ATAGACGGAC AGGTAAATTA TATAGACGGC AAGGTAAGTG TTTTTGCCAA TTATGAAGTT CCTGCAGACG TTGACAACTC CACCGGAAAC ATAAGTTTTG TAGGCAATGT TATCATAAGA GGAAATGTTT TGTCCGGTTT TACCGTTGAA GCCGGAGGCA GTGTTGAGGT AATGGGAGTG GTGGAAGCTG CCGTTATAAA GGCCGATGGT GACATTATTC TAAGAAGGGG AATGCAGGGG CTTGGAAGAG GAATATTAAA AAGCGGCGGT GACATAATTG CAAAATATAT AGAAAACAGC ATTATTGAAG CCAAAGGTGA CATAAAAGCC GAGGCAATAA TGCACAGCAA CGTAAAATGC GGAAACAAGC TGGAGCTTTC CGGCAAGAAA GGTCTTTTGA TAGGCGGAAA ATGCAAAGTG GGAAAAGAAA TAGTAGCGAA GGTTATCGGT TCGTATCTTG CCACTCACAC CGATATTGAG GTGGGTGTTG ATCCGCAGAT TAAAGAGCGC TACAAGGAGC TTCGGGATGA GATTCGGAAA ATAGAAGAGG ATTTGGTTAA AGCGGAACAG GCCATAACAA TATTAAAGAA GCTTGAGGCC GCAGGAAAGC TTACTCCGGA GAAGCAGGAA CTGATGGCCA GAAGCATTAG AACAAAGATT TATTATTCGA ACAGGCTTGG TGAATTAAAA GAAGAATTGA TAATAACAGA GCAAAGGCTT CAGAAGGAGG CTGACGGAAA AATCAGGGTA TTTGATCATA TATATCCGGG AACAAAAGTT ACAATAGGGA CGAGCATGAT GTATGTCAAA GAGGACCTGC AATATTGTAC ATTATACAGG GACGGGGCTG ATATAAGAGT TGGGCCTATT GACAAATAA
|
Protein sequence | MRDVTKGSFN DSPNNGFFEI QYKEDGVYLT VHPPIGKGKA VEVNDVISRL TQKKIVYDKE MVELAVQRAS NVPVKIGEPQ EELKLDATID VNISPDKMKA TMVIRPPDGG RMLTKDEMME ILKNSGVRYG INESMLENVS KYPVYNEIIV IAEGTPPING QNGKVEFHFD LKKERKPTIL EDGRVDFREL NLIESVKKGQ VLCTLVPPLP GTPGRTVEDI EVPALDGKPA VLPKGKNVEI SEDGQSLIAG IDGQVNYIDG KVSVFANYEV PADVDNSTGN ISFVGNVIIR GNVLSGFTVE AGGSVEVMGV VEAAVIKADG DIILRRGMQG LGRGILKSGG DIIAKYIENS IIEAKGDIKA EAIMHSNVKC GNKLELSGKK GLLIGGKCKV GKEIVAKVIG SYLATHTDIE VGVDPQIKER YKELRDEIRK IEEDLVKAEQ AITILKKLEA AGKLTPEKQE LMARSIRTKI YYSNRLGELK EELIITEQRL QKEADGKIRV FDHIYPGTKV TIGTSMMYVK EDLQYCTLYR DGADIRVGPI DK
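The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table uses the standard amino-acid assignments (which translation table 11 shares with the standard code; table 11 differs in its permitted start codons, which this sketch ignores), and the sequences are assumed to contain only A, C, G and T plus the spacing shown above.

```python
from itertools import product

# Codon assignments in TCAG order. Translation table 11 uses the same
# amino-acid assignments as the standard code; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the 10-base spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 9 bases of the gene sequence above.
    print(translate("ATGAGAGATG TTACGAAAGG CTCATTCAAT"))  # MRDVTKGSFN
    print(translate("GACAAATAA"))  # DK
```

The two fragments translate to the first ten residues (MRDVTKGSFN) and the last two residues (DK) of the protein sequence above. Passing the full gene sequence to `gc_content` would give the value that the GC content field rounds to a percentage.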