Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2083 |
Symbol | |
ID | 4810681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2475825 |
End bp | 2477024 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107490 |
Product | DNA-directed DNA polymerase |
Protein accession | YP_001038483 |
Protein GI | 125974573 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.968018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAG TGATTCTTCA CTGCGACCTT AATAATTTTT ATGCCAGCGT GGAATGTCTG TATCATCCCG AACTTCGCGA CAAGCCGGTT GCGGTGTGTG GCTCGATAGA GGACAGACAT GGCATAGTGC TTGCCAAAAA CTATGCGGCA AAAAAATACA AAGTAAAAAC GGGCGAGACG GTATGGGAGG CAAAAAACAA GTGCCCGGGG CTGGTTGTGG TAAAAGCCAA TCATTCTTTG TATTACAAGT TTTCAAAATA TGCCCGCCAA ATTTACGAAT ATTACACCGA CAGAGTGGAA TCCTTTGGAT TGGACGAATG CTGGCTTGAT GTCAGTGAAA GTACATTGCT TTTTGGAGAC GGGACGAAGA TAGCCAACGA GATAAGAGAA AGAATAAAAA GGGAGCTGGG AGTGACGGTT TCCGTTGGCG TAAGCTATAA TAAAGTATTT GCAAAGCTTG GGTCTGACAT GAAAAAGCCG GACGCGGTTA CGGTTATTAC CGAGAATGAT TTTAAAGAAA AAATATGGGG ACTTCCGGTG GAAGCTCTTC TTTATGTGGG GGATTCAACA AAAAAGAAAC TTAACAATAT GGCTGTTTTT ACTATCGGAG ATTTGGCCAA TTGCCATTCG GAATTTCTCG TAAGGCAATT GGGAAAATGG GGATATACCC TGTGGAGCTT TGCAAACGGC TATGATACCA GCCCTGTTGC CAAAAATGAT TGTGAAATAC CGATAAAGAG CATAGGAAAT TCCCTTACCG CACCAAGGGA CCTTACGAAC AACGAAGATG TCCGGATTTT AATATATGTA CTTTCCGAAA GCGTGGGAGA AAGGCTTAGA AGTCACAATC TTAAAGGAAG GACCGTCCAG ATAAGTATAA AGGACCCGGA GCTTCAGACA TTGGAAAGGC AGGCCGGGCT TGACATACAT ACCAGTATTA CATCTGAAAT TGCGCAAAAA GCGTATGAAA TATTTTTAAA ATCCTGGAAT TGGTCAAAAA ACGTAAGGGC TCTGGGAGTC AGGGTGACGG ATTTGGTTGA GTCGGATACA TGCACGCAGA TATCATTGTT TTCGGACGAC ATAAAAAGGC AAAAGCTTGA GATACTTGAT GAGTGTGTGG ACAGGGTCAG GGAGAGATTT GGATATTATT CGGTGAGAAG AGGAATTTTG CTTCAGGACA GAGGATTAAA CAGGATTTAA
|
Protein sequence | MKRVILHCDL NNFYASVECL YHPELRDKPV AVCGSIEDRH GIVLAKNYAA KKYKVKTGET VWEAKNKCPG LVVVKANHSL YYKFSKYARQ IYEYYTDRVE SFGLDECWLD VSESTLLFGD GTKIANEIRE RIKRELGVTV SVGVSYNKVF AKLGSDMKKP DAVTVITEND FKEKIWGLPV EALLYVGDST KKKLNNMAVF TIGDLANCHS EFLVRQLGKW GYTLWSFANG YDTSPVAKND CEIPIKSIGN SLTAPRDLTN NEDVRILIYV LSESVGERLR SHNLKGRTVQ ISIKDPELQT LERQAGLDIH TSITSEIAQK AYEIFLKSWN WSKNVRALGV RVTDLVESDT CTQISLFSDD IKRQKLEILD ECVDRVRERF GYYSVRRGIL LQDRGLNRI
|
| |