Gene Cthe_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2083 
Symbol 
ID4810681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2475825 
End bp2477024 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content43% 
IMG OID640107490 
ProductDNA-directed DNA polymerase 
Protein accessionYP_001038483 
Protein GI125974573 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.968018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGAG TGATTCTTCA CTGCGACCTT AATAATTTTT ATGCCAGCGT GGAATGTCTG 
TATCATCCCG AACTTCGCGA CAAGCCGGTT GCGGTGTGTG GCTCGATAGA GGACAGACAT
GGCATAGTGC TTGCCAAAAA CTATGCGGCA AAAAAATACA AAGTAAAAAC GGGCGAGACG
GTATGGGAGG CAAAAAACAA GTGCCCGGGG CTGGTTGTGG TAAAAGCCAA TCATTCTTTG
TATTACAAGT TTTCAAAATA TGCCCGCCAA ATTTACGAAT ATTACACCGA CAGAGTGGAA
TCCTTTGGAT TGGACGAATG CTGGCTTGAT GTCAGTGAAA GTACATTGCT TTTTGGAGAC
GGGACGAAGA TAGCCAACGA GATAAGAGAA AGAATAAAAA GGGAGCTGGG AGTGACGGTT
TCCGTTGGCG TAAGCTATAA TAAAGTATTT GCAAAGCTTG GGTCTGACAT GAAAAAGCCG
GACGCGGTTA CGGTTATTAC CGAGAATGAT TTTAAAGAAA AAATATGGGG ACTTCCGGTG
GAAGCTCTTC TTTATGTGGG GGATTCAACA AAAAAGAAAC TTAACAATAT GGCTGTTTTT
ACTATCGGAG ATTTGGCCAA TTGCCATTCG GAATTTCTCG TAAGGCAATT GGGAAAATGG
GGATATACCC TGTGGAGCTT TGCAAACGGC TATGATACCA GCCCTGTTGC CAAAAATGAT
TGTGAAATAC CGATAAAGAG CATAGGAAAT TCCCTTACCG CACCAAGGGA CCTTACGAAC
AACGAAGATG TCCGGATTTT AATATATGTA CTTTCCGAAA GCGTGGGAGA AAGGCTTAGA
AGTCACAATC TTAAAGGAAG GACCGTCCAG ATAAGTATAA AGGACCCGGA GCTTCAGACA
TTGGAAAGGC AGGCCGGGCT TGACATACAT ACCAGTATTA CATCTGAAAT TGCGCAAAAA
GCGTATGAAA TATTTTTAAA ATCCTGGAAT TGGTCAAAAA ACGTAAGGGC TCTGGGAGTC
AGGGTGACGG ATTTGGTTGA GTCGGATACA TGCACGCAGA TATCATTGTT TTCGGACGAC
ATAAAAAGGC AAAAGCTTGA GATACTTGAT GAGTGTGTGG ACAGGGTCAG GGAGAGATTT
GGATATTATT CGGTGAGAAG AGGAATTTTG CTTCAGGACA GAGGATTAAA CAGGATTTAA
 
Protein sequence
MKRVILHCDL NNFYASVECL YHPELRDKPV AVCGSIEDRH GIVLAKNYAA KKYKVKTGET 
VWEAKNKCPG LVVVKANHSL YYKFSKYARQ IYEYYTDRVE SFGLDECWLD VSESTLLFGD
GTKIANEIRE RIKRELGVTV SVGVSYNKVF AKLGSDMKKP DAVTVITEND FKEKIWGLPV
EALLYVGDST KKKLNNMAVF TIGDLANCHS EFLVRQLGKW GYTLWSFANG YDTSPVAKND
CEIPIKSIGN SLTAPRDLTN NEDVRILIYV LSESVGERLR SHNLKGRTVQ ISIKDPELQT
LERQAGLDIH TSITSEIAQK AYEIFLKSWN WSKNVRALGV RVTDLVESDT CTQISLFSDD
IKRQKLEILD ECVDRVRERF GYYSVRRGIL LQDRGLNRI