Gene Cthe_1040 details


Gene Information

Locus tag: Cthe_1040
Symbol:
ID: 4811334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1243141
End bp: 1244169
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 36%
IMG OID: 640106458
Product: DNA polymerase III, delta subunit
Protein accession: YP_001037465
Protein GI: 125973555
COG category: [L] Replication, recombination and repair
COG ID: [COG1466] DNA polymerase III, delta subunit
TIGRFAM ID: [TIGR01128] DNA polymerase III, delta subunit
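
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(1243141, 1244169)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1029 342
```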


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000902231
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCGGG AGAGTAAAAT GAGTATAGAT AAATTGAAAC TGGAAGTAAA AAACAAAAGT 
CTGGGAAAAC TCTATCTTTT TTACGGCGAA GAAGAATATC TGAAAAAATT TTACCTGGGC
AAAATAGAAG AAATTATTTT AAGCAGGGAT CAAACCGGGC TGAACAAGAT AGTGATAGAG
GGAAAGGCGG AAGCTTCAAA GATTATTGAA GCATGTGAGA CAATGCCGTT TTTCGCGGAA
AAAAAACTGG TTGTTGTTAA AAAGTCGGAA TTGTTTAACT CAAAAAAGTC AGGCTCCTCA
AATAATAAAA ATGATGAATT AATCACGTAT TTACAAAATA TTCCTGAACA TACCTGCCTT
GTTTTTTATG AGGAAAATAT TGATAACAGA CTTAAAATAA CAAGTGCCGT GAAAAAATAC
GGTATGGTGG TGGAATTTCC TTTTCAAAAG CCGGCCGAAC TTGTTAAATG GGCCATTAAA
GTTTTCAAAT CCTACGGCAA GGCAATTGAT GAAAATACGG CATCATACCT TATAGATACA
TGTGAGGAAG GAATGACTGA AATATTAAAT GAAATAAACA AAGTTGTTCT TTATTTGGGC
GAAAGCCAGA AAGTTACCGT AGATAGTATA AAAAAGGTTT GCACAAAGTC AATAAAAAGC
AGAATATTTG ATTTAATTGA CGCCATAGCC GAAAGAAAAC TTGATTTGGC TTTAAAGCTC
TTAAATGACA TGATTATTTT AAAGGAACCC ATGCCAAAGA TTTTGTTTAT GATAGCAAAA
CAATTAAAAC AGTTGTTGGA ATTAAAGCTT TTGTGCAGCA AGGGCATGGA TGCAAAAGAA
GCATGTTCAA AGATGGGGAT AAATCCTTAT GCCGCGAAAA AAATGGTACG GCAGACCGAC
TGTTTTTCTT TGGAGAAACT GAAGGAAGCA ATACGACAAG CTCTTGAGCT GGATCTTTCG
ATAAAGACGG GGCAGATAAA CGACAGAACG GCCGTGGAAA TATTAATCTG CAGTTTGGCG
GCTGAATAA
 
Protein sequence
MSRESKMSID KLKLEVKNKS LGKLYLFYGE EEYLKKFYLG KIEEIILSRD QTGLNKIVIE 
GKAEASKIIE ACETMPFFAE KKLVVVKKSE LFNSKKSGSS NNKNDELITY LQNIPEHTCL
VFYEENIDNR LKITSAVKKY GMVVEFPFQK PAELVKWAIK VFKSYGKAID ENTASYLIDT
CEEGMTEILN EINKVVLYLG ESQKVTVDSI KKVCTKSIKS RIFDLIDAIA ERKLDLALKL
LNDMIILKEP MPKILFMIAK QLKQLLELKL LCSKGMDAKE ACSKMGINPY AAKKMVRQTD
CFSLEKLKEA IRQALELDLS IKTGQINDRT AVEILICSLA AE
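
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in Python. This is an illustrative sketch under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon table uses the NCBI standard layout (translation table 11 shares its codon-to-amino-acid assignments and differs in permitted start codons), and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# NCBI codon table layout: bases ordered T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in this record.
prefix = "ATGAGTCGGG AGAGTAAAAT GAGTATAGAT"
print(translate(prefix))   # MSRESKMSID
print(gc_percent("ATGC"))  # 50.0
```

The translated prefix matches the first ten residues of the protein sequence. Applying `gc_percent` to the full gene sequence should give a value comparable to the GC content field, although the rounding convention used for that field is not stated in the record.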