Gene Cthe_1264 details


Gene Information

Locus tag: Cthe_1264
Symbol: dnaE
ID: 4809769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1532421
End bp: 1535966
Gene Length: 3546 bp
Protein Length: 1181 aa
Translation table: 11
GC content: 42%
IMG OID: 640106687
Product: DNA polymerase III DnaE
Protein accession: YP_001037689
Protein GI: 125973779
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
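
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1532421, 1535966)
print(length)                     # 3546
print(protein_length_aa(length))  # 1181
```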


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.756254
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGCAGA AATTTGTACA CCTTCATGTC CATACTGAAT ACAGTCTTTT GGACGGGGCA 
AACAGAATAA AGGACCTCAT AAGGCGTACA AAAGAGCTGG GAATGGACAG TATAGCAATA
ACCGACCATG GTGTAATGTA CGGAGTTGTT GATTTTTACA AGGAAGCCGT CAATAATGGA
ATAAAACCCA TACTTGGGTG TGAAATATAT ACTGCCAAAG GGTCAAGGTT TGACAAACAG
GGAGGCTGGG ATTCGGATCC CGGTCATATG GTGCTTTTGG CAAAAAATAA TACGGGATAC
AAAAATCTGA TGAAAATTGT ATCCATAGGT TTTACCGAAG GATTTTACTA TAAACCCAGG
GTTGACATGG AAGTTCTCGA AAAATACAGT GAAGGCTTGA TTGCTATGAG TGCCTGCCTT
TCCGGAGACA TACCCAAAGC GATTTTAAAC AACAATTATG AAAAGGCAAA GGAGCTGGCA
CTTAAGCTCA ACAGCATTTT CGGACAGGAT AATTTTTATC TTGAGCTTCA GATGAACGGT
ATCGAAGAGC AGAACATAGT CAACCAGCAG CTTATAAAGC TTAGCAGGGA AACGGGAATA
CCCCTTGTTG CCACCAATGA CGCCCATTAT CTTAGAAGAG AGGATGCCCG TGCCCATGAA
ATCCTTCTTT GCATACAAAC GGGAAAGAGT ATCAATGACG AAGACAGAAT GAGGTTTTCT
TCGGATGATT TTTATATAAA GTCCCCTGAA GAAATGATCA GTCTTTTCAG AAACATTCCC
GAAGCAATTT CAAATACCGT AAGGATTGCG GACATGTGCA ATGTCGAGCT TGAGTTTAAC
AAGCTGCACC TGCCCAAATT TGACGTGCCG GACGGAAAAG ACCCTTTCGA ATATCTAAGG
GCCCTGTGCT ATGAGGGATT TGAAAGAATT TACGGAAAAG ACAACCGGGA TGAGGAGAAA
ATAAACAGGC TTGAATATGA GCTTTCGGTA ATAAAGCAGA TGGGTTATGT GGATTATTTC
CTCATTGTGA GCGATTTTAT CAGATATGCA AAGGAAAAAG GGATAATGGT GGGACCCGGA
AGGGGTTCCG CGGCCGGAAG TTTTGTGGCC TACTGCCTTG GAATTACAAA TATTGATCCG
TTAAAGTACA ATCTTCTGTT TGAGAGATTT TTAAATCCGG AGAGAATAAG CATGCCGGAT
ATCGACATAG ACTTTTGCTT CGAAAGAAGG CAGGAAGTTA TAGATTATGT TGTCGAAAAG
TATGGAAAGG ACAGGGTTGC ACAGATAATT ACTTTTGGAA CCATGGCTGC AAGAGCGGTT
ATACGGGATG TGGGAAGAGC CCTTGACATA CCCTATGGAG AGGTTGATGC CATTGCCAAA
ATGATACCTT TTCAGATTGG CATGAGTATA GAAAAGGCCA TGGAGCTAAA TCCCGAGCTT
CGCCAGAGGT ATAATGATGA TGAAAGGGTA AAGGAGCTGA TTGATACCGC AAAGCTTCTT
GAAGGCATGC CGAGACATGC TTCCACCCAT GCCGCGGGAG TGGTTATTTC CCGGGAACCT
CTGACGGAAT ATGTTCCCCT TCAAAGGAGT GAGGACAGTA TCACGACTCA ATTTCCCATG
GGAATTTTGG AGGAACTGGG ACTTCTCAAG ATGGACTTTC TGGGACTTAG GACTCTTACG
GTAATAAGGG ATGCGGTGGC TCTTATAAAG AAGAATCACA ATATAGATGT AAAAATTGAC
GAGCTTCAAA TGGATGACCC AAATGTGTTC AAACTCATAG GAGAGGGACG GACGGCGGGA
GTGTTCCAAT TGGAAAGTGC GGGTATGACC CAGTTCATGA AGGAGCTTCA GCCGGCGTCT
CTGGAAGATA TAATAGCCGG CATATCCCTG TATCGGCCGG GTCCTATGGA TCAGATTCCA
AGATATCTTA GAAACAAAAA CAATCCGGAG CTTGTAAAGT ATGACCATCC TTTGCTGGAA
AACATACTGA ATGTAACTTA TGGATGCATG GTTTATCAGG AGCAGGTAAT GCAGATAGTC
CGCGACCTTG CCGGTTACTC AATGGGAAGG TCCGATCTTG TAAGGCGTGC AATGGCCAAG
AAAAAGGTCA GTGTAATGGA ACAGGAGAGA AAAAATTTTA TTTATGGAAT AGATGACGAT
AATGGAAATA TTATAGTAAA GGGAGCTGTC AGAAACGGTG TGGATGAAGA GACAGCAAAC
AAAATATTTG ATGAGATGAT GGACTTTGCA AGCTATGCGT TTAACAAATC CCATGCAGCT
GCTTATGCAG TTATAGCGTA TCAAACTGCA TGGCTCAAAT GTTATTATCC TGTGGAATTT
ATTGCAGCGC TTTTAAACAG TTTTATGGGA AGCAGCGATA AAATATCCCA GTATGTGCAT
GAGTGCCGCA AGCTGGGTAT TGAAGTACTT CCTCCGGATA TAAATGAAAG CGATGTCAGA
TTTACCGTTG TGAACGGAAA GATAAGATTT GGGCTTGCAG CGGTTAAAAA TGTCGGTGAG
AATGCAGTCA GGTCAATTAT TGATGAAAGG AATATAAACG GAAACTATAA AAGTTTCAGG
AACTTTTTGG AGAGAGTTGA CGGAAAGGAT GTAAACAAAA GATGCATTGA AAGCCTCATA
AAGAGCGGAG CTTTTGATTC AATGGGAGTA TACCGTTCAA GGCTTATGAA TGCTTATGAG
AAAATGATGG AAGGAATATC CAGCCAGAGG AAAAAGAGCA TGGAAGGCCA GCTTTCCATA
TTTGACATGG CACTTAACAC CGAGGATGGC AAAAAGGATG AGAAGCATCA GCTATATCCG
GAAGATGAAG ATATTTATCC CGATATTCCC GAATATTCGC AGAAGATACT TCTGTCAATG
GAAAAGGAAA TGCTGGGCCT TTATATATCG GGACATCCTT TGAGTGAATT TGAAAAGGAA
TTCAGTGAAG TGGTTACATT GTACAGCAAG GATATGGTGT CGGATGCGGA TGAAAACGGT
GAGGTAATAA CTGTTGAAGG AAACAAAGGT TTAAAAGACG GTATGACTGT GACGGTTGGG
GGAATAATTA CTTCAAGGAA GACAAAAACC ACGAAGAATA ACAATTTGAT GGCTTTTGTG
ACATTGGAAG ATTTGTATGG CACAATGGAG ATAATAGTTT TTCCTGCTGT TTTGGAGAGG
TTTTCGAACC TTCTGGAGGT GGAAAACATT GTTCTGATAA AAGGAAGTAT AAGTATAAAA
GAAGAGGAAC AGCCAAAAAT AATATGCGAG GAAGTAAGGC CGTTAAGAAA AGAAGACGGC
GCAAATCCGC TGAAAAGAAA AGTGGTAAAG CTTTATTTGA GAGTGGACGA TAACATTGAC
AACGAGCTGA TGGAATCAAT AATTTGCATG CTGAAGTTTT TTGGTGGGAA CACTCCTGTA
TGCCTTTACA ATGAAAGCCA AAAAAAGATC AAGGTGTTGG AAAGGGATTG CTGGGTAAGC
CTTAATGACA CCGTGATTAA TGAATTGAAG TTACTTATAG GGGAGGAAAA TGTCAAAGTC
TCGTAA
 
Protein sequence
MLQKFVHLHV HTEYSLLDGA NRIKDLIRRT KELGMDSIAI TDHGVMYGVV DFYKEAVNNG 
IKPILGCEIY TAKGSRFDKQ GGWDSDPGHM VLLAKNNTGY KNLMKIVSIG FTEGFYYKPR
VDMEVLEKYS EGLIAMSACL SGDIPKAILN NNYEKAKELA LKLNSIFGQD NFYLELQMNG
IEEQNIVNQQ LIKLSRETGI PLVATNDAHY LRREDARAHE ILLCIQTGKS INDEDRMRFS
SDDFYIKSPE EMISLFRNIP EAISNTVRIA DMCNVELEFN KLHLPKFDVP DGKDPFEYLR
ALCYEGFERI YGKDNRDEEK INRLEYELSV IKQMGYVDYF LIVSDFIRYA KEKGIMVGPG
RGSAAGSFVA YCLGITNIDP LKYNLLFERF LNPERISMPD IDIDFCFERR QEVIDYVVEK
YGKDRVAQII TFGTMAARAV IRDVGRALDI PYGEVDAIAK MIPFQIGMSI EKAMELNPEL
RQRYNDDERV KELIDTAKLL EGMPRHASTH AAGVVISREP LTEYVPLQRS EDSITTQFPM
GILEELGLLK MDFLGLRTLT VIRDAVALIK KNHNIDVKID ELQMDDPNVF KLIGEGRTAG
VFQLESAGMT QFMKELQPAS LEDIIAGISL YRPGPMDQIP RYLRNKNNPE LVKYDHPLLE
NILNVTYGCM VYQEQVMQIV RDLAGYSMGR SDLVRRAMAK KKVSVMEQER KNFIYGIDDD
NGNIIVKGAV RNGVDEETAN KIFDEMMDFA SYAFNKSHAA AYAVIAYQTA WLKCYYPVEF
IAALLNSFMG SSDKISQYVH ECRKLGIEVL PPDINESDVR FTVVNGKIRF GLAAVKNVGE
NAVRSIIDER NINGNYKSFR NFLERVDGKD VNKRCIESLI KSGAFDSMGV YRSRLMNAYE
KMMEGISSQR KKSMEGQLSI FDMALNTEDG KKDEKHQLYP EDEDIYPDIP EYSQKILLSM
EKEMLGLYIS GHPLSEFEKE FSEVVTLYSK DMVSDADENG EVITVEGNKG LKDGMTVTVG
GIITSRKTKT TKNNNLMAFV TLEDLYGTME IIVFPAVLER FSNLLEVENI VLIKGSISIK
EEEQPKIICE EVRPLRKEDG ANPLKRKVVK LYLRVDDNID NELMESIICM LKFFGGNTPV
CLYNESQKKI KVLERDCWVS LNDTVINELK LLIGEENVKV S
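
As a rough illustration of how the protein sequence and the GC content relate to the gene sequence, the following sketch translates a nucleotide string and computes its GC percentage. It makes two assumptions. First, translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which does not matter here because the gene begins with ATG. Second, the space-separated 10-base blocks are purely a display format. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid assignments
# are the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGTTGCAGA AATTTGTACA CCTTCATGTC CATACTGAAT ACAGTCTTTT GGACGGGGCA"
print(translate(first_line))  # MLQKFVHLHVHTEYSLLDGA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and the GC content listed in the record.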