Gene Cthe_1734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1734 
Symbol 
ID4810164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2054533 
End bp2056407 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content30% 
IMG OID640107147 
ProductP4 family phage/plasmid primase 
Protein accessionYP_001038148 
Protein GI125974238 
COG category[R] General function prediction only 
COG ID[COG3378] Predicted ATPase 
TIGRFAM ID[TIGR01613] phage/plasmid primase, P4 family, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATTT TCAAAGGATA CATTCCTTTA AAAGGAAAAA AACCTTTAGA GGGATATAAG 
GACAGAAATA AATTTTATTG TTATGACTAT ATAAGAAAAA CAGGCAAAGA TTACGGTGCA
GTTTTACAAG ATGATATTAT TCAAATAGAT TTTGATTCAA TAGAAGAAGC AAATGTTGTA
AAAAACATAA TAACAGATTT AGATATTAAA TGTTGCATAT TAAAGACAGA TAGAGGGATG
CATTTTTATT TTAAAAACAC AGATTTAAAA ACAAGAAAAA TCAAGACAAA GACTCCTTTA
GGAGTAACAA TTGACATAGG ACTTGGCTCA AAGAATTGTA TTGTGCCATT GAGAGTGAAT
GGGGTAACAA GAAGATGGTT AAACAAAACA GAAGAAGTTG ACTATCTTCC AGAATGGCTA
AGACCATTAA AAACAGCACC GGATTTTCTA AATATGAAAG AAGGAGATGG AAGAAATCAA
GCATTATTCA ATTATATATT AACCCTTCAA TCAGAAGGTT TTAGTAAGGA TAGTATTAAA
AACATTATAA ATATCATAAA CAAATATATT TTAAAAGAAC CATTAGAACA AAGAGAAATA
GATACAATAC TTCGAGATGA AGCATTTTTA AAACCTTTGT TTTATCAAAA GTCAAAATTC
TTACATGACC AGTTTGCAAG ATTTATAAGG GATGAAGAGC ACATTATTAA AATTAATAAC
CAGCTCCATG TTTATAAGGA CGGAATTTAT AAAAATAGTA CTTTGGAAAT TGAATCAGTT
ATGATTAAAC ATTTACCAGA GCTTAATAAA GCAAAGAGGA ATGAAACTAT AAATTACCTA
GAACTTATTA CTAATAACGT TATTCCAAGC TTTGAAGATT ATAATCGCAT AGCATTTAAC
AATGGAATTT ATAACATAAT AGATGATTCT TTTACAGAAC ATTCTCCAGA TTTTATCATC
ACTAACAGAA TTCCGTGGGA TTATAACCCA AATGCTTATT TTGAATTAGC AGATAAAACC
TTAGATAAAA TCAGCTGCAA TGATGCTGAA ATTAGAAGTG TGCTTGAAGA ACTTATAGGA
TACACATTTT ATAGAAGAAA TGAAATTGGA AAAGCTTTCA TTTTAACTGG TGAAAAACAA
AATGGGAAAT CAACATTTTT GGATATGGTA ACTACACTTA TAGGTATTTC TAATATTGCA
GCTCTTGATT TAAAGGAGTT GGGAGAGCGA TTTAAAACAG CAGAATTATT TGGAAAGCTT
GCAAATATTG GTGACGATAT TGGAGATGAA TTTATTGCAG AGCCTTCAAT GTTTAAGAAG
CTTGTTACAG GGGATAGAGT TAATGCAGAG AGGAAAGGCA AAGACCCTTT TGATTTTAAT
AATTACTCCA AGCTTTTATT TTCAGCCAAT AATGTTCCAA GGGTTAAGGA TAAAACAGGA
GCTGTCCAAA GAAGACTTCT TATAATTCCG TTTAAGGCAA AGTTTACAGC AGATGATCCT
GATTTCAGAC CAGATATTAA ATATGAATTA AGGACAAAAG AGTCCATGGA ATATTTGATT
TTACTGGGTT TAAAGGGGCT TAAAAGAATC CTACAAAATA AAAAGTTCAC AAAATCCATA
CAAGTGGAGC ATGAGCTTAA AGAATATGAG AAGACCAACA ATCCAATTAT TGAATTCTAT
GAAGAATATG AAACAAAAGT AGAAAATGAG CCAACGAAAA ATGTCTATAA GAATTATTTG
GAGTTTTGTT TAAATAATAA TCTGCAGCCA CTTAGTCATA TAGAATTTTC AAGACAGATT
ACCAAGAGGT TCGGTTATAA AACTATAGAT AAAAAAATTG ATGGGAAGAA ATATAGGATA
TTTGTGAAAA TGTAA
 
Protein sequence
MDIFKGYIPL KGKKPLEGYK DRNKFYCYDY IRKTGKDYGA VLQDDIIQID FDSIEEANVV 
KNIITDLDIK CCILKTDRGM HFYFKNTDLK TRKIKTKTPL GVTIDIGLGS KNCIVPLRVN
GVTRRWLNKT EEVDYLPEWL RPLKTAPDFL NMKEGDGRNQ ALFNYILTLQ SEGFSKDSIK
NIINIINKYI LKEPLEQREI DTILRDEAFL KPLFYQKSKF LHDQFARFIR DEEHIIKINN
QLHVYKDGIY KNSTLEIESV MIKHLPELNK AKRNETINYL ELITNNVIPS FEDYNRIAFN
NGIYNIIDDS FTEHSPDFII TNRIPWDYNP NAYFELADKT LDKISCNDAE IRSVLEELIG
YTFYRRNEIG KAFILTGEKQ NGKSTFLDMV TTLIGISNIA ALDLKELGER FKTAELFGKL
ANIGDDIGDE FIAEPSMFKK LVTGDRVNAE RKGKDPFDFN NYSKLLFSAN NVPRVKDKTG
AVQRRLLIIP FKAKFTADDP DFRPDIKYEL RTKESMEYLI LLGLKGLKRI LQNKKFTKSI
QVEHELKEYE KTNNPIIEFY EEYETKVENE PTKNVYKNYL EFCLNNNLQP LSHIEFSRQI
TKRFGYKTID KKIDGKKYRI FVKM