Gene Cthe_2847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2847 
Symbol 
ID4809127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3362822 
End bp3364975 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content47% 
IMG OID640108267 
ProductP4 family phage/plasmid primase 
Protein accessionYP_001039239 
Protein GI125975329 
COG category[R] General function prediction only 
COG ID[COG3378] Predicted ATPase 
TIGRFAM ID[TIGR01613] phage/plasmid primase, P4 family, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.318392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATGA TGGACGCGGC ATTAAAATAC GCAGAAGCCA ATATCCCAGT TATACCTCTG 
CACTGGATTT GTGAGGATGG CTCCTGCTCC TGCAAGGCAG GGAGCGATTG CGACAGCAAG
GGAAAGCATC CGTTATATAC CGGCTGGTAC AAGAACTCCA CTACTGATGT TGAGCAAATA
AAGAAATGGT GGACGAAAAC CCCCAATGCC AATATCGGAA TTCCTACAGG TGAGAAATCC
GACTGGCTGG TGCTTGATGT GGACGATGGT GGTGATGAAA CCCTATCTGC ACTTGAGGCA
ACACATGGAA AACTTCCGGA TACGGTTACT GCTGTTACAG GAAGTGGAGG TCGGCACTAT
GTATTTATAT ACCCTAAAGG CCGGAGTATT CCTAATAAGA CCAAGTTTGC ACCGGGTCTT
GATATGCGTT CAACAGGTGG ATTGATTGCC GTAGCTCCAA GCATTCATAT AAGCGGTAAT
CGGTATGAAT GGTTAGAAGG ACATTCTCCC TTTGAGAGAA TCCCGGCAGA AGCTCCAGCA
TGGTTGTTGA AGCTCATGGA AAGGGTGGAA GTATTGCTTA CACCCTTTGA AGGTAGCAGT
ATTATTGCCG AGATTAAGGA AGGAAACCGC AACAGTACCC TGACAAGCCT TGCCGGAACC
ATGAGGGCAA GAGGAATGAC AGAAGAGAGC ATCTATGCGG CATTGCTTGC AGAAAACAAC
GCAAGGTGCA ATCCTCCGCT TGATGAAGCG GAAGTTAGAA AGATAGCGCA CAGTGTCAGC
CGATACCAGC CAAATCCTCC GATGAAGAAG CATTACCACA GGACAGACAG CGGGAATGCA
GAAAGGCTGC GTGACAGGTT TGGTGAAATC ATTAGGTATT GTCCGGCTTT CAAATACTGG
TTGGTATATG ACGGCTGTTG CTGGAGGAAA GAAACCGGAG AACTTATGCA GTTTGCTATA
AAAACAGCAA GAGACATGCT CGCAGAAGCA AGCCGGATAG AGGATGAGGC TGCAAGAAAA
GAACTGGTGC GCCATGCCAT GCAGTCTGAA AACGCAGGCA GGCTTAAAGC CATGATCGAT
GTGGCTTCAA ACCTTGAAGG AATGGTAATT ATGCCGGATG AGATTGATTC TGATATATGG
AAGCTGAACT GTAGAAATGG TGTGGTAGAC CTAAAGACAG GCGAACTCCT TCCTCATAAG
CGGGAGTACT ATATGAGCAA AATCTGCCCT GTTGAATATA AACCAAGCAG CAAGGCTCCC
AAATGGATGG AATTTCTGAA TACCATTACG GGAGGAAGCA AGGAGCTTGT AAGATACCTT
CAAAAAGCTG TAGGCTCGTC ATTAAGCGGG GATATTTCAG AGCAGGCCCT ATTCGTCCTT
TATGGAACAG GAGCAAACGG AAAGAGCACA TTTCTAAACA CCATCTCTGA CCTGTTGGGA
GACTATGCAA GAAATACTCC GTCCGAAACT TTTATGGCTA AAAGAATAGA AGCGATAGGA
AATGATATCG CAAGGCTTCA GGGAGCAAGG CTCGTTACTG CCATAGAAAT AAATGAGGGA
CAAAGGCTCT CTGAGGCATT GATTAAGAGC TTTACAGGCG GAGACAGAAT TACAGCAAGG
TTTCTTTATG GAGAATACTT TGATTTCCAG CCACAGTTCA CCCCGTTTCT CGTAGTAAAC
CACAGACCAG TCATAAGAGA TACCAGCCAC AGCATTTGGA GGCGCATTAA GCTGATTCCT
TTCACCGTTA CCATACCCGA GGATAAAAAG GATAAGCAGC TACCGGCAAA GCTGAGAGAA
GAGCTGCCTG GCATATTGTC ATGGGCAGTA GAGGGTTGCC TTCTTTGGCA GAAGGAAGGA
CTAAATATGC CTGATGAAGT CAAAAAAGCC ACAGAAGGTT ACCGGGAGGA AATGGATACC
TTCTCAAGTT TTATAGAGGA ATGCTGCATT GTGGAGGAGG GCAGGAAAGT CTCCAATAGA
AGCATCAGGT ACGCTTACGA AACATGGTGC CGGGAAAATG GAGACTACCC TCTTGGACAA
AAGCTATTCA ATGCAAAAAT GACGGAGCGC GGCTTTGCTG TCAAACGCAG CGGAGCCAAT
GGCAGCAGGG ACTGGCATGG TATTGGTCTT GCGGATGAGG GGATACTTTT GTGA
 
Protein sequence
MTMMDAALKY AEANIPVIPL HWICEDGSCS CKAGSDCDSK GKHPLYTGWY KNSTTDVEQI 
KKWWTKTPNA NIGIPTGEKS DWLVLDVDDG GDETLSALEA THGKLPDTVT AVTGSGGRHY
VFIYPKGRSI PNKTKFAPGL DMRSTGGLIA VAPSIHISGN RYEWLEGHSP FERIPAEAPA
WLLKLMERVE VLLTPFEGSS IIAEIKEGNR NSTLTSLAGT MRARGMTEES IYAALLAENN
ARCNPPLDEA EVRKIAHSVS RYQPNPPMKK HYHRTDSGNA ERLRDRFGEI IRYCPAFKYW
LVYDGCCWRK ETGELMQFAI KTARDMLAEA SRIEDEAARK ELVRHAMQSE NAGRLKAMID
VASNLEGMVI MPDEIDSDIW KLNCRNGVVD LKTGELLPHK REYYMSKICP VEYKPSSKAP
KWMEFLNTIT GGSKELVRYL QKAVGSSLSG DISEQALFVL YGTGANGKST FLNTISDLLG
DYARNTPSET FMAKRIEAIG NDIARLQGAR LVTAIEINEG QRLSEALIKS FTGGDRITAR
FLYGEYFDFQ PQFTPFLVVN HRPVIRDTSH SIWRRIKLIP FTVTIPEDKK DKQLPAKLRE
ELPGILSWAV EGCLLWQKEG LNMPDEVKKA TEGYREEMDT FSSFIEECCI VEEGRKVSNR
SIRYAYETWC RENGDYPLGQ KLFNAKMTER GFAVKRSGAN GSRDWHGIGL ADEGILL