Gene Cthe_1986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1986 
Symbol 
ID4810918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2366420 
End bp2368573 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content47% 
IMG OID640107402 
ProductP4 family phage/plasmid primase 
Protein accessionYP_001038397 
Protein GI125974487 
COG category[R] General function prediction only 
COG ID[COG3378] Predicted ATPase 
TIGRFAM ID[TIGR01613] phage/plasmid primase, P4 family, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAATGA TGGACGCAGC ATTAAAATAT GCAGAAGCCA ATATCCCGGT TATACCTCTG 
CATTGGATTT GTGAGGATGG CTCATGCTCC TGCAAGGTAG GGAGTAATTG CGACAGCAAG
GGAAAGCATC CGTTATATAC CGGCTGGTAC AAGAATTCTA CTGCTGATAT TGAGCAAATA
AAGAAATGGT GGACGAAAAC ACCCAATGCC AATATCGGTA TTCCTACAGG TGAGAAATCC
GGCTGGCTGG TGCTTGATGT GGACGATGGC GGCAATGAAA CTCTATCGGC TCTTGAAGCA
ACACATGGAA AACTTCCGGA TACGGTTACT GCTGTTACAG GAAGTGGCGG TCGGCACTAT
GTCTTCAAAT ACCCACAAGG CAGGAGCATT CCTAATAAGA CCAAGTTTGT ACCGGGACTT
GATACCCGTT CAACAGGTGG TCTGATTGTC GCAGCTCCAA GCATTCATGT AAGCGGTAAT
CGGTATGAAT GGATAAAGGA TCATTCTCCC TTTGACAGAA CCCCGGCAGA AGCTCCGGAG
TGGCTGTTAA AGCTTATGGA AAGGGAGGAA GTATTGCTTA CACCCTTTGA AGGTAGCAGT
ATTACAGCCG AGATTATGGA AGGCAGTCGG AACAGTACCC TGACAAGCCT TGCAGGAACC
ATGAGGGCAA GAGGAATGAC AGAAGAGGGC ATCTATGCAG CGCTGCTTGC CGAAAACAAC
GCAAGGTGCA ATCCTCCGCT TGATGAAGCG GAAGTTAGAA ATATAGTGCA CAGTGTCAGC
CGATACCAGC CAAATCCTCC GATGAAGAAA CATTACCACA GGACAGACAG CGGTAATGCA
GAAAGGCTGC GTGACCGGTT TGGTTCAATC ATAAGGTATT GTCCGGCTTT TAAATACTGG
TTGGTATATG ACGGCTGTTG CTGGAGGAAA GAAACCGGAG AACTTATGCA GTTTGCTATA
AAAACAGCAA GAGACATGCT CGCAGAAGCA AGCCAGATAG AGGATGAAGC TACAAGAAAA
GAACTGGTGC ACCATGCCAT GCAGTCTGAG AACGCAGGCA GGCTTAAAGC CATGATCGAT
GTGGCTTCAA ACCTTGAAGG CCTGATAATC ATGCCAGATG AGCTTGATTC TGATATATGG
AAGCTGAACT GTAAAAATGG TGTTGTAGAT TTAAAGACAG GGGAACTTCT ACCCCATAAG
CGTGAGTACT ATATGAGCAA AATCTGTCCT GTTGAATATA GCCCAGAAAG CAAGGCTCCC
AGATGGATTG AGTTTCTGAA TACCATTACG GGAGGAAGCA ACGAGCTTGT AAGATACCTT
CAAAAGGCTG TGGGCTCATC TTTAAGCGGG GATATTTCAG AGCAGGCCTT ATTCGTCCTC
TATGGAACAG GAGCAAATGG CAAGAGCACA TTTCTAAACA CCATCTCCGA CCTGTTGGGA
GATTATGCAA GAAACACTCC ATCCGAAACC TTTATGGCAA AAAGGATAGA AGCGATAGGA
AATGATATTG CGAGATTACA GGGAGCAAGG CTTGTAACTG CCATAGAAAT AAATGAGGGA
CAAAGGCTCT CTGAGGCATT GATTAAGAGC TTTACAGGCG GAGACAGAAT CACAGCAAGG
TTCCTTTATG GAGAATACTT TGATTTCCAG CCACAGTTCA CCCCGTTTCT CGTAGTAAAC
CACAGACCAG TCATAAGAGA TACCAGCCAC AGCATTTGGA GGCGCATTAA GCTGATTCCT
TTCACCGTTA CCATACCCGA GGATAAAAAG GATAAGCAGC TACCGGCAAA GCTGAGAGAA
GAGCTGCCTG GCATATTGTC ATGGGCAGTA GAGGGTTGCC TTCTTTGGCA GAAGGAAGGA
CTTGAAATGC CTGATGAAGT CAAAGAAGCT ACAGAAGGTT ACCGGGAGGA AATGGATACC
TTCTCAAGTT TTATAGAGGA ATGCTGCATT GTGGAGGAGG GCAGGAAAGT CTCCAATAGA
AGCATCAGGT ATGCTTACGA AACATGGTGC CGGGAAAATG GAGACTACCC TCTTGGACAA
AAGCTATTCA ATGCAAAAAT GACGGAGCGC GGCTTTGCTG TCAAACGCAG CGGAGCCAAT
GGCAGCAGGG ACTGGCATGG CATTGGTCTT GCGGATGAGG GGATACTTTT GTGA
 
Protein sequence
MTMMDAALKY AEANIPVIPL HWICEDGSCS CKVGSNCDSK GKHPLYTGWY KNSTADIEQI 
KKWWTKTPNA NIGIPTGEKS GWLVLDVDDG GNETLSALEA THGKLPDTVT AVTGSGGRHY
VFKYPQGRSI PNKTKFVPGL DTRSTGGLIV AAPSIHVSGN RYEWIKDHSP FDRTPAEAPE
WLLKLMEREE VLLTPFEGSS ITAEIMEGSR NSTLTSLAGT MRARGMTEEG IYAALLAENN
ARCNPPLDEA EVRNIVHSVS RYQPNPPMKK HYHRTDSGNA ERLRDRFGSI IRYCPAFKYW
LVYDGCCWRK ETGELMQFAI KTARDMLAEA SQIEDEATRK ELVHHAMQSE NAGRLKAMID
VASNLEGLII MPDELDSDIW KLNCKNGVVD LKTGELLPHK REYYMSKICP VEYSPESKAP
RWIEFLNTIT GGSNELVRYL QKAVGSSLSG DISEQALFVL YGTGANGKST FLNTISDLLG
DYARNTPSET FMAKRIEAIG NDIARLQGAR LVTAIEINEG QRLSEALIKS FTGGDRITAR
FLYGEYFDFQ PQFTPFLVVN HRPVIRDTSH SIWRRIKLIP FTVTIPEDKK DKQLPAKLRE
ELPGILSWAV EGCLLWQKEG LEMPDEVKEA TEGYREEMDT FSSFIEECCI VEEGRKVSNR
SIRYAYETWC RENGDYPLGQ KLFNAKMTER GFAVKRSGAN GSRDWHGIGL ADEGILL