Gene Cthe_1156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1156 
Symbol 
ID4810824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1373054 
End bp1375117 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content34% 
IMG OID640106578 
Productzinc finger, CHC2-type 
Protein accessionYP_001037581 
Protein GI125973671 
COG category[L] Replication, recombination and repair 
COG ID[COG0358] DNA primase (bacterial type) 
TIGRFAM ID[TIGR01391] DNA primase, catalytic core 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000142218 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGAA GTATAGATAG CATCATAGCC AAAAAAGTTT CTCACGATGT TGGTATTTTA 
ATATTTCAAT ACCTTAATGA TATCGAACCC TTAATCCAGC AATCTCTTAC TGCCGCAAAA
TTGAAAGACT TTTACGGGCT TAAAGGGAAA TATAGCAAAG GACGGTTTGT TTGTCCGTTC
CATCACGGTG CAGATAACCC AACTTCCCTT TCTATGAATG ATAACAAGCG ATGTTTTCAC
TGCAACGCCT GTGGAAAAAG CGGCACATAC CTTGAATTTA TTATGTATAT GCAAAATATT
TCTGGAAGGA ACAGTTATAA TGATGCAAAA ATTTTTGCTG CAAACCATTT TACTAAACTG
AACTTAGGAT TCAATTCAAT CGCAGATTTT GAACAGAAAT TGAAAGATAA AATATTAGAA
CGTTATCGTA AAACTAATAG TCTTAGATAT ACGGACTACT ATGATATTTG GCTTCTACCC
GAAAGATATT TTTCTTCTAA AGAACAGTCA CAAGTCCGCC AATACCAAGA AGAACAACAA
ACAAAGCCTT CTTTAGAAGT TTCTCTTTCA AACTCTACAA ACAAGCCCGG TTTTATACTG
GAATCACTGG ACCTTGAAAT GACTATCAGG CATCTTAAAG CCCAAAATAT CATTGTTAAT
GGAATTACTG ATTATGACGA GGCGGTTCAG GTGGCTAAGA ACAGTAAACT GATTAAGGAT
TTTACCAAAA ATGCAGATAA ATTAAATTCT GTCGATAAAC TCACTGATTA TATGAGTAAG
AAATATAATA TAGACATAGA TACTGCTTTG AAATATGGTT TAATATTTTT TGATAAAAGC
AGCCAACATC AACTCTACTA TAGCGATTTC TTCATGTTGA ATAACAGAGT CTTGTTCCCT
GTAAGAGACC ATGAAACCGG CATTATTGTC GGGTATCAGT GCCGGCGGAC TGACTTAAGT
GCTCCAAGAC ATTATAAATA TCTAAACATA ACCGACTACC AGGACAACTT GATAACAAAT
GAAAACGGAA CAACATATAG AGATTTTGTT CCTTTTAAGG TTGGAAACTT CCTTTTTAAC
CTTTATGAAC TTAAGGGTAA GTGCATTAAT ACCTTATGGA TTACAGAAGG GATTGCAGAT
GCGATTAAAC TTTCCAGTAT GGGATATGAT GCTGTTTCTC TTGGGCAGGC AAACTTGACC
GACTATCAAA TATATTTAAT TGACAAGTAT TTTGGCAAAG ATGTGACTCT TAACCTATTC
TTCGACAATG ATGACAATAA AATTGGGCAA AACAAAAGTA TCGCAGCAGC CTACCGCCTT
TGGCAGTTTG GTTTCAGGAA TATTAGAATT ATAAATACCT TTAAGGAAAT GGGAAAAGAT
ATAACTGATT GCTCTGTTAA ACTGCATGAT GATGATATGC TAAGACTTTT TATTAATCAC
TGGGAAAAGC AGGCATATTC ATTTGCCCCT GCTAGCAATG AAGATTTAAG TGCTTTATTA
AAGACAGGTC TTTATAGTGA AAGTGAAATC CTGTTTATTG ACCCAAGAGA TGTTGAGCGG
ATAATTAGTT TTGGTGAAAT GCTTAATAAA TATTTAGATT TTAAGAATAT GACTTATCAG
CAACTTAAAT TATTAAAGAA ATTATGCTCC TTTAAGGAAG ACGAAATTAA AATTCTATTA
AGCCTTTCTA CAAAGGATTT CCTTGACGTA CAGAATACTA ATGAAACAGA GGCTGCCATA
AAAGAAAACA GCAATACTAA TATAGACAGT ACCCAAGAAG AAGGAAATGT TGAAGGAAGC
CTAATCAACA TTTCTAAAGC GCAACTGTTT CACCTTAAAA AGAGATTTGA TATAGGCATA
ATTAAAAAAA TAGATAACGA ATGTTCCAAA AAACAGATTG CCGCTATCGT CGGTAATATA
ATAAAAAATA AAGATTTTGA TGTATGGGAT TACATCCATA AAAAAGGAAG TCAGCATTCT
TCAACTAGTA TAGCCCCTAG CACCACTATA CTGGATGATG GCAACTTCAC TCCTGTATAT
GACGATGTCA GTATTCCTTT TTAA
 
Protein sequence
MNRSIDSIIA KKVSHDVGIL IFQYLNDIEP LIQQSLTAAK LKDFYGLKGK YSKGRFVCPF 
HHGADNPTSL SMNDNKRCFH CNACGKSGTY LEFIMYMQNI SGRNSYNDAK IFAANHFTKL
NLGFNSIADF EQKLKDKILE RYRKTNSLRY TDYYDIWLLP ERYFSSKEQS QVRQYQEEQQ
TKPSLEVSLS NSTNKPGFIL ESLDLEMTIR HLKAQNIIVN GITDYDEAVQ VAKNSKLIKD
FTKNADKLNS VDKLTDYMSK KYNIDIDTAL KYGLIFFDKS SQHQLYYSDF FMLNNRVLFP
VRDHETGIIV GYQCRRTDLS APRHYKYLNI TDYQDNLITN ENGTTYRDFV PFKVGNFLFN
LYELKGKCIN TLWITEGIAD AIKLSSMGYD AVSLGQANLT DYQIYLIDKY FGKDVTLNLF
FDNDDNKIGQ NKSIAAAYRL WQFGFRNIRI INTFKEMGKD ITDCSVKLHD DDMLRLFINH
WEKQAYSFAP ASNEDLSALL KTGLYSESEI LFIDPRDVER IISFGEMLNK YLDFKNMTYQ
QLKLLKKLCS FKEDEIKILL SLSTKDFLDV QNTNETEAAI KENSNTNIDS TQEEGNVEGS
LINISKAQLF HLKKRFDIGI IKKIDNECSK KQIAAIVGNI IKNKDFDVWD YIHKKGSQHS
STSIAPSTTI LDDGNFTPVY DDVSIPF