Gene Cthe_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1052 
Symbol 
ID4811350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1256707 
End bp1257948 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content45% 
IMG OID640106474 
Productcompetence/damage-inducible protein CinA 
Protein accessionYP_001037477 
Protein GI125973567 
COG category[R] General function prediction only 
COG ID[COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA
[COG1546] Uncharacterized protein (competence- and mitomycin-induced) 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain
[TIGR00199] competence/damage-inducible protein CinA C-terminal domain
[TIGR00200] competence/damage-inducible protein CinA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000170345 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCGG AGATATTAGC GGTTGGAACC GAGCTTTTAA TGGGGCAGAT AGCAAATACC 
AATGCCCAGT ATATATCCAA AAGGCTCAAT GACATTGGTG TGAATGTGTA TTATCACAGT
GTGGTGGGGG ACAATTCCGT TCGGCTGAAA AAATGTCTTC TTGCAGCTTT GGAAAGGTGC
GACCTTGTTA TTATGACCGG AGGACTCGGC CCCACGCAGG ATGACCTTAC AAAGGAGACT
GTTGCGGAAG TTTTGGGGAA AAAGCTTGTT TTACACGAAG AAAGCCTTGA GAGGATTAAA
ACTTTTTTTA CCAGGATAAA CCGAAAAATG ACGGACAATA ATGTCAAGCA GGCATATCTT
CCGGAAGGGT GCACAGTGGT TGAGAATAAC AGCGGCACAG CCCCGGGCTG CATTATTGAA
GATAAGGGAA AGATTGTGGT AATGCTTCCC GGTCCTCCGC CGGAGATGAT GCCGATGCTT
GATGATACCG TTATTCCTTA CCTTGCGGAA AAATCAGGAT ACAGGATAGT GTCAAAATAT
CTGAGGGTTT TTGGAATAGG GGAATCACAG CTTGAAGAGA TGATTATGGA TTTAGTTGAC
AAACAGGACA GGGTCACCAT AGCAACTTAT GCAAAAGACG GGCAGGTGAC CGTAAGACTT
ACCACAAAAG CCAGGACAGA GGAAGAAGGC TTTCGTGAAA TACTTCCTTT GCAAAATGAG
ATAGCTTCAA GACTCAAAGA GGCATTATAC AGTACGGAAG ATGAAGAGCT GGAATATGTG
GCGGCAAAGA TGCTTATTGA CAACAACATT ACAATAGCAA CTGCCGAATC TTGTACCGGT
GGGCTGATTT CAGCAAGGCT TACCGATGTG CCCGGAATAT CAAAGGTTTT TAACAGAGGT
ATTGTATCTT ACAGCAATGA AGCCAAGATG GAAAACCTCG GGGTTAAGCC TGAGACTTTG
GAAAAGTACG GTGCCGTAAG CAGCCGGACT GCAATGGAGA TGGCTGAAGG TGTAAGGAAA
ATCGCCTCAA CTGATATAGG GCTGGCGGTT ACAGGTATTG CAGGTCCTGA CGGAGGCACT
GATGAAAAAC CGGTGGGATT GGTTTATGTT GCCCTGGCCC ATAGCCTGGG GACGGAGGTA
AGGGAACTTA GGCTTGCCGG GAACAGAAAC AGAATAAGAA ACCTTACAGT GCTTAATGCT
TTTGACATGG TAAGAAGATA TGTAATGAAG CTGAAAGGGT AA
 
Protein sequence
MNAEILAVGT ELLMGQIANT NAQYISKRLN DIGVNVYYHS VVGDNSVRLK KCLLAALERC 
DLVIMTGGLG PTQDDLTKET VAEVLGKKLV LHEESLERIK TFFTRINRKM TDNNVKQAYL
PEGCTVVENN SGTAPGCIIE DKGKIVVMLP GPPPEMMPML DDTVIPYLAE KSGYRIVSKY
LRVFGIGESQ LEEMIMDLVD KQDRVTIATY AKDGQVTVRL TTKARTEEEG FREILPLQNE
IASRLKEALY STEDEELEYV AAKMLIDNNI TIATAESCTG GLISARLTDV PGISKVFNRG
IVSYSNEAKM ENLGVKPETL EKYGAVSSRT AMEMAEGVRK IASTDIGLAV TGIAGPDGGT
DEKPVGLVYV ALAHSLGTEV RELRLAGNRN RIRNLTVLNA FDMVRRYVMK LKG