Gene Information |
Locus tag | Cthe_1052 |
Symbol | |
ID | 4811350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1256707 |
End bp | 1257948 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640106474 |
Product | competence/damage-inducible protein CinA |
Protein accession | YP_001037477 |
Protein GI | 125973567 |
COG category | [R] General function prediction only |
COG ID | [COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA; [COG1546] Uncharacterized protein (competence- and mitomycin-induced) |
TIGRFAM ID | [TIGR00177] molybdenum cofactor synthesis domain; [TIGR00199] competence/damage-inducible protein CinA C-terminal domain; [TIGR00200] competence/damage-inducible protein CinA N-terminal domain |
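The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1256707
end_bp = 1257948
reported_gene_length = 1242    # bp
reported_protein_length = 413  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1242 0 413
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.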
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000170345 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCGG AGATATTAGC GGTTGGAACC GAGCTTTTAA TGGGGCAGAT AGCAAATACC AATGCCCAGT ATATATCCAA AAGGCTCAAT GACATTGGTG TGAATGTGTA TTATCACAGT GTGGTGGGGG ACAATTCCGT TCGGCTGAAA AAATGTCTTC TTGCAGCTTT GGAAAGGTGC GACCTTGTTA TTATGACCGG AGGACTCGGC CCCACGCAGG ATGACCTTAC AAAGGAGACT GTTGCGGAAG TTTTGGGGAA AAAGCTTGTT TTACACGAAG AAAGCCTTGA GAGGATTAAA ACTTTTTTTA CCAGGATAAA CCGAAAAATG ACGGACAATA ATGTCAAGCA GGCATATCTT CCGGAAGGGT GCACAGTGGT TGAGAATAAC AGCGGCACAG CCCCGGGCTG CATTATTGAA GATAAGGGAA AGATTGTGGT AATGCTTCCC GGTCCTCCGC CGGAGATGAT GCCGATGCTT GATGATACCG TTATTCCTTA CCTTGCGGAA AAATCAGGAT ACAGGATAGT GTCAAAATAT CTGAGGGTTT TTGGAATAGG GGAATCACAG CTTGAAGAGA TGATTATGGA TTTAGTTGAC AAACAGGACA GGGTCACCAT AGCAACTTAT GCAAAAGACG GGCAGGTGAC CGTAAGACTT ACCACAAAAG CCAGGACAGA GGAAGAAGGC TTTCGTGAAA TACTTCCTTT GCAAAATGAG ATAGCTTCAA GACTCAAAGA GGCATTATAC AGTACGGAAG ATGAAGAGCT GGAATATGTG GCGGCAAAGA TGCTTATTGA CAACAACATT ACAATAGCAA CTGCCGAATC TTGTACCGGT GGGCTGATTT CAGCAAGGCT TACCGATGTG CCCGGAATAT CAAAGGTTTT TAACAGAGGT ATTGTATCTT ACAGCAATGA AGCCAAGATG GAAAACCTCG GGGTTAAGCC TGAGACTTTG GAAAAGTACG GTGCCGTAAG CAGCCGGACT GCAATGGAGA TGGCTGAAGG TGTAAGGAAA ATCGCCTCAA CTGATATAGG GCTGGCGGTT ACAGGTATTG CAGGTCCTGA CGGAGGCACT GATGAAAAAC CGGTGGGATT GGTTTATGTT GCCCTGGCCC ATAGCCTGGG GACGGAGGTA AGGGAACTTA GGCTTGCCGG GAACAGAAAC AGAATAAGAA ACCTTACAGT GCTTAATGCT TTTGACATGG TAAGAAGATA TGTAATGAAG CTGAAAGGGT AA
|
Protein sequence | MNAEILAVGT ELLMGQIANT NAQYISKRLN DIGVNVYYHS VVGDNSVRLK KCLLAALERC DLVIMTGGLG PTQDDLTKET VAEVLGKKLV LHEESLERIK TFFTRINRKM TDNNVKQAYL PEGCTVVENN SGTAPGCIIE DKGKIVVMLP GPPPEMMPML DDTVIPYLAE KSGYRIVSKY LRVFGIGESQ LEEMIMDLVD KQDRVTIATY AKDGQVTVRL TTKARTEEEG FREILPLQNE IASRLKEALY STEDEELEYV AAKMLIDNNI TIATAESCTG GLISARLTDV PGISKVFNRG IVSYSNEAKM ENLGVKPETL EKYGAVSSRT AMEMAEGVRK IASTDIGLAV TGIAGPDGGT DEKPVGLVYV ALAHSLGTEV RELRLAGNRN RIRNLTVLNA FDMVRRYVMK LKG
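The relationship between the gene and protein sequences can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. The listed gene sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation already, even though the gene lies on the minus strand; the sketch therefore does not reverse-complement it. Translation table 11 is assumed to share the standard amino acid assignments, differing only in its permitted start codons, which does not matter here because the gene starts with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAATGCGG AGATATTAGC GGTTGGAACC"
print(translate(prefix))  # MNAEILAVGT
```

The same functions can be applied to the full gene sequence from the record to compare the result with the listed protein sequence and GC content.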
|
| |