Gene Information |
Locus tag | Cthe_0594 |
Symbol | |
ID | 4808196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 730556 |
End bp | 731776 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106008 |
Product | transposase, mutator type |
Protein accession | YP_001037022 |
Protein GI | 125973112 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3328] Transposase and inactivated derivatives |
TIGRFAM ID | |
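
The coordinate and length fields above are linked by simple arithmetic. With 1-based inclusive coordinates, the gene length is End bp minus Start bp plus one, and the protein length is the number of codons minus the stop codon. The Python sketch below checks this. The function names are hypothetical and not part of the database, and the sketch assumes an unspliced CDS with a single terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Cthe_0594.
length_bp = gene_length(730556, 731776)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both results agree with the Gene Length and Protein Length fields of this record.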
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTACTA ATAATAGAAT GGCACTTTTA GAACAACTTA GCAAGTATGT TGTTGAAAAA GATAAAGATT TTTTAAAAGA AGCATTAACA TTACTCATTA ATGCCCTAAT GGATGCGGAA GTTACATCAA TAATAGGTGC TGAAAAGTAT GAAAGAAATA ATAATAGAAA CAACTATCGC AATGGATATC GTCTAAGAGA ATGGGATACT CGAGTAGGAA CATTACAGTT AAGCATTCCC AAGTTACGTC ACGGAAGTTA TTTTCCAAGT CTTTTAGAAC CGAGGAAAAT GTCAGAGAAA GCATTATTGA ATGTAGTTCA GGAAGCCTAT GTTCATGGAG TAAGTACCAG GAAGGTGGAT GAACTTGTAG AAGCTCTTGG AATGAAAGGG ATTGATAAAA GCGAAGTATC AAGAATCAGT AAGCAACTGG ATGAATTTGT AGAAGAATTT AAAAACCGTA GACTGGAAGG AGAATATCCT TACCTTTGGC TTGATGCCAC TTTCCCCAAG GTTCGGGAAG GAGGCAGGGT ATGCAGTATG GCACTAGTTA TAGCAGTAGG AGTTAATCAA CAAGGTGAAC GGGAAATATT AGGTTTTGAT GTAGGGATGA GTGAAGACGG GGCTTTTTGG GAGGAGTTTT TAAGAAGGCT GGTAGCAAGG GGTCTAAAAG GTGTAAGGCT TGTAATCAGT GATGCACATG AAGGGCTGAA GGCTGCAATA AAGAAGATTT TAACGGGAAG TGCATGGCAA AGATGCCGTG TACATTTTAT GAGAAACGTA TTAAGCCAGG TACCAAAGCA TTATCAGGGA ATGGTATCAT CGATAATACG GACAATATTT GCCCAGAATG ATCAGGAATC TGCGAGGGAA CAGTTAAGGC ATGTAGTAGA TGAGCTTAAA AATCGTTTTC CAAAAGCAAT GAAAATTCTT GAAGAAGCAG AAGAAGAAAT CCTGGCATAT ATGGCTTTTC CCCGTGAGCA TTGGGCACAG ATACACTCCA CCAATCCTCT TGAGAGACTT AACCGGGAAA TTCGCCGTCG AACGGATGTT GTTTGCATAT TTCCAAATCG TGAGGCGGTA ATCCGATTGG TAGGAGCAAT GCTCATGGAA CAAAATGATG AATGGAAAGT AGGGCGGCGC TATTTCAGTC TGGAATCAAT GTCAAAGATT ACATCGATAA ATGAATTTAC ATTGACACCA GTAGCTTTAT TACATAAATG A
|
Protein sequence | MATNNRMALL EQLSKYVVEK DKDFLKEALT LLINALMDAE VTSIIGAEKY ERNNNRNNYR NGYRLREWDT RVGTLQLSIP KLRHGSYFPS LLEPRKMSEK ALLNVVQEAY VHGVSTRKVD ELVEALGMKG IDKSEVSRIS KQLDEFVEEF KNRRLEGEYP YLWLDATFPK VREGGRVCSM ALVIAVGVNQ QGEREILGFD VGMSEDGAFW EEFLRRLVAR GLKGVRLVIS DAHEGLKAAI KKILTGSAWQ RCRVHFMRNV LSQVPKHYQG MVSSIIRTIF AQNDQESARE QLRHVVDELK NRFPKAMKIL EEAEEEILAY MAFPREHWAQ IHSTNPLERL NREIRRRTDV VCIFPNREAV IRLVGAMLME QNDEWKVGRR YFSLESMSKI TSINEFTLTP VALLHK
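
The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several alternative start codons, and an initiating codon is read as methionine. The Python sketch below shows how the protein could be derived from the gene sequence under that table. The helper name `translate_cds` is hypothetical. The sketch assumes a complete, unspliced CDS on the forward strand containing only A, C, G and T. It is not the pipeline that produced this record.

```python
from itertools import product

# Amino acids for table 11, with codons ordered T, C, A, G at each position.
# The assignments match the standard code; table 11 differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiating codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate_cds("GTGGCTACTA ATAATAGAAT G"))  # MATNNRM
```

Passing the full gene sequence instead of the seven-codon prefix gives a string that can be compared with the protein sequence once the spaces between its 10-residue blocks are removed.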